Floating point add gives different results than compiler-rt on arm-unknown-linux-gnueabi #90

mattico · 2016-10-03T19:41:52Z

I'm still trying to figure out what was wrong with #48, and I'd like another pair of eyes on this because I have no idea what is going on.

I added an implementation of arbitrary_float! again and ran the tests. Only the arm-unknown-linux-gnueabi target failed. So I grabbed the arguments that made the test fail and put them into a regular test. On the next build, that test passed because we're actually getting the right answer in this case. It's gcc_s and compiler-rt that have the wrong answer.

Either:

- There's some subtle bug in the test infrastructure
- gcc_s and libcompiler-rt both have the same bug but we don't despite the code being almost identical to compiler-rt
- QEMU is broken but only in this specific way
- The C compiler's optimizations cause rounding in very small numbers, but rustc does not
- Something else?

alexcrichton · 2016-10-03T22:12:24Z

Oh that's actually a really good point that we should always compile compiler-rt with optimizations. Right now compiler-rt is conditionally optimized, but gcc_s is pulled in from the system so I think that's always optimized.

japaric · 2016-10-04T00:20:20Z

> The C compiler's optimizations cause rounding in very small numbers, but rustc does not

Do the errors go away if you test with optimizations enabled (cargo test --release)?

Are these roundings, -ffast-math optimizations? Who's doing the rounding (less precision) C or Rust?

> we should always compile compiler-rt with optimizations.

This should be a simple change in the build.rs via gcc::Config (I expect)

But ... enabling optimizations shouldn't change the results of the compiler-rt intrinsics, right? That doesn't sound quite right. You should get the same output regardless of the optimization level.

mattico · 2016-10-05T16:19:47Z

> Do the errors go away if you test with optimizations enabled (cargo test --release)?
> Are these roundings, -ffast-math optimizations? Who's doing the rounding (less precision) C or Rust?

No, release builds don't change our results. Our answer agrees with Wolfram Alpha, which uses arbitrary-precision arithmetic, so it seems like compiler-rt and gcc_s have the wrong answer in this case.

I feel like I was a bit unclear about this so here is a table:

Answers for 1.1540496e-38 + -3.59621e-39 on arm-unknown-linux-gnueabi

| Method | Answer |
| --- | --- |
| Wolfram Alpha | 7.944286e-39 |
| Rust Constant Folding | 7.944286e-39 |
| rustc-builtins | 7.944286e-39 |
| gcc_s | 4.133629e-39 |
| compiler-rt | 4.133629e-39 |

The check! test gives answers for rustc-builtins, gcc_s, and compiler-rt and the #[test] test is probably using constant folding.
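For anyone who wants to poke at these two operands on their own machine, here is a minimal sketch in plain Rust. It uses the native `+` operator and not any of the intrinsics from the table, so it only shows what the hardware (or whatever the target lowers `+` to) produces. The helper names `add_at_runtime` and `is_subnormal` are made up for this example. Both operands and the expected sum fall below f32's smallest normal value, so the sketch also prints the raw bits and the float category.

```rust
use std::num::FpCategory;

/// The two operands from the table above, as f32 values.
const A: f32 = 1.1540496e-38;
const B: f32 = -3.59621e-39;

/// Adds the operands at run time. `black_box` is only a hint that
/// discourages constant folding; it is not a guarantee.
fn add_at_runtime(a: f32, b: f32) -> f32 {
    std::hint::black_box(a) + std::hint::black_box(b)
}

/// True if `x` is a subnormal (denormal) f32.
fn is_subnormal(x: f32) -> bool {
    x.classify() == FpCategory::Subnormal
}

fn main() {
    let sum = add_at_runtime(A, B);
    for (name, x) in [("a", A), ("b", B), ("a + b", sum)] {
        println!(
            "{:>5} = {:e}  bits = {:#010x}  subnormal = {}",
            name,
            x,
            x.to_bits(),
            is_subnormal(x)
        );
    }
}
```

Since all three values are subnormal, each one is an integer multiple of the smallest f32 step, which means the sum is exactly representable and should not depend on the rounding mode.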

I think it's still possible that gcc is less strict about reordering or combining float operations than rust/llvm, but this page makes it look like that only happens with non-standard flags so perhaps that is not the case.

I'm going to start looking at the disassembly since it doesn't seem like there's a simple answer.

mattico · 2016-10-05T17:49:47Z

Hmm. The Rust disassembly is much longer and uses many more registers than the C version. I was expecting them to be a bit more similar. I guess it's time to install IDA.

mattico · 2016-10-05T19:09:01Z

Compiling compiler-rt with clang gives an assembly output that is much more similar to the rust version. I'm going to see if it gives the same results.

japaric · 2016-10-05T19:30:59Z

> Compiling compiler-rt with clang gives an assembly output that is much more similar to the rust version.

That makes sense given that both clang and rust are based on LLVM.

This sounds like gcc has different defaults around rounding compared to LLVM.

> this page makes it look like that only happens with non-standard flags so perhaps that is not the case.

Perhaps try compiling compiler-rt using the "full IEEE 754 compliance" flags: -frounding-math -fsignaling-nans. It seems that gcc defaults to not being fully compliant with IEEE 754. I wonder if LLVM is fully compliant with IEEE 754 by default; that could be the difference.

mattico · 2016-10-05T19:41:10Z

Adding the flags didn't change anything.

mattico · 2016-10-05T20:08:59Z

I used a little C program to test the compiler-rt version of __addsf3 when compiled with different compilers, but all of them give 7.944286e-39.

japaric · 2016-10-05T21:40:14Z

> Adding the flags didn't change anything.
> I used a little C program to test the compiler-rt version of __addsf3 when compiled with different compilers, but all of them give 7.944286e-39.

😕

So, uhmm, calling the same function from C vs from Rust produces different output? Could this be a calling convention problem? Seems unlikely though as you only get different outputs (between C and Rust) for a few inputs. The only other thing I can think of is side effects: it's possible to change the rounding mode using fesetround; perhaps some pre-main/initialization code is setting that in different ways for C vs for Rust.

mattico · 2016-10-07T04:54:33Z

@japaric Yeah I agree it's a bit strange... I'm still investigating.

Here's a rough summary of what I did (from memory):

$ arm-linux-gnueabi-gcc -O2 -fno-builtin -c addsf3.c # from compiler-rt
$ cat test_arm.c
#include <stdio.h>
extern float __addsf3(float, float);
int main() {
    printf("%e\n", __addsf3(1.1540496e-38, -3.59621e-39));
    return 0;
}
$ arm-linux-gnueabi-gcc -fno-builtin test_arm.c addsf3.o -o test_arm_gcc
$ ./test_arm_gcc # runs in qemu...

And I did the equivalent for clang as well.

I'll look into the crt initialization stuff to see if they mess with floating point rounding.

I guess I should also make an equivalent rust program, which should tell us if the error only happens inside our check! tests for whatever reason.

mattico · 2016-11-11T05:55:24Z

rust-lang/compiler-rt#26 (comment)

mattico · 2016-11-12T20:22:36Z

I'm currently looking into two possible theories:

1. japaric suggested a while ago that this is caused by undefined behaviour in compiler-rt (and gcc_s presumably) in floatsisf. I ran the test with the updated compiler-rt which is supposed to fix this issue and nothing changed, but I may have forgotten something. I'm in the process of porting floatsisf anyway.
2. "powi only: don't override arm calling convention" (compiler-rt#26 (comment), linked above) has to do with ARM calling conventions, so that possibly applies here. There may be some other calling convention issue as well.

parched · 2016-11-12T21:00:48Z

@mattico I don't think 2. will be the issue as that only affects hard float.

ithinuel · 2016-11-27T00:27:22Z

I don't know if it may be related to this, but I noticed that when the test starts running, QEMU complains about qemu: Unsupported syscall: 384.

japaric · 2016-11-27T00:47:18Z

@ithinuel I see that all the time when running Rust programs compiled for ARM under QEMU. Never bothered investigating but it hasn't been an issue with programs that don't do (too many) floating point operations.

Amanieu · 2016-11-27T00:48:52Z

That's the getrandom system call, but having it unsupported isn't a big problem since the code just falls back to /dev/urandom.

japaric · 2016-11-29T20:59:34Z

So, using a recent nightly, I tested changing the intrinsics that involve floating point arguments/return values from extern "C" to extern "aapcs" but that didn't fix these tests; I still see the same errors on both gnueabi and gnueabihf.

Amanieu · 2016-11-29T21:04:44Z

@japaric Did you change the ABI for the extern block in compiler-builtins/compiler-rt/compiler-rt-cdylib/src/lib.rs? On ARM, all of the intrinsics use the aapcs soft-float calling convention.

japaric · 2016-11-29T21:26:33Z

@Amanieu No, I didn't because those functions get immediately turned into usizes so I don't think the CC they are declared with matters.
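To make that reasoning concrete, here is a rough sketch of what "turned into a usize" means for the calling convention. It is not the actual test harness code: `fake_addsf3`, `erase`, and `call_erased` are hypothetical names used only for illustration. Once a function is cast to an integer address, the ABI it was declared with is no longer part of the type; the ABI used for the call is whatever function pointer type the address is converted back to.

```rust
use std::mem;

/// Stand-in for an intrinsic such as `__addsf3`. The name and body are
/// hypothetical and exist only for this illustration.
extern "C" fn fake_addsf3(a: f32, b: f32) -> f32 {
    a + b
}

/// Erases the function's type, including its declared ABI, by casting it
/// to an integer address.
fn erase() -> usize {
    fake_addsf3 as usize
}

/// Calls the address as an `extern "C"` function. The ABI used for the
/// call comes from this pointer type, so it has to match the real function.
fn call_erased(addr: usize, a: f32, b: f32) -> f32 {
    // SAFETY: `addr` was produced from an `extern "C" fn(f32, f32) -> f32`.
    let f: extern "C" fn(f32, f32) -> f32 = unsafe { mem::transmute(addr) };
    f(a, b)
}

fn main() {
    let addr = erase();
    println!("{}", call_erased(addr, 1.5, 2.25));
}
```

Under that assumption, the place where a wrong calling convention could bite is the pointer type used at the call site, not the declaration in the extern block.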

> On ARM, all of the intrinsics use the aapcs soft-float calling convention.

I only tested changing the calling convention of addsf3 and adddf3 as those are the only intrinsics that are causing problems. Here are my changes: https://gist.github.com/japaric/13755ba481a6cd1199fd07f2f32d8c1e

Amanieu · 2016-11-29T21:55:25Z

@japaric This line should use "aapcs" instead of "C":

-            fn $name($f: extern fn($($farg),*) -> $fret,
+            fn $name($f: extern "C" fn($($farg),*) -> $fret,

EDIT: ah nevermind, I see you duplicated the whole thing just for aapcs

mattico changed the title ~~Floating point add gives different results when called inside check! macro~~ Floating point add gives different results than compiler-rt on arm-unknown-linux-gnueabi Oct 5, 2016

japaric mentioned this issue Oct 6, 2016

switch to a current nightly #93

Merged

mattico mentioned this issue Oct 16, 2016

Implement floatunsisf (WIP) #106

Closed

japaric mentioned this issue Oct 16, 2016

Add float quickcheck #111

Merged

mattico mentioned this issue Nov 12, 2016

__aeabi_ functions should use the AAPCS calling convention, at least with LLVM 4.0+ #116

Closed

ithinuel added a commit to ithinuel/compiler-builtins that referenced this issue Nov 27, 2016

Same issue as rust-lang#90 except this is limited to gnueabihf (hard …

e416382

…float)

ithinuel mentioned this issue Nov 27, 2016

compiler_rt and gcc_s returns odd result on arm-unknown-linux-gnueabihf and armv7-unknown-linux-gnueabihf ithinuel/compiler-builtins#1

Closed

japaric mentioned this issue Apr 10, 2017

no-std friendly test suite #155

Merged

japaric closed this as completed in #155 Apr 11, 2017
