Infinite loop optimized out by inlining? #42009

zackw · 2017-05-15T15:06:33Z

This crate

fn noret() -> ! { loop { } }
pub fn test_noret() -> () { noret() }

when compiled without optimization (rustc --crate-type rlib --emit asm), produces

_ZN4test5noret17hecd4eea31d87a48aE:
        .cfi_startproc
        jmp     .LBB0_1
.LBB0_1:
        jmp     .LBB0_1
.Lfunc_end0:
        .size   _ZN4test5noret17hecd4eea31d87a48aE, .Lfunc_end0-_ZN4test5noret17hecd4eea31d87a48aE
        .cfi_endproc
_ZN4test10test_noret17h492e3702bacafd6bE:
        .cfi_startproc
        pushq   %rax
.Lcfi0:
        .cfi_def_cfa_offset 16
        callq   _ZN4test5noret17hecd4eea31d87a48aE
.Lfunc_end1:
        .size   _ZN4test10test_noret17h492e3702bacafd6bE, .Lfunc_end1-_ZN4test10test_noret17h492e3702bacafd6bE
        .cfi_endproc

but when compiled with optimization on (rustc --crate-type rlib --emit asm -O), no assembly instructions at all are emitted for test_noret:

_ZN4test10test_noret17h492e3702bacafd6bE:
        .cfi_startproc
.Lfunc_end0:
        .size   _ZN4test10test_noret17h492e3702bacafd6bE, .Lfunc_end0-_ZN4test10test_noret17h492e3702bacafd6bE
        .cfi_endproc

I expected to get something like

_ZN4test10test_noret17h492e3702bacafd6bE:
        .cfi_startproc
.LBB0_1:
        jmp     .LBB0_1
.Lfunc_end0:
        .size   _ZN4test10test_noret17h492e3702bacafd6bE, .Lfunc_end0-_ZN4test10test_noret17h492e3702bacafd6bE
        .cfi_endproc

Identical behavior observed with rustc 1.17.0 (56124baa9 2017-04-24) and rustc 1.19.0-nightly (386b0b9d3 2017-05-14).

The text was updated successfully, but these errors were encountered:

hanna-kruppe · 2017-05-15T15:11:01Z

#28728

zackw · 2017-05-15T15:11:09Z

This smells like LLVM deducing that test_noret can never be called, but it is my understanding that you are allowed to call a function whose signature is (...) -> !, it just never returns. In any case, under no circumstances should a public function compile to zero assembly instructions; a public function that can genuinely never be called by correct code (e.g. pub fn dont_call_me() -> ! { unreachable!() } should at least produce a trap instruction.

RalfJung · 2017-05-15T16:10:36Z

Well, but this function can be called, so emitting a trap would be wrong.
What is the behavior of a public function with no code? Does it just do nothing, or is this a violation of the ABI?

EDIT: Oh, I guess "no code" means it doesn't return to the caller, but instead "falls through" to whatever is next in the text? That's indeed bad. LLVM is probably responsible for removing the loop, but that doesn't explain why there is no return.

zackw · 2017-05-15T16:18:59Z

@RalfJung From a low-level perspective, yes, it is possible to call the function, but any program that does call it is incorrect. This gets into the argument about the scope of "undefined behavior"...

And yes, a function containing no machine instructions will indeed fall through to whatever is next. LLVM is trying to have cake and eat it too -- it has removed the return instruction because it was unreachable, but it was only unreachable because of the infinite loop, which has also been removed.

Note that If you actually write pub fn dont_call_me() -> ! { unreachable!() }, that works out fine, because unreachable!() compiles to an unconditional call to rust_begin_panic, which doesn't get removed.

RalfJung · 2017-05-15T16:46:28Z

Are we sure LLVM is doing both of these transformations? I think that would be incorrect even in C, and I cannot get clang to do the same: https://godbolt.org/g/tyCz9C
C says you may assume any loop without side-effects terminates, but IIRC that doesn't mean diverging programs are UB.

Couldn't it be be that rustc is already removing/not emitting the return (because it thinks the function will never terminate), and then LLVM removes the loop (because C says you can do that)?

hanna-kruppe · 2017-05-15T16:54:51Z

@RalfJung The direct Rust equivalent of your C code doesn't have the loop eliminated either. In both languages, there is no return left in the IR by the time the optimizer is done with it (and presumably already long before that), for example the clang IR reads:

; Function Attrs: norecurse noreturn nounwind readnone uwtable
define i32 @square()() local_unnamed_addr #0 {
  br label %1

; <label>:1:                                      ; preds = %0, %1
  br label %1
}

RalfJung · 2017-05-15T16:59:27Z

Ah, so it takes that indirection through another function to trigger the bug. Interesting.
And indeed, the same thing then happens in clang: https://godbolt.org/g/1ApTJq
I don't think this is allowed even by C...

hanna-kruppe · 2017-05-15T17:00:11Z

Indeed LLVM's current behavior also violates the C standard, because it doesn't account for C's provisions for loop with cetain conditions being allowed to loop forever.

Couldn't it be be that rustc is already removing/not emitting the return (because it thinks the function will never terminate), and then LLVM removes the loop (because C says you can do that)?

Other parts of LLVM will gladly assume that code after an infinite loop is dead (e.g., simplifycfg will kill blocks not reachable from the entry block), so even if we added redundant instructions after diverging expressions this wouldn't be a robust workaround. In fact I'm wouldn't be surprised if simplifycfg ran at least once before the passes which remove infinite loops, so it may never help.

hanna-kruppe · 2017-05-15T17:01:36Z

Regardless, it seems pretty clear now this is a duplicate, so let's close this and continue discussion in the older issue.

RalfJung · 2017-05-15T17:02:05Z

I disagree. The other issue is not about a bug in LLVM or LLVM violating the C spec. It is about a mismatch between the Rust semantics and the LLVM IR semantics, where the latter permits removing infinite loops under some circumstances and the former does not.

OTOH, this bug here seems to actually be about a bug in LLVM, where LLVM performs optimizations that are not even valid under the LLVM IR semantics.

hanna-kruppe · 2017-05-15T17:24:35Z

We don't track LLVM bugs on this issue tracker, though. We track bugs in Rust (some of which may be ultimately rooted in LLVM bugs).

But aside from that, I would strongly argue that both are the exaxct same LLVM issue — both issues are a matter of no-progress loops getting optimized out, whether you want to describe this as "invalid optimization" or "lacking ability to express no-progress loops in IR". In fact, there's a single bug on the LLVM bugzille for this: https://bugs.llvm.org//show_bug.cgi?id=965 (note the many duplicates).

RalfJung · 2017-05-15T17:51:33Z

We don't track LLVM bugs on this issue tracker, though. We track bugs in Rust (some of which may be ultimately rooted in LLVM bugs).

Sure.

But aside from that, I would strongly argue that both are the exaxct same LLVM issue

I see. My latest information on #28728 was that currently, there's not enough interest from the LLVM side to support languages that do not permit removing infinite side-effect-free loops. Seems I was wrong, and the issue is more severe than that -- they don't even support C properly...

What I am saying is LLVM could fix https://bugs.llvm.org//show_bug.cgi?id=965 by compiling https://godbolt.org/g/1ApTJq to a function that returns immediately. That would be a correct fix for the bug as far as C is concerned, but #28728 would remain unfixed. In contrast, any fix to https://bugs.llvm.org//show_bug.cgi?id=965 will fix this bug at hand.

zackw · 2017-05-15T17:58:02Z

C does not permit the loop to be removed in static int diverge() { while(1){} }. The exact wording of the C standard (quoted in #28728) is "An iteration statement whose controlling expression is not a constant expression [and whose body has no observable side effects] ... may be assumed to terminate." while(1){} has a constant controlling expression and therefore must be preserved. If I understand the discussion at http://lists.llvm.org/pipermail/llvm-dev/2015-July/088095.html correctly, this is a difference from C++, which does allow the implementation to assume that all loops eventually terminate.

RalfJung · 2017-05-15T18:00:50Z

Oh, I see. I was not aware C and C++ differ here.
However, for Rust, even the "lighter" C version of this is unsound -- there is no "free lunch" when it comes to loop termination, as the type system otherwise becomes unsound. C doesn't have an empty type like ! (nor a sound type system, for that matter), so it doesn't really care. It was my understanding that's what #28728 tracks.

But, yeah, considering how LLVM treats this bug and comments like #28728 (comment), I have to agree this is a duplicate.

arielb1 · 2017-05-16T09:28:19Z

@RalfJung

A while COND { FOO } Rust loop can be "lowered" to a

while (true) {
    if(!COND) break;
    FOO
}

loop, which §6.8.5.6 seems to imply to be well-defined even when divergent.

RalfJung · 2017-05-16T11:05:58Z

Ah, nice hack. :) If LLVM implemented the C spec properly, that should indeed be a sound compilation scheme.

Funny enough, this shouldn't even make any difference (in the CFG, after some normalization) compared to lowering the loop the "obvious" way. Which I guess is the reason why LLVM is broken.

alexcrichton · 2017-08-25T16:39:38Z

I'm going to close this in favor of #28728

@zackw

Enable TrapUnreachable in LLVM. This patch enables LLVM's TrapUnreachable flag, which tells it to translate `unreachable` instructions into hardware trap instructions, rather than allowing control flow to "fall through" into whatever code happens to follow it in memory. This follows up on #28728 (comment). For example, for @zackw's testcase [here](#42009 (comment)), the output function contains a `ud2` instead of no code, so it won't "fall through" into whatever happens to be next in memory. (I'm also working on the problem of LLVM optimizing away infinite loops, but the patch here is useful independently.) I tested this patch on a few different codebases, and the code size increase ranged from 0.0% to 0.1%.

zackw mentioned this issue May 15, 2017

LLVM loop optimization can make safe programs crash #28728

Closed

Mark-Simulacrum added the A-LLVM Area: Code generation parts specific to LLVM. Both correctness bugs and optimization-related issues. label Jun 22, 2017

Mark-Simulacrum added the C-enhancement Category: An issue proposing an enhancement or a PR with one. label Jul 27, 2017

alexcrichton closed this as completed Aug 25, 2017

sunfishcode mentioned this issue Nov 10, 2017

Enable TrapUnreachable in LLVM. #45920

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Infinite loop optimized out by inlining? #42009

Infinite loop optimized out by inlining? #42009

zackw commented May 15, 2017

hanna-kruppe commented May 15, 2017

zackw commented May 15, 2017

RalfJung commented May 15, 2017 •

edited

Loading

zackw commented May 15, 2017

RalfJung commented May 15, 2017

hanna-kruppe commented May 15, 2017

RalfJung commented May 15, 2017

hanna-kruppe commented May 15, 2017 •

edited

Loading

hanna-kruppe commented May 15, 2017

RalfJung commented May 15, 2017 •

edited

Loading

hanna-kruppe commented May 15, 2017

RalfJung commented May 15, 2017

zackw commented May 15, 2017

RalfJung commented May 15, 2017 •

edited

Loading

arielb1 commented May 16, 2017

RalfJung commented May 16, 2017

alexcrichton commented Aug 25, 2017

Infinite loop optimized out by inlining? #42009

Infinite loop optimized out by inlining? #42009

Comments

zackw commented May 15, 2017

hanna-kruppe commented May 15, 2017

zackw commented May 15, 2017

RalfJung commented May 15, 2017 • edited Loading

zackw commented May 15, 2017

RalfJung commented May 15, 2017

hanna-kruppe commented May 15, 2017

RalfJung commented May 15, 2017

hanna-kruppe commented May 15, 2017 • edited Loading

hanna-kruppe commented May 15, 2017

RalfJung commented May 15, 2017 • edited Loading

hanna-kruppe commented May 15, 2017

RalfJung commented May 15, 2017

zackw commented May 15, 2017

RalfJung commented May 15, 2017 • edited Loading

arielb1 commented May 16, 2017

RalfJung commented May 16, 2017

alexcrichton commented Aug 25, 2017

RalfJung commented May 15, 2017 •

edited

Loading

hanna-kruppe commented May 15, 2017 •

edited

Loading

RalfJung commented May 15, 2017 •

edited

Loading

RalfJung commented May 15, 2017 •

edited

Loading