JIT Optimization: Merge multiple `inc` into one `add` #65025

deeprobin · 2022-02-08T21:39:43Z

https://sharplab.io/#v2:EYLgxg9gTgpgtADwGwBYA0AXEBLANgHwAEAmARgFgAoQgZgAIS6BhOgbyrs4fuwDsM6AQQAUfAQEMAlGw5c54gNQKA3LLmdFKtesIB2OuNWU5AX23badMXQBCo/gentj6jXQUBeOsSOvOeg18uM0oTIA

The codegen of B is better (fewer CPU cycles).

    public int A(int a) {
        a++; // inc edx
        a++; // inc edx
        return a;
    }
    
    public int B(int a) {
        a += 2; // add edx, 2
        return a;
    }

Expected result

    - inc edx
    - inc edx
    + add edx, 2
      mov eax, edx
      ret

ghost · 2022-02-08T21:39:47Z

Tagging subscribers to this area: @JulieLeeMSFT
See info in area-owners.md if you want to be subscribed.


danmoseley · 2022-02-09T00:44:45Z

This does not seem to be what C++ compilers do: https://godbolt.org/z/r1f7Pxdfa
Curious to know why.

omariom · 2022-02-09T01:57:51Z

@danmoseley
With optimization flags https://godbolt.org/z/MPcor1xEP

danmoseley · 2022-02-09T04:06:33Z

@omariom thanks, I thought that was default.

deeprobin · 2022-02-09T05:43:57Z

Same for `dec`

https://sharplab.io/#v2:EYLgxg9gTgpgtADwGwBYA0AXEBDAzgWwB8ABAJgEYBYAKGIGYACMhgYQYG8aHunGBLAHYYGAQQAUg4dgCUHLjwXY4cANzyF3JavUbiAdgbY11BQF8dO+g0kMAQhKGHZnExs0M4AXgaljb7vqGfjzm1KZAA==

Expected result

    - dec edx
    - dec edx
    + add edx, 0xfffffffe
      mov eax, edx
      ret

And `inc`+`dec` elimination would be nice

https://sharplab.io/#v2:EYLgxg9gTgpgtADwGwBYA0AXEBDAzgWwB8ABAJgEYBYAKGIGYACMhgYQYG8aHunGBLAHYYGAQQAUg4dgCUHLjwXYA1EoDc8hd2xw466pp7K1GzcQDsDbHoUBfEyfoNJDAEIShl2Z30HLK677mlgHcdtQ2QA=

Expected result

    - inc edx
    - dec edx
      inc edx
      mov eax, edx
      ret

dubiousconst282 · 2022-02-09T19:26:55Z

IMO this should be generalized to all operators.

Interestingly, copying to temporary variables will result in optimal code, so maybe this is related to SSA/constant propagation:

    public int M(int x) {
        x *= 7;
        x *= 7;
        return x;
    }

    public int M2(int x) {
        int t1 = x * 7;
        int t2 = t1 * 7;
        return t2;
    }

    C.M(Int32)
        L0000: imul edx, 7
        L0003: imul edx, 7
        L0006: mov eax, edx
        L0008: ret

    C.M2(Int32)
        L0000: imul eax, edx, 0x31
        L0003: ret

Sharplab
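
For reference, a small check of why folding the two multiplies is safe: `int` multiplication in an unchecked context wraps modulo 2^32, so `(x * 7) * 7` and `x * 49` agree even when the intermediate result overflows. The class and method names here (`FoldCheck`, `Folded`, `AgreeOnSamples`) are made up for illustration and are not part of the JIT or of the SharpLab snippet.

```csharp
using System;

public static class FoldCheck
{
    // Two dependent multiplies, mirroring M above.
    public static int M(int x)
    {
        unchecked
        {
            x *= 7;
            x *= 7;
            return x;
        }
    }

    // The folded form, mirroring the single "imul eax, edx, 0x31" emitted for M2.
    public static int Folded(int x) => unchecked(x * 49);

    // Returns true when both forms agree on every sampled input,
    // including inputs whose intermediate product overflows.
    public static bool AgreeOnSamples()
    {
        int[] samples = { 0, 1, -1, 12345, 0x12345678, int.MaxValue, int.MinValue };
        foreach (int x in samples)
        {
            if (M(x) != Folded(x))
            {
                return false;
            }
        }
        return true;
    }
}
```

The same argument would not carry over to a `checked` context, where the first multiply can throw before the second one runs.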

JulieLeeMSFT · 2022-02-09T21:21:21Z

cc @dotnet/jit-contrib

AndyAyersMS · 2022-02-09T21:32:35Z

Note forward sub (#63720) does not handle cases where a local has multiple definitions and uses, even if each definition has just one use. It might not be too hard to extend it to cover these cases.

E.g., we could run it again once SSA is built and track the number of uses of each SSA def.
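
A minimal sketch of that idea, on a hypothetical toy IR rather than the JIT's real data structures: every statement defines one SSA name as "source plus constant", uses are counted per SSA def, and a def with exactly one use is substituted into its consumer with the constants folded. `AddDef` and `ForwardSubSketch` are invented for illustration, and this is not the implementation from #63720.

```csharp
using System.Collections.Generic;

// Hypothetical toy IR: one SSA definition of the form "Name = Source + Addend".
// Source is either an earlier SSA name or a parameter.
public sealed record AddDef(string Name, string Source, int Addend);

public static class ForwardSubSketch
{
    // defs must be listed in definition order.
    // returned is the SSA name the method returns; it counts as one use.
    public static List<AddDef> Run(IReadOnlyList<AddDef> defs, string returned)
    {
        // Count how many times each name is read.
        var useCount = new Dictionary<string, int>();
        foreach (var d in defs)
        {
            useCount.TryGetValue(d.Source, out int n);
            useCount[d.Source] = n + 1;
        }
        useCount.TryGetValue(returned, out int r);
        useCount[returned] = r + 1;

        var live = new Dictionary<string, AddDef>(); // defs still present in the output
        var result = new List<AddDef>();
        foreach (var d in defs)
        {
            var current = d;
            // If the source is an earlier def whose only use is this statement,
            // substitute it forward and fold the two constants (wrapping on overflow).
            if (live.TryGetValue(d.Source, out var src) && useCount[d.Source] == 1)
            {
                result.Remove(src);
                current = d with
                {
                    Source = src.Source,
                    Addend = unchecked(src.Addend + d.Addend)
                };
            }
            live[current.Name] = current;
            result.Add(current);
        }
        return result;
    }
}
```

With `a1 = a0 + 1; a2 = a1 + 1; return a2` as input, the sketch leaves the single definition `a2 = a0 + 2`. For the `inc`+`dec` pair it leaves `a2 = a0 + 0`, which a later simplification step would still have to remove. A def with two or more uses is left alone.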

dotnet-issue-labeler bot added the `area-CodeGen-coreclr` and `untriaged` labels Feb 8, 2022

JulieLeeMSFT removed the `untriaged` label Feb 9, 2022

JulieLeeMSFT added this to the Future milestone Feb 9, 2022

jakobbotsch mentioned this issue Dec 11, 2022

JIT: Add a pass of early liveness and use it for forward sub and last-use copy elision for implicit byrefs #79346 (merged)

ghost added the `in-pr` label Dec 11, 2022

jakobbotsch closed this as completed in #79346 Jan 11, 2023

jakobbotsch closed this as completed in db717e3 Jan 11, 2023

ghost removed the `in-pr` label Jan 11, 2023

ghost locked as resolved and limited conversation to collaborators Feb 10, 2023
