1. If an argument has any dependency other than a write, it gets labeled an input, which marks it as read-only, so it comes out unmodified.
2. Writes inside of reductions are not considered when classifying an argument as an output.
3. Everything designated an output gets moved to the end of the signature, so when the compiled kernel is called the arguments are all scrambled.
4. Arguments that are unused somehow get stripped out during compilation, so when the compiled kernel is called the arguments are, again, all scrambled.
So far, I've found that the easiest thing was to just mark everything as an output. Not sure what other implications that has besides passing up optimizations dependent on things being read-only. I think this also still doesn't fix problem 4.
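To make the four problems and the workaround concrete, here is a toy model of the behaviour described above. It is only a sketch of the symptoms, not the actual logic in `kernel_codegen.py`: `Arg`, `classify`, `compiled_signature`, and the `force_output` flag are all hypothetical names invented for illustration.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Arg:
    """Hypothetical summary of how a kernel body uses one argument."""
    name: str
    reads: int = 0              # plain reads in the kernel body
    writes: int = 0             # plain writes in the kernel body
    reduction_writes: int = 0   # writes that happen inside a reduction
    force_output: bool = False  # hypothetical "mark as output" workaround


def classify(arg):
    """Toy rule mimicking the reported behaviour (an assumption, not the real code)."""
    if arg.reads == 0 and arg.writes == 0 and arg.reduction_writes == 0:
        return "unused"   # problem 4: dropped later, even if forced to output
    if arg.force_output:
        return "output"
    # Problem 2: reduction_writes is deliberately ignored below.
    if arg.reads == 0 and arg.writes > 0:
        return "output"
    # Problem 1: any use other than a write makes the argument a read-only input.
    return "input"


def compiled_signature(args):
    """Return argument names in the order the toy 'compiled kernel' expects."""
    kinds = {a.name: classify(a) for a in args}
    kept = [a.name for a in args if kinds[a.name] != "unused"]  # problem 4
    inputs = [n for n in kept if kinds[n] == "input"]
    outputs = [n for n in kept if kinds[n] == "output"]
    return inputs + outputs  # problem 3: outputs are moved to the end


declared = [
    Arg("out", writes=1),            # pure output
    Arg("a", reads=1),               # pure input
    Arg("scratch"),                  # never used
    Arg("acc", reads=1, writes=1),   # read and written: treated as input
    Arg("red", reduction_writes=1),  # written only inside a reduction
]

print(compiled_signature(declared))  # ['a', 'acc', 'red', 'out']

# Workaround: mark everything as an output.
forced = [replace(a, force_output=True) for a in declared]
print(compiled_signature(forced))    # ['out', 'a', 'acc', 'red']
```

In this model, forcing every argument to be an output keeps the declared order for the arguments that survive, but `scratch` is still dropped, which matches the observation that the workaround does not address problem 4.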
The current logic for inputs and outputs has a number of issues:
See iree-turbine/iree/turbine/kernel/compiler/kernel_codegen.py, lines 250 to 289 at commit e4550f3 (amongst other places).