Fix for #6903 - enable vectorization of loops with non-constant lower bound. #6926

ArchRobison · 2014-05-22T22:48:48Z

This improvements enables vectorization of the #6903. It also improves performance of some of my other @simd benchmarks.

I timed 5 different variations of loop lowering, including the normal one that Julia uses, and this is one of three variations that came out the fastest (within measurement noise) for aforementioned benchmarks, and did significantly better than using Julia's normal loop-lowering scheme.

When we get to LLVM 3.5, let's try using Julia's usual loop lowering. LLVM 3.4's vectorizer seemed to do much better than LLVM 3.3 with the usual lowering.

The change avoids creating multiple induction variables, which seem to thwart LLVM 3.3's vectorizer (but not LLVM 3.4).

Fix for #6903 - enable vectorization of loops with non-constant lower bound.

Fix for "simd does not vectorize some loops JuliaLang#6903".

587f409

The change avoids creating multiple induction variables, which seem to thwart LLVM 3.3's vectorizer (but not LLVM 3.4).

JeffBezanson added a commit that referenced this pull request May 22, 2014

Merge pull request #6926 from ArchRobison/adr/simdvar

8ec8346

Fix for #6903 - enable vectorization of loops with non-constant lower bound.

JeffBezanson merged commit 8ec8346 into JuliaLang:master May 22, 2014

simonster mentioned this pull request May 23, 2014

Use atsign-simd for sum #6928

Merged

ArchRobison deleted the adr/simdvar branch December 9, 2014 22:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix for #6903 - enable vectorization of loops with non-constant lower bound. #6926

Fix for #6903 - enable vectorization of loops with non-constant lower bound. #6926

ArchRobison commented May 22, 2014

Fix for #6903 - enable vectorization of loops with non-constant lower bound. #6926

Fix for #6903 - enable vectorization of loops with non-constant lower bound. #6926

Conversation

ArchRobison commented May 22, 2014