Enable 64-bit vectorization #8452
Conversation
Sweet!
Awesome! cc @JuliaBackports I wonder if this is related to the performance difference reported in JuliaLang/LinearAlgebra.jl#141?
Amazing! |
#endif
};
Type* jl_array_llvmt =
    StructType::create(getGlobalContext(),
Should be jl_LLVMContext.
Thanks. In earlier lines, I see jl_LLVMContext is used sometimes, and getGlobalContext is used other times. What's the difference? Or is it just house-cleaning that hasn't been done yet?
Currently there's no functional difference because we only use one context. I imagine with the threading work progressing we may at some point have more than one, so jl_LLVMContext is preferred.
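For concreteness, here is a minimal sketch (not the actual patch) of what the suggested change amounts to: creating the two-field array struct type against jl_LLVMContext instead of the process-wide getGlobalContext(). The helper name and the exact field types are illustrative assumptions.

```cpp
// Illustrative sketch only -- field list and helper name are assumptions.
#include "llvm/IR/DerivedTypes.h"
#include "llvm/IR/LLVMContext.h"
#include "llvm/IR/Type.h"
using namespace llvm;

extern LLVMContext &jl_LLVMContext;  // assumed to be defined once by codegen

static StructType *make_jl_array_llvmt() {
    Type *elems[] = {
        Type::getInt8PtrTy(jl_LLVMContext),  // data pointer
        Type::getInt64Ty(jl_LLVMContext)     // length (64-bit target assumed)
    };
    return StructType::create(jl_LLVMContext, elems, "jl_array_t");
}
```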
🍰
I can't solve the conflict that arises when cherry-picking this PR to the backport branch. If it is wanted, someone with a better understanding of the C++ files needs to do it.
I'll take a look.
Pushed into the backport branch.
Yes, the title is correct despite the innocent appearance of the diff. The patch represents jl_array_t more accurately in LLVM. I implemented only the data and length fields since there was no clear benefit to representing the rest of the fields.

Julia's pointer casting was confusing stride computations in LLVM's vectorizer, so it mistook unit-stride loads/stores as high-cost gather/scatter. I had been wondering why the vectorizer was pricing a vector load at 51 clock cycles! I don't know why the problem did not exist for Float32, but it seemed some LLVM transform was pushing casts around differently when the size of the array element type matched sizeof(jl_value_t*). So maybe the title is wrong for 32-bit targets :-)

Though we could try to get LLVM fixed, I think it's better for the code generator to "say what it means" and generate code closer to what Clang generates, since Clang is a prime client of LLVM.
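To make the "say what it means" point concrete, here is a hypothetical C++ analogue (not taken from the patch or from Clang's output) of the access pattern the patch exposes: an array modeled as a {data, length} struct whose data pointer carries the element type, so a simple loop presents unit-stride loads and stores that the vectorizer can cost cheaply rather than treating them as gather/scatter.

```cpp
#include <cstddef>

// Hypothetical stand-in for the {data, length} view of jl_array_t.
struct jl_array64 {
    double *data;
    size_t  length;
};

// Contiguous, unit-stride accesses: the kind of loop LLVM's loop
// vectorizer handles cheaply once the element type is visible.
void scale(jl_array64 &a, double c) {
    for (size_t i = 0; i < a.length; ++i)
        a.data[i] *= c;
}
```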
With LLVM 3.3, the 64-bit versions of the benchmarks in test/perf/simd ran 1.6x to 3.9x faster for me with the patch than without, using an Intel 4th Generation Core i7 processor ("Haswell").