add detection for zen 5 #56967

simeonschaub · 2025-01-06T12:03:47Z

ref llvm/llvm-project@149a150

simeonschaub · 2025-01-06T12:05:12Z

src/processor_x86.cpp

@@ -236,6 +237,7 @@ constexpr auto znver2 = znver1 | get_feature_masks(clwb, rdpid, wbnoinvd);
 constexpr auto znver3 = znver2 | get_feature_masks(shstk, pku, vaes, vpclmulqdq);
 constexpr auto znver4 = znver3 | get_feature_masks(avx512f, avx512cd, avx512dq, avx512bw, avx512vl, avx512ifma, avx512vbmi,
                                                   avx512vbmi2, avx512vnni, avx512bitalg, avx512vpopcntdq, avx512bf16, gfni, shstk, xsaves);
+constexpr auto znver5 = znver4 | get_feature_masks(avxvnni, movdiri, movdir64b, avx512vp2intersect, /*prefetchi,*/ avxvnni);


I assume prefetchi needs to be added to src/features_x86.h, but I didn't know how

julia/src/features_x86.h

Line 113 in 4750dc2

JL_FEATURE_DEF(avxvnni, 32 * 9 + 4, 120000)

needs to be added here.

Now you need to look in the CPU docs for how prefetchi is encoded.

From the "Processor Programming Reference" https://www.amd.com/content/dam/amd/en/documents/processor-tech-docs/programmer-references/57896.zip

https://github.com/llvm/llvm-project/blob/3edbe36c3eb01d1c35ac1761da108e3a493258ee/clang/lib/Headers/cpuid.h#L220 The bits are here, though you will to add the

// EAX=7,ECX=1: EDX

branch IIUC

Thanks for the hints! What I don't get is where the 32 * 8, 32 * 9 etc. is coming from.

Is this the correct patch or are the 32 * 9 bits incorrect?

diff --git a/src/features_x86.h b/src/features_x86.h index 2ecc8fee32..b817781404 100644 --- a/src/features_x86.h +++ b/src/features_x86.h @@ -113,6 +113,9 @@ JL_FEATURE_DEF(wbnoinvd, 32 * 8 + 9, 0) JL_FEATURE_DEF(avxvnni, 32 * 9 + 4, 120000) JL_FEATURE_DEF(avx512bf16, 32 * 9 + 5, 0) +// EAX=7,ECX=1: EDX +JL_FEATURE_DEF(prefetchi, 32 * 9 + 20, 0) + // EAX=0x14,ECX=0: EBX JL_FEATURE_DEF(ptwrite, 32 * 10 + 4, 0)

I'm implementing it and maybe adding some comments

imciner2 · 2025-01-06T12:17:42Z

Won't we need to wait for #56130 to be merged before we can use Zen5 since that is only in LLVM 19?

simeonschaub · 2025-01-06T12:27:13Z

Yes, to take full advantage of zen 5 features I believe LLVM 19 is needed, but this PR is still an improvement since we now fall back to the znver4 target instead of the generic one

vchuravy · 2025-01-06T14:42:38Z

src/features_x86.h

+JL_FEATURE_DEF(avx512vnniw, 32 * 4 + 2, 0)
+JL_FEATURE_DEF(avx512fmaps, 32 * 4 + 3, 0)
 JL_FEATURE_DEF(uintr, 32 * 4 + 5, 140000)


Isn't the last statement a comment which LLVM version introduced support?

As it turns out those were never implemented :)

add detection for zen 5

77f4abc

ref llvm/llvm-project@149a150

simeonschaub commented Jan 6, 2025

View reviewed changes

gbaraldi force-pushed the sds/znver5 branch from eb9ffab to 75ac0cf Compare January 6, 2025 14:15

vchuravy reviewed Jan 6, 2025

View reviewed changes

Add missing features to features_x86

bdae908

gbaraldi force-pushed the sds/znver5 branch from 58f6a3c to bdae908 Compare January 6, 2025 15:19

Fix typo

7e6579b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add detection for zen 5 #56967

add detection for zen 5 #56967

simeonschaub commented Jan 6, 2025

simeonschaub Jan 6, 2025

vchuravy Jan 6, 2025

gbaraldi Jan 6, 2025

simeonschaub Jan 6, 2025 •

edited

Loading

gbaraldi Jan 6, 2025

imciner2 commented Jan 6, 2025

simeonschaub commented Jan 6, 2025

vchuravy Jan 6, 2025

gbaraldi Jan 6, 2025

add detection for zen 5 #56967

Are you sure you want to change the base?

add detection for zen 5 #56967

Conversation

simeonschaub commented Jan 6, 2025

simeonschaub Jan 6, 2025

Choose a reason for hiding this comment

vchuravy Jan 6, 2025

Choose a reason for hiding this comment

gbaraldi Jan 6, 2025

Choose a reason for hiding this comment

simeonschaub Jan 6, 2025 • edited Loading

Choose a reason for hiding this comment

gbaraldi Jan 6, 2025

Choose a reason for hiding this comment

imciner2 commented Jan 6, 2025

simeonschaub commented Jan 6, 2025

vchuravy Jan 6, 2025

Choose a reason for hiding this comment

gbaraldi Jan 6, 2025

Choose a reason for hiding this comment

simeonschaub Jan 6, 2025 •

edited

Loading