Fixes for AVXVNNI instruction set - Clang Compiler #11027

Srihari-mcw · 2024-12-31T13:00:33Z

This PR fixes issues with the AVX_VNNI instruction set when building with the Clang compiler. The changes were built with multiple compilers, and all builds succeeded.
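
For readers unfamiliar with AVX-VNNI, here is a minimal sketch of the kind of code path involved. It is not the diff from this PR: the helper name `dot_u8_i8_32` is hypothetical, and the choice of the VEX-encoded intrinsic `_mm256_dpbusd_avx_epi32` behind an `__AVXVNNI__` guard is an assumption made for illustration. When the macro is not defined (for example, when building without `-mavxvnni`), the sketch falls back to a scalar loop.

```cpp
#include <cstdint>

#if defined(__AVXVNNI__)
#include <immintrin.h>
#endif

// Hypothetical helper, not taken from the PR: dot product of 32 unsigned
// bytes (a) with 32 signed bytes (b), accumulated into a 32-bit integer.
inline std::int32_t dot_u8_i8_32(const std::uint8_t *a, const std::int8_t *b) {
#if defined(__AVXVNNI__)
    const __m256i va = _mm256_loadu_si256(reinterpret_cast<const __m256i *>(a));
    const __m256i vb = _mm256_loadu_si256(reinterpret_cast<const __m256i *>(b));
    // VEX-encoded AVX-VNNI form: each 32-bit lane accumulates four u8*s8 products.
    const __m256i acc = _mm256_dpbusd_avx_epi32(_mm256_setzero_si256(), va, vb);
    std::int32_t lanes[8];
    _mm256_storeu_si256(reinterpret_cast<__m256i *>(lanes), acc);
    std::int32_t sum = 0;
    for (int i = 0; i < 8; ++i) {
        sum += lanes[i];  // horizontal sum of the eight lanes
    }
    return sum;
#else
    // Scalar fallback with the same result for these input ranges.
    std::int32_t sum = 0;
    for (int i = 0; i < 32; ++i) {
        sum += static_cast<std::int32_t>(a[i]) * static_cast<std::int32_t>(b[i]);
    }
    return sum;
#endif
}
```

Both branches return the same value for any 32-byte inputs, since the per-lane sums cannot overflow 32 bits, so the guard only affects which instructions are emitted.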

Error seen :

Performance was tested before and after the changes and found to be similar (tested on Linux with GCC 12.3):

| model | size | params | backend | threads | test | t/s | speedup | Commit id |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| llama 7B Q4_0 | 3.56 GiB | 6.74 B | CPU | 14 | pp 512 | 52.69 ± 0.15 | 0.02% | 2a4e792 |
| llama 7B Q4_0 | 3.56 GiB | 6.74 B | CPU | 14 | pp 512 | 52.68 ± 0.19 | | 7909e858 |
| llama 7B Q4_0 | 3.56 GiB | 6.74 B | CPU | 14 | tg 128 | 19.91 ± 0.25 | 0.55% | 2a4e792 |
| llama 7B Q4_0 | 3.56 GiB | 6.74 B | CPU | 14 | tg 128 | 19.80 ± 0.46 | | 7909e858 |
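
The speedup column appears to be the relative change in t/s of commit 2a4e792 over 7909e858; that reading is an assumption, since the formula is not stated here. A minimal sketch of the arithmetic, with a hypothetical helper name:

```cpp
// Hypothetical helper: relative change in tokens per second, in percent.
// Assumes speedup = (new - base) / base, which is not stated in the PR.
inline double speedup_percent(double new_tps, double base_tps) {
    return (new_tps - base_tps) / base_tps * 100.0;
}
```

With the rounded table values, `speedup_percent(52.69, 52.68)` is roughly 0.02 and `speedup_percent(19.91, 19.80)` is roughly 0.56. The second differs slightly from the 0.55% reported, presumably because the table was computed from unrounded measurements. Both differences are well within the reported ± ranges.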

Perplexity was tested over 32 chunks and was the same for the Q4_0 model before and after the changes: 5.4993 ± 0.13676

Model: Meta Llama 2 7B - https://huggingface.co/meta-llama/Llama-2-7b

slaren · 2024-12-31T13:43:13Z

Thanks, this also fixes AVX VNNI with MSVC, so I have enabled it for MSVC as well.

ggml/src/ggml-cpu/ggml-cpu-aarch64.cpp

ggml/src/ggml-cpu/ggml-cpu-quants.c

ggml/src/ggml-cpu/llamafile/sgemm.cpp

Fixes for clang AVX VNNI

2a4e792

github-actions bot added the ggml label (changes relating to the ggml tensor library for machine learning) Dec 31, 2024

Srihari-mcw mentioned this pull request Dec 31, 2024

Make updates to fix issues with clang-cl builds while using AVX512 flags #10314

Merged

4 tasks

slaren added 2 commits December 31, 2024 14:40

Merge remote-tracking branch 'origin/master' into clang_avxvnni_branch

75be008

enable AVX VNNI and alder lake build for MSVC

9ad89bc

slaren approved these changes Dec 31, 2024

slaren reviewed Dec 31, 2024

ggml/src/ggml-cpu/ggml-cpu-aarch64.cpp (outdated, resolved)

ggml/src/ggml-cpu/ggml-cpu-quants.c (outdated, resolved)

ggml/src/ggml-cpu/llamafile/sgemm.cpp (outdated, resolved)

Apply suggestions from code review

1c8ba92

slaren merged commit 0827b2c into ggerganov:master Dec 31, 2024
48 checks passed
