
[pull] master from ggerganov:master #120

Closed
pull[bot] wants to merge 6 commits into master from ggerganov:master

Conversation


pull[bot] commented Jun 23, 2024

See Commits and Changes for more details.


Created by pull[bot]

Can you help keep this open source service alive? 💖 Please sponsor : )

0cc4m and others added 4 commits June 23, 2024 10:21
* Refactor Vulkan backend to allow multiple contexts

* Fix too many shader groups called validation error in llama3 on AMD and Intel GPUs

* Fix Vulkan debug build error
* test-backend-ops : increase cpy max nmse

* server ci : disable thread sanitizer
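Note on the "increase cpy max nmse" bullet above: test-backend-ops compares a backend's output for each op against a CPU reference using a normalized mean squared error and fails the op if it exceeds a per-op threshold; this commit raises that threshold for the cpy op. A minimal sketch of the metric only (not the actual test harness, whose implementation may differ):

```cpp
#include <cstddef>
#include <cstdio>

// Illustrative only: normalized mean squared error (NMSE) between a backend
// result and a reference buffer, the kind of score compared against the
// "max nmse" threshold mentioned above.
static double nmse(const float * ref, const float * out, size_t n) {
    double err = 0.0, den = 0.0;
    for (size_t i = 0; i < n; ++i) {
        const double d = (double) out[i] - (double) ref[i];
        err += d * d;
        den += (double) ref[i] * (double) ref[i];
    }
    return den > 0.0 ? err / den : err;
}

int main() {
    const float ref[] = {1.0f, 2.0f, 3.0f};
    const float out[] = {1.0f, 2.1f, 2.9f};
    printf("nmse = %g\n", nmse(ref, out, 3)); // small value -> within threshold
}
```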
* hf bitnet v1

* hf bitnet e2e v2

* finish bitnet e2e

* finish f16 hf bitnet e2e

* remove unused

* finish bitnet i2 e2e

* move i2s to quantize v1

* move i2 to quantize

* clean code

* clean code 2

* fix codestyle

* fix code

* fix

* fix code

* fix merge

* remove unused

* change table name

* fix whitespace

* delete redundant

* i2_s to absmax

* finish i2_s/i8_s vec_dot x86 simd

* i2s->q22

* fix code

* remove block scale

* add dequantize

* fix seq

* update avx2

* remove q2_2

* remove q22_grid

* fix whitespace

* reuse llm_build_kv

* fix bo

---------

Co-authored-by: root <root@wangjinheng>
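Note on the ternary bullets in this commit ("finish bitnet i2 e2e", "i2_s to absmax", "remove block scale"): BitNet-style 2-bit weights are quantized to {-1, 0, +1} using an absmax scale. Below is a minimal, hypothetical sketch of that math only; the function name and layout are assumptions, and the real i2_s code packs the 2-bit values and handles the scale differently (per the "remove block scale" bullet, not per block):

```cpp
#include <cmath>
#include <cstdint>
#include <cstdio>

// Hypothetical sketch of absmax ternary quantization: scale = max(|w|) over
// the values, then each weight is rounded to {-1, 0, +1}. Dequantization is
// w[i] ~= q[i] * scale. This illustrates the math, not the i2_s storage format.
static float quantize_ternary_absmax(const float * w, int8_t * q, int n) {
    float amax = 0.0f;
    for (int i = 0; i < n; ++i) {
        const float a = fabsf(w[i]);
        if (a > amax) amax = a;
    }
    const float scale = amax > 0.0f ? amax : 1.0f;
    for (int i = 0; i < n; ++i) {
        // |w[i]| <= amax, so w[i]/scale lies in [-1, 1] and rounds to -1, 0 or +1
        q[i] = (int8_t) lroundf(w[i] / scale);
    }
    return scale;
}

int main() {
    const float w[] = {0.8f, -0.1f, -0.9f, 0.4f};
    int8_t q[4];
    const float s = quantize_ternary_absmax(w, q, 4);
    printf("scale=%g q=[%d %d %d %d]\n", s, q[0], q[1], q[2], q[3]);
}
```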
* ggml : remove ggml_task_type and GGML_PERF

* check abort_callback on main thread only

* vulkan : remove usage of ggml_compute_params

* remove LLAMA_PERF
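Note on "check abort_callback on main thread only": the idea is that a single thread polls the user-supplied abort callback and shares the decision with the other workers, instead of every thread invoking it. A hedged sketch of that pattern (the type and struct names here are assumptions for illustration, not ggml's actual code):

```cpp
#include <atomic>
#include <cstdio>

// Sketch: only thread 0 calls the (potentially expensive or non-thread-safe)
// user callback; the result is published through an atomic flag that all
// worker threads read.
using abort_callback_t = bool (*)(void * user_data);

struct compute_state {
    std::atomic<bool> abort_requested{false};
    abort_callback_t  abort_callback = nullptr;
    void *            abort_data     = nullptr;
};

static bool should_abort(compute_state & st, int thread_id) {
    if (thread_id == 0 && st.abort_callback) {
        if (st.abort_callback(st.abort_data)) {
            st.abort_requested.store(true, std::memory_order_relaxed);
        }
    }
    return st.abort_requested.load(std::memory_order_relaxed);
}

static bool never_abort(void * /*user_data*/) { return false; }

int main() {
    compute_state st;
    st.abort_callback = never_abort;
    printf("abort? %d\n", should_abort(st, /*thread_id=*/0) ? 1 : 0);
}
```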