Releases · tinglou/llama.cpp

24 Jan 03:46

564804b

b4539 Latest

tests: fix some mul_mat test gaps (#11375)

Now that we have batched mat-vec mul Vulkan shaders for up to n==8,
these tests weren't actually exercising the mat-mat mul path. Test
n==9 as well. Also, change to use all_types.
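
The test gap described above can be illustrated with a small sketch. It assumes, as a simplification, that the kernel path depends only on the column count n, with the batched mat-vec shaders covering n up to 8 as the commit message states. The names `MAX_MAT_VEC_COLS`, `select_path`, and `paths_covered` are hypothetical and do not come from the llama.cpp sources.

```python
# Hypothetical model of the dispatch described in the commit message.
# The real backend may consider more than the column count.
MAX_MAT_VEC_COLS = 8  # batched mat-vec shaders cover n = 1..8 (from the commit message)


def select_path(n: int) -> str:
    """Return which kernel family a mul_mat with n columns would use in this model."""
    if n < 1:
        raise ValueError("n must be positive")
    return "mat_vec" if n <= MAX_MAT_VEC_COLS else "mat_mat"


def paths_covered(test_ns):
    """Return the set of kernel paths exercised by the tested n values."""
    return {select_path(n) for n in test_ns}
```

Under this model, a test list that stops at n==8 only ever reaches the mat-vec path, while adding n==9 brings the mat-mat path into coverage.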

Assets 23

cudart-llama-bin-win-cu11.7-x64.zip (303 MB, 2025-01-24T03:46:39Z)
cudart-llama-bin-win-cu12.4-x64.zip (373 MB, 2025-01-24T03:46:46Z)
llama-b4539-bin-macos-arm64.zip (19.8 MB, 2025-01-24T03:46:53Z)
llama-b4539-bin-macos-x64.zip (21.3 MB, 2025-01-24T03:46:54Z)
llama-b4539-bin-ubuntu-x64.zip (23.2 MB, 2025-01-24T03:46:55Z)
llama-b4539-bin-win-avx-x64.zip (13.8 MB, 2025-01-24T03:46:56Z)
llama-b4539-bin-win-avx2-x64.zip (13.9 MB, 2025-01-24T03:46:57Z)
llama-b4539-bin-win-avx512-x64.zip (13.9 MB, 2025-01-24T03:46:57Z)
llama-b4539-bin-win-cuda-cu11.7-x64.zip (152 MB, 2025-01-24T03:46:58Z)
llama-b4539-bin-win-cuda-cu12.4-x64.zip (151 MB, 2025-01-24T03:47:02Z)
Source code (zip) (2025-01-23T20:51:24Z)
Source code (tar.gz) (2025-01-23T20:51:24Z)

20 Jan 02:08

github-actions

b4511

92bc493

tests : increase timeout when sanitizers are enabled (#11300)

* tests : increase timeout when sanitizers are enabled

* tests : add DEFAULT_HTTP_TIMEOUT
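
A minimal sketch of how a sanitizer-aware default timeout could be chosen. The environment variable name `LLAMA_SANITIZE` and the values 12 and 30 seconds are assumptions for illustration; only the constant name `DEFAULT_HTTP_TIMEOUT` appears in the release note.

```python
import os


def default_http_timeout(env=None, base=12, sanitized=30):
    """Pick an HTTP timeout in seconds, longer when sanitizers are enabled.

    `env` defaults to os.environ; the variable name and both values
    are assumptions made for this sketch.
    """
    env = os.environ if env is None else env
    return sanitized if "LLAMA_SANITIZE" in env else base


# Module-level default, mirroring the constant named in the release note.
DEFAULT_HTTP_TIMEOUT = default_http_timeout()
```

Sanitizer builds run noticeably slower, so a single constant that scales with the build type avoids hard-coding longer timeouts in each test.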

Assets 23

15 Jan 15:11

github-actions

b4488

1d85043

fix: ggml: fix vulkan-shaders-gen build (#10448)

* fix: ggml: fix vulkan-shaders-gen build

The vulkan-shaders-gen target was not being built correctly
in case of cross-compilation.
Other outputs need to be built for the cross compile target,
but vulkan-shaders-gen needs to be built for the host.

* refactor: ggml: Improve vulkan-shaders-gen toolchain setup

- Add GGML_SHADERS_GEN_TOOLCHAIN CMake option.
- Auto-detect host toolchain if not set.

* refactor: ggml: Improve vulkan-shaders-gen toolchain setup

Use configure_file to generate host_toolchain.cmake from template

* fix: ggml: Fix compile error

Fix compile error not finding vulkan-shaders-gen

* fix: vulkan-shaders-gen build and path handling

Fix build issues with vulkan-shaders-gen:
- Add target dependency for correct build order
- Use CMAKE_HOST_SYSTEM_NAME for executable suffix
- Fix MSVC output directory in host toolchain
- Normalize path handling for cross-compilation
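
The executable-suffix point in the list above can be sketched as follows. This is an illustration in Python, not the CMake code from the commit: the function name `host_executable_name` is hypothetical, and the only fact relied on is that `CMAKE_HOST_SYSTEM_NAME` is `Windows` on Windows hosts.

```python
def host_executable_name(tool: str, host_system_name: str) -> str:
    """Return the file name of a build tool that must run on the host.

    The suffix follows the host system, not the cross-compile target,
    because the tool is executed during the build on the host machine.
    """
    suffix = ".exe" if host_system_name == "Windows" else ""
    return tool + suffix
```

When cross-compiling from Windows to another target, deriving the suffix from the target system would yield a file name that does not exist on the host, which is the kind of mismatch the fix addresses.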

* fix: improve host compiler detection in vulkan shader build

Improve host compiler detection for vulkan shader generation:
- Add NO_CMAKE_FIND_ROOT_PATH to all compiler searches
- Consolidate compiler detection logic
- Fix Windows-specific MSVC detection
- Ensure correct compiler search in cross-compilation

* refactor: Simplify CMake function for detecting host compiler

Simplified the CMake function to improve the process of detecting the host compiler.

* fix: Remove unnecessary Vulkan library linkage in CMakeLists.txt

Since `vulkan-shaders-gen.cpp` only requires the `glslc` executable
and not the Vulkan headers or libraries, CMakeLists.txt needs to
be corrected.
(See: ecc93d0558fc3ecb8a5af69d2ece02fae4710ade)

* refactor: Rename host_toolchain.cmake.in

- Rename host_toolchain.cmake.in to cmake/host-toolchain.cmake.in

* refactor: GGML_VULKAN_SHADERS_GEN_TOOLCHAIN

Rename the macro GGML_SHADERS_GEN_TOOLCHAIN to GGML_VULKAN_SHADERS_GEN_TOOLCHAIN

Assets 23

13 Jan 14:51

github-actions

b4473

a29f087

contrib : add naming guidelines (cont) (#11177)

Assets 23

12 Jan 07:41

github-actions

b4462

c05e8c9

gguf-py: fixed local detection of gguf package (#11180)

* updated path to gguf package for non-installed setups

* added reader.py to readme

* Bumped gguf version to 0.15.0
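
One common way to prefer a local, non-installed copy of a package is to put its directory at the front of `sys.path` when it exists. The sketch below shows that pattern under assumptions: the helper name `prefer_local_package` and its parameters are hypothetical, and the directory name `gguf-py` is taken from the release title rather than from the changed code.

```python
import sys
from pathlib import Path


def prefer_local_package(repo_root, package_dir="gguf-py", path=None):
    """Put <repo_root>/<package_dir> first on the import path if it exists.

    Returns True when the local directory was found, False otherwise.
    `path` defaults to sys.path; passing a list makes the helper testable.
    """
    path = sys.path if path is None else path
    candidate = Path(repo_root) / package_dir
    if not candidate.is_dir():
        return False
    entry = str(candidate)
    if entry not in path:
        path.insert(0, entry)  # local copy wins over an installed package
    return True
```

A script would call this once before `import gguf`, passing a repository root computed from its own location.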

Assets 23

09 Jan 08:32

github-actions

b4450

8d59d91

fix: add missing msg in static_assert (#11143)

Signed-off-by: hydai <z54981220@gmail.com>

Assets 23

07 Jan 03:08

github-actions

b4431

dc7cef9

llama-run : fix context size (#11094)

Set `n_ctx` equal to `n_batch` in `Opt` class. Now context size is
a more reasonable 2048.

Signed-off-by: Eric Curtin <ecurtin@redhat.com>
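
The change above can be pictured with a small options class in which the context size follows the batch size unless set explicitly. This is a Python illustration, not the C++ `Opt` class from llama-run: treating 0 as "unset" and the constant name `DEFAULT_N_BATCH` are assumptions, while the value 2048 comes from the release note.

```python
from dataclasses import dataclass

DEFAULT_N_BATCH = 2048  # context size stated in the release note


@dataclass
class Opt:
    """Illustrative options holder; 0 means 'not set by the user' here."""

    n_batch: int = DEFAULT_N_BATCH
    n_ctx: int = 0

    def __post_init__(self):
        # Keep the context size equal to the batch size when it is unset.
        if self.n_ctx <= 0:
            self.n_ctx = self.n_batch
```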

Assets 23

30 Dec 06:59

github-actions

b4397

a813bad

vulkan: im2col and matmul optimizations for stable diffusion (#10942)

* tests: Add im2col perf tests

* vulkan: optimize im2col, more elements per thread

* vulkan: increase small tile size for NV_coopmat2

* vulkan: change im2col to 512 elements per workgroup
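
For readers unfamiliar with the operation being optimized, im2col unfolds each kernel-sized patch of an image into a row so that a convolution becomes a matrix product. The sketch below is a plain reference version for a single-channel 2D image with stride 1 and no padding; it is not the Vulkan shader, and the function names are illustrative.

```python
def im2col(image, kh, kw):
    """Unfold a 2D image (list of rows) into one flattened row per patch."""
    h, w = len(image), len(image[0])
    out_h, out_w = h - kh + 1, w - kw + 1
    cols = []
    for oy in range(out_h):
        for ox in range(out_w):
            # Flatten the kh x kw patch whose top-left corner is (oy, ox).
            patch = [image[oy + ky][ox + kx] for ky in range(kh) for kx in range(kw)]
            cols.append(patch)
    return cols


def conv2d_via_im2col(image, kernel):
    """Convolve by taking the dot product of each patch row with the flat kernel."""
    kh, kw = len(kernel), len(kernel[0])
    flat_kernel = [v for row in kernel for v in row]
    return [
        sum(a * b for a, b in zip(patch, flat_kernel))
        for patch in im2col(image, kh, kw)
    ]
```

Because every output element is an independent copy, the work can be split across threads in many ways, which is why the number of elements handled per thread or workgroup is a tuning parameter.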

Assets 23

27 Dec 03:51

github-actions

b4393

d79d8f3

vulkan: multi-row k quants (#10846)

* multi row k quant shaders!

* better row selection

* more row choices

* readjust row selection

* rm_kq=2 by default
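
A rough sketch of the multi-row idea in a matrix-vector product: several output rows are computed together so that each vector element is loaded once per group of rows instead of once per row. This assumes `rm_kq` denotes the number of rows handled together for k-quant types, which the release note does not state; the function and parameter names are hypothetical, and quantization is left out entirely.

```python
def mat_vec_multi_row(matrix, vec, rows_per_group=2):
    """Compute matrix @ vec, processing rows_per_group rows at a time."""
    n_rows = len(matrix)
    out = [0] * n_rows
    for first in range(0, n_rows, rows_per_group):
        rows = range(first, min(first + rows_per_group, n_rows))
        for k, x in enumerate(vec):
            # x is read once here and reused for every row in the group.
            for r in rows:
                out[r] += matrix[r][k] * x
    return out
```

The result is identical for any group size; only the amount of reuse of each vector element changes, which is what makes the row count a performance knob.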

Assets 23

13 Dec 02:18

github-actions

b4318

d583cd0

ggml : Fix compilation issues on ARM platform when building without f…

Assets 22
