Skip to content

Releases: teleprint-me/llama.cpp

b4600

31 Jan 00:04
553f1e4
Compare
Choose a tag to compare
`ci`: ccache for all github worfklows (#11516)

b4557

26 Jan 02:56
f35726c
Compare
Choose a tag to compare
build: apply MSVC /bigobj option to c/cpp files only (#11423)

b4549

25 Jan 02:54
466ea66
Compare
Choose a tag to compare
CANN: Add Ascend CANN build ci (#10217)

* CANN: Add Ascend CANN build ci

* Update build.yml

* Modify cann image version

* Update build.yml

* Change to run on x86 system

* Update build.yml

* Update build.yml

* Modify format error

* Update build.yml

* Add 'Ascend NPU' label restrictions

* Exclude non PR event

Co-authored-by: Yuanhao Ji <jiyuanhao@apache.org>

* Update build.yml

---------

Co-authored-by: Yuanhao Ji <jiyuanhao@apache.org>

b4519

21 Jan 00:34
80d0d6b
Compare
Choose a tag to compare
common : add -hfd option for the draft model (#11318)

* common : add -hfd option for the draft model

* cont : fix env var

* cont : more fixes

b4508

19 Jan 05:49
a1649cc
Compare
Choose a tag to compare
Adding linenoise.cpp to llama-run (#11252)

This is a fork of linenoise that is C++17 compatible. I intend on
adding it to llama-run so we can do things like traverse prompt
history via the up and down arrows:

https://github.com/ericcurtin/linenoise.cpp

Signed-off-by: Eric Curtin <ecurtin@redhat.com>

b4503

18 Jan 09:15
44e18ef
Compare
Choose a tag to compare
vulkan: fix coopmat2 flash attention for non-contiguous inputs (#11281)

Add code similar to mul_mm_cm2 to force alignment of strides, to avoid
a performance regression.

Add noncontiguous FA tests in test-backend-ops.

Fixes #11268.

b4431

06 Jan 23:53
dc7cef9
Compare
Choose a tag to compare
llama-run : fix context size (#11094)

Set `n_ctx` equal to `n_batch` in `Opt` class. Now context size is
a more reasonable 2048.

Signed-off-by: Eric Curtin <ecurtin@redhat.com>

b4404

02 Jan 03:16
0827b2c
Compare
Choose a tag to compare
ggml : fixes for AVXVNNI instruction set with MSVC and Clang (#11027)

* Fixes for clang AVX VNNI

* enable AVX VNNI and alder lake build for MSVC

* Apply suggestions from code review

---------

Co-authored-by: slaren <slarengh@gmail.com>

b4381

23 Dec 02:12
b92a14a
Compare
Choose a tag to compare
llama : support InfiniAI Megrez 3b (#10893)

* Support InfiniAI Megrez 3b

* Fix tokenizer_clean_spaces for megrez

b4349

17 Dec 20:55
081b29b
Compare
Choose a tag to compare
tests: add tests for GGUF (#10830)