[pull] master from ggerganov:master #140

pull · 2024-08-09T14:34:44Z

See Commits and Changes for more details.

This commit adds the `--pooling` option to the README.md file in the `examples/embedding` directory.

The motivation for adding this option is that, currently, if the model used does not specify a pooling type, the embedding example fails with the following error message:

```console
main: error: pooling type NONE not supported
```

This commit also updates the name of the executable in the examples section.

* ggml: use vulkan as gpu backend when available

  Signed-off-by: Matt Stephenson <mstephenson6@users.noreply.github.com>
* whisper: enable using vk as default buffer type

  Signed-off-by: Matt Stephenson <mstephenson6@users.noreply.github.com>

---------

Signed-off-by: Matt Stephenson <mstephenson6@users.noreply.github.com>

* init
* rename
* add run android for termux in readme
* add android readme
* add instructions in readme
* change name in readme
* Update README.md
* fixed line
* add result in readme
* random pos_embed
* add positions index
* change for ollama
* change for ollama
* better pos_embed in clip
* support ollama
* updata cmakelist
* updata cmakelist
* rename wrapper
* clear code
* replace and organize code
* add link
* sync master
* fix warnings
* fix warnings
* fix bug in bicubic resize when need resize iamge smaller
* receive review comments and modify
* receive review comments and modify
* put all code into llava dir
* fix quality problem in pr code
* change n_layer
* add space in "-1"
* imitate reshape bug of python code
* fix bug in clip
* fix issues for merging
* fix llama-minicpmv-cli in cmake file
* change pr readme
* fix code review
* remove in line 33 directory in the /cmakelists.txt (not in example, in the main dir
* fix cmakefile
* add warn
* fix KEY_HAS_MINICPMV_PROJ
* remove load_image_size into clip_ctx
* remove the extern "C", MINICPMV_API
* fix uhd code for review comment
* delete minicpmv-wrapper in pr
* remove uhd_image_embed
* Modify 2 notes
* clip : style changes
* del common.h in clip
* fix Type-Check error
* fix Type-Check error
* fix Type-Check error
* fix Type-Check error
* fix makefile error
* fix ubuntu-make error
* try fix clip
* try fix 1

---------

Co-authored-by: Hongji Zhu <fireyoucan@gmail.com>
Co-authored-by: harvestingmoon <leewenyeong@gmail.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

compilade and others added 7 commits August 8, 2024 23:54

llama : reduce useless copies when saving session (#8916)

345a686

* llama : avoid useless copies in dummy session writer
* llama : avoid double tensor copy when saving session to buffer

server : add one level list nesting for embeddings (#8936)

daef3ab

llama : fix typo in llama_tensor_get_type comment [no ci] (#8937)

6f6496b

sync : ggml

4305b57

github-actions bot added examples python server ggml Vulkan script labels Aug 9, 2024

ggerganov and others added 4 commits August 9, 2024 18:23

llama : better replace_all (cont) (#8926)

45a55b9

* llama : better replace_all (cont)

  ggml-ci
* code : deduplicate replace_all

  ggml-ci

make : fix llava obj file race (#8946)

272e3bd

ggml-ci

llama : add support for lora adapters in T5 model (#8938)

6afd1a9

Co-authored-by: Stanisław Szymczyk <sszymczy@gmail.com>

Merge commit from fork

b72942f

teleprint-me closed this Aug 9, 2024

pull bot commented Aug 9, 2024
