llama : expose llama_model_n_head_kv in the API #11997

vlovich · 2025-02-21T07:46:13Z

Exposing this from the library layer is useful because it is a key parameter of the model (e.g. for estimating how much KV cache memory is needed).
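As a rough illustration of the KV cache use case, the sketch below estimates cache size from the model's head counts. It is not code from this PR: `kv_cache_bytes` and its parameters are hypothetical, and it assumes every layer has the same number of KV heads, a head dimension of `n_embd / n_head`, and K and V stored at the same element size. Those assumptions do not hold for every architecture. In a real caller the inputs would presumably come from the model getters such as `llama_model_n_head_kv`.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: estimate KV cache size in bytes.
 * Assumptions (not taken from the PR): uniform n_head_kv across layers,
 * head_dim == n_embd / n_head, identical element size for K and V. */
uint64_t kv_cache_bytes(uint32_t n_layer, uint32_t n_head, uint32_t n_head_kv,
                        uint32_t n_embd, uint32_t n_ctx, uint32_t bytes_per_elem) {
    assert(n_head > 0 && n_embd % n_head == 0);

    /* Per-head dimension under the assumption above. */
    uint64_t head_dim = n_embd / n_head;

    /* Factor 2: one K tensor and one V tensor per layer, each holding
     * n_ctx * n_head_kv * head_dim elements. */
    return 2ULL * n_layer * n_ctx * n_head_kv * head_dim * bytes_per_elem;
}
```

For example, with 32 layers, 32 attention heads, 8 KV heads, an embedding size of 4096, a context of 8192 tokens, and 2-byte (f16) elements, this gives 1 GiB. With 32 KV heads (no grouped-query attention) the same configuration needs 4 GiB, which is why the KV head count has to be known for the estimate.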

llama : expose llama_model_n_head_kv in the API

a47d19f

ggerganov approved these changes Feb 21, 2025

This was referenced Feb 21, 2025

Unsatisfied link symbol for llama_model_head_kv in shared library utilityai/llama-cpp-rs#667

Open

Make sure search paths inside OUT_DIR precede external paths rust-lang/cargo#15221

Open

ggerganov merged commit 3e9a286 into ggml-org:master Feb 25, 2025
46 checks passed

vlovich deleted the expose-n-head-kv branch March 1, 2025 06:46
