
Added --chat-template-file to llama-run #11961

Merged

Conversation

engelmi
Contributor

@engelmi engelmi commented Feb 19, 2025

Relates to: #11178

Added a --chat-template-file CLI option to llama-run. If specified, the file is read and its content is passed to common_chat_templates_from_model to override the model's chat template.

This also enables running the granite-code model from ollama:

# using a jinja chat template file 
# (when prefix, e.g. hf://, is not specified, llama-run pulls from ollama)
$ llama-run --chat-template-file ./chat.tmpl granite-code
> write code

Here is a code snippet in Python:

"""
def f(x):
    return x**2
"""

# without a jinja chat template file
$ llama-run granite-code
> write code
failed to apply the chat template


@engelmi
Contributor Author

engelmi commented Feb 19, 2025

Preceding PR: #11922

And I still get this error even though there is no merge commit:

Merge is not an allowed merge method in this repository.
This branch must not contain merge commits. 

@engelmi engelmi force-pushed the added-chat-template-file-to-llama-run branch 3 times, most recently from 193eb87 to d1b57cf Compare February 19, 2025 20:12
Relates to: ggml-org#11178

Added --chat-template-file CLI option to llama-run. If specified, the file
will be read and the content passed for overwriting the chat template of
the model to common_chat_templates_from_model.

Signed-off-by: Michael Engel <mengel@redhat.com>
@engelmi engelmi force-pushed the added-chat-template-file-to-llama-run branch from d1b57cf to 530ba31 Compare February 19, 2025 20:14
@ggerganov ggerganov merged commit 0d55958 into ggml-org:master Feb 20, 2025
46 checks passed
if (!chat_template_file.empty()) {
    chat_template = read_chat_template_file(chat_template_file);
}
auto chat_templates = common_chat_templates_init(llama_data.model.get(), chat_template.empty() ? nullptr : chat_template);
Collaborator

@ericcurtin ericcurtin Feb 20, 2025


We should do:

chat_template.empty() ? "" : chat_template

here. Passing nullptr to a reference is not allowed. I wish the compiler caught these things.

common_chat_templates_ptr common_chat_templates_init(
                                    const struct llama_model * model,
                                           const std::string & chat_template_override,
                                           const std::string & bos_token_override = "",
                                           const std::string & eos_token_override = "")

Collaborator

@ericcurtin ericcurtin Feb 20, 2025


Maybe the std::string class is smart enough to interpret all of these as the same thing:

"", '', 0, NULL, nullptr

and that's why it compiles/works 🤷. So it might just be implicitly converting it to "".
