Added --chat-template-file to llama-run #11922
Conversation
@ericcurtin PTAL
force-pushed from 36a0c4f to 2ec2107
force-pushed from 2ec2107 to 6e50443
This code LGTM, waiting on builds.
Should pass now. Didn't review the latest changes from master thoroughly. Not sure why I get that error.
examples/run/run.cpp (Outdated)

    return "";
}

FILE * file = ggml_fopen(chat_template_file.c_str(), "r");
This would be better if it used the File class from this code. It would automatically fclose the file when necessary. I would also change fopen -> ggml_fopen in that File class.
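A minimal sketch of what such a RAII wrapper could look like, assuming ggml_fopen from ggml.h; the actual File class in run.cpp may differ in detail:

```cpp
#include <cstdio>
#include <string>

#include "ggml.h"  // for ggml_fopen, the portable fopen wrapper

// Hypothetical RAII file wrapper: the destructor closes the handle, so every
// early return in the caller releases the FILE * automatically.
struct File {
    FILE * file = nullptr;

    FILE * open(const std::string & path, const char * mode) {
        file = ggml_fopen(path.c_str(), mode);
        return file;
    }

    ~File() {
        if (file) {
            fclose(file);
        }
    }
};
```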
The File class from this code is behind the LLAMA_USE_CURL macro, so I'd need to move it, since --chat-template-file should also work without that macro. What do you think?
Yeah I'd move it out of the LLAMA_USE_CURL macro. It doesn't actually do any curl stuff.
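Roughly the restructure being suggested, as a sketch only; the name of the other curl-guarded helper is illustrative:

```cpp
// Before: the generic file helper is only compiled when curl support is enabled.
#ifdef LLAMA_USE_CURL
struct File { /* ... */ };
struct HttpClient { /* ... */ };  // illustrative name for the curl-specific code
#endif

// After: File is always available; only the curl-dependent parts stay guarded.
struct File { /* ... */ };

#ifdef LLAMA_USE_CURL
struct HttpClient { /* ... */ };
#endif
```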
Done. PTAL @ericcurtin
force-pushed from 6e50443 to f01a139
examples/run/run.cpp (Outdated)

    printe("Error reading chat template file '%s': %s", chat_template_file.c_str(), strerror(errno));
    return "";
}
return std::string(data.begin(), data.end());
One could actually fread directly into a std::string and save a copy. It's an optimisation, a small one. A std::string is basically a vector of chars.
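A sketch of that version, reusing the File wrapper and the printe helper that appear in the snippets above; it assumes printe takes printf-style arguments and that the template file fits in memory, so the actual change in the PR may differ:

```cpp
#include <cerrno>
#include <cstdio>
#include <cstring>
#include <string>

// Hypothetical helper: read the whole chat template file straight into a
// std::string, skipping the intermediate std::vector<char> and its copy.
static std::string read_chat_template_file(const std::string & chat_template_file) {
    File file;
    if (!file.open(chat_template_file, "r")) {
        printe("Error opening chat template file '%s': %s", chat_template_file.c_str(), strerror(errno));
        return "";
    }

    fseek(file.file, 0, SEEK_END);
    const long size = ftell(file.file);
    fseek(file.file, 0, SEEK_SET);
    if (size < 0) {
        printe("Error reading chat template file '%s': %s", chat_template_file.c_str(), strerror(errno));
        return "";
    }

    std::string out;
    out.resize(static_cast<size_t>(size));
    // std::string storage is contiguous since C++11, so &out[0] is a valid buffer.
    if (fread(&out[0], 1, out.size(), file.file) != out.size()) {
        printe("Error reading chat template file '%s': %s", chat_template_file.c_str(), strerror(errno));
        return "";
    }
    return out;
}
```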
Changed.
Never seen that before 🤷
Relates to: ggml-org#11178

Added --chat-template-file CLI option to llama-run. If specified, the file is read and its content passed to common_chat_templates_from_model to override the model's chat template.

Signed-off-by: Michael Engel <[email protected]>
force-pushed from f01a139 to 86f68a8
Hmm... closing and reopening it. Hopefully this fixes that error.
Relates to: #11178

Added --chat-template-file CLI option to llama-run. If specified, the file is read and its content passed to common_chat_templates_from_model to override the model's chat template.

This also enables running the granite-code model from Ollama, for example:
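For illustration, an invocation might look like the following; the template path here is hypothetical, and the ollama:// prefix assumes llama-run's support for pulling models from the Ollama registry:

```sh
llama-run --chat-template-file ./chat-template.jinja ollama://granite-code
```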