Skip to content

Conversation

@jhen0409
Copy link
Collaborator

@jhen0409 jhen0409 commented Nov 13, 2024

The llama_chat_apply_template will use chatml as a fallback, so we will never gets res < 0 for models that don't supported chat template, and the warning will never be printed.

https://github.com/ggerganov/llama.cpp/blob/54ef9cfc726a799e6f454ac22c4815d037716eda/src/llama.cpp#L21822-L21843

@jhen0409 jhen0409 force-pushed the fix-server-tmpl-check branch from 5cd0000 to 71c2c7f Compare November 13, 2024 02:45
@ggerganov ggerganov merged commit 0e712a5 into ggml-org:master Nov 13, 2024
53 checks passed
@jhen0409 jhen0409 deleted the fix-server-tmpl-check branch November 13, 2024 11:51
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 15, 2024
…10272)

* server : fix validate_model_chat_template

* server : fix chat res
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 17, 2024
…10272)

* server : fix validate_model_chat_template

* server : fix chat res
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 18, 2024
…10272)

* server : fix validate_model_chat_template

* server : fix chat res
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants