
Conversation

csabakecskemeti
Contributor

Nvidia uses the LLaMAForCausalLM string in their config.json, so even though LlamaForCausalLM is supported via
@Model.register("LlamaForCausalLM", "MistralForCausalLM", "MixtralForCausalLM")
the conversion fails on this casing.

I've added LLaMAForCausalLM to the register call.

Example models with this arch string:

  • nvidia/Llama3-ChatQA-2-8B

  • nvidia/Llama3-ChatQA-2-70B
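The idea behind the fix can be sketched with a minimal stand-in registry. This is an illustrative sketch, not llama.cpp's actual `Model.register` implementation; the `register`/`lookup` names here are hypothetical. The point is that the lookup is an exact string match on the `architectures` field from config.json, so each casing variant must be registered explicitly:

```python
# Hypothetical minimal registry sketch; llama.cpp's real Model.register
# differs, but the exact-string-match behavior is the same idea.
_registry = {}

def register(*arch_names):
    """Decorator mapping each config.json architecture string to a converter class."""
    def decorator(cls):
        for name in arch_names:
            _registry[name] = cls
        return cls
    return decorator

# With the extra "LLaMAForCausalLM" spelling registered,
# Nvidia's casing resolves to the same converter.
@register("LLaMAForCausalLM", "LlamaForCausalLM",
          "MistralForCausalLM", "MixtralForCausalLM")
class LlamaModel:
    pass

def lookup(arch: str):
    """Return the converter class for an architecture string, or raise."""
    try:
        return _registry[arch]
    except KeyError:
        raise ValueError(f"Unsupported architecture: {arch}")
```

Without the added alias, `lookup("LLaMAForCausalLM")` would raise even though the model is architecturally identical to a supported Llama variant.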

  • I have read the contributing guidelines

  • Self-reported review complexity:

    • Low
    • Medium
    • High

@github-actions github-actions bot added the python python script changes label Sep 14, 2024
@ggerganov ggerganov merged commit 3c7989f into ggml-org:master Sep 15, 2024
9 checks passed
dsx1986 pushed a commit to dsx1986/llama.cpp that referenced this pull request Oct 29, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 15, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 18, 2024
Nexesenex pushed a commit to Nexesenex/croco.cpp that referenced this pull request Feb 25, 2025
