
Conversation

@ggerganov
Member

@ggerganov ggerganov commented Jan 20, 2025

Same as the -hf arg, but for the draft model.

Example

llama-server \
    -hf  ggml-org/DeepSeek-R1-Distill-Qwen-32B-Q8_0-GGUF \
    -hfd ggml-org/DeepSeek-R1-Distill-Qwen-1.5B-Q4_0-GGUF \
    --ctx-size 32768 -fa -t 1 --draft-max 16 --draft-min 2
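
For reference, the same invocation with long-form flags; the --hf-repo-draft spelling is assumed here to mirror --hf-repo as the long form of -hfd (check llama-server --help for the exact name):

llama-server \
    --hf-repo ggml-org/DeepSeek-R1-Distill-Qwen-32B-Q8_0-GGUF \
    --hf-repo-draft ggml-org/DeepSeek-R1-Distill-Qwen-1.5B-Q4_0-GGUF \
    --ctx-size 32768 -fa -t 1 --draft-max 16 --draft-min 2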

@ggerganov ggerganov requested a review from ngxson as a code owner January 20, 2025 20:16
@ggerganov ggerganov merged commit 80d0d6b into master Jan 20, 2025
45 checks passed
@ggerganov ggerganov deleted the gg/arg-add-hfd branch January 20, 2025 20:29
anagri pushed a commit to BodhiSearch/llama.cpp that referenced this pull request Jan 26, 2025
* common : add -hfd option for the draft model

* cont : fix env var

* cont : more fixes
@eng1n88r

eng1n88r commented Feb 4, 2025

How would you specify the draft model's local path in this case?

i.e.

llama-server \
    --hf-repo ggml-org/Qwen2.5-Coder-7B-Q8_0-GGUF \
    --hf-file qwen2.5-coder-7b-q8_0.gguf \
    --port 8012 -ngl 99 -fa -ub 1024 -b 1024 \
    --model /custom/model/path/Qwen2.5-Coder-7B-Q8_0.gguf \
    --ctx-size 0 --cache-reuse 256
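
One possible answer, sketched under the assumption that llama-server's existing -md/--model-draft flag accepts a local GGUF path for the draft model (the draft file path below is a hypothetical placeholder):

llama-server \
    --hf-repo ggml-org/Qwen2.5-Coder-7B-Q8_0-GGUF \
    --hf-file qwen2.5-coder-7b-q8_0.gguf \
    --model /custom/model/path/Qwen2.5-Coder-7B-Q8_0.gguf \
    --model-draft /custom/model/path/Qwen2.5-Coder-1.5B-Q8_0.gguf \
    --port 8012 -ngl 99 -fa -ub 1024 -b 1024 \
    --ctx-size 0 --cache-reuse 256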

tinglou pushed a commit to tinglou/llama.cpp that referenced this pull request Feb 13, 2025
* common : add -hfd option for the draft model

* cont : fix env var

* cont : more fixes
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Feb 26, 2025
* common : add -hfd option for the draft model

* cont : fix env var

* cont : more fixes
mglambda pushed a commit to mglambda/llama.cpp that referenced this pull request Mar 8, 2025
* common : add -hfd option for the draft model

* cont : fix env var

* cont : more fixes
