Commit c75869b

change doc
1 parent e75bb9e commit c75869b

2 files changed: +7 -6 lines changed

docs/source/use-vllm-as-backend.mdx

Lines changed: 0 additions & 6 deletions

@@ -44,19 +44,13 @@ model: # Model specific parameters
     model_args: "pretrained=HuggingFaceTB/SmolLM-1.7B,revision=main,dtype=bfloat16" # Model args that you would pass in the command line
   generation: # Generation specific parameters
     temperature: 0.3
-    early_stopping: 1
     repetition_penalty: 1.0
     frequency_penalty: 0.0
-    length_penalty: 0.0
     presence_penalty: 0.0
-    max_new_tokens: 100
-    min_new_tokens: 1
     seed: 42
-    stop_tokens: null
     top_k: 0
     min_p: 0.0
     top_p: 0.9
-    truncate_prompt: false
 ```

 > [!WARNING]
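
The hunk above can be replayed as a small check. The sketch below is only an illustration and not part of the commit: it hard-codes the key names shown in the hunk (`documented_before` and `removed` are hypothetical variable names) and confirms that dropping the six deleted keys leaves the eight generation parameters that stay documented.

```python
# Generation keys listed in the docs example before the commit, in hunk order.
documented_before = [
    "temperature", "early_stopping", "repetition_penalty", "frequency_penalty",
    "length_penalty", "presence_penalty", "max_new_tokens", "min_new_tokens",
    "seed", "stop_tokens", "top_k", "min_p", "top_p", "truncate_prompt",
]

# Keys that appear on the "-" lines of the hunk.
removed = {
    "early_stopping", "length_penalty", "max_new_tokens",
    "min_new_tokens", "stop_tokens", "truncate_prompt",
}

# Keep the original order and drop only the removed keys.
documented_after = [key for key in documented_before if key not in removed]
print(documented_after)
```

The surviving keys are the same eight that the second file in this commit ends up with (`temperature` plus the seven added lines), so the docs and the example config list the same generation parameters after the change.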

examples/model_configs/vllm_model_config.yaml

Lines changed: 7 additions & 0 deletions

@@ -3,3 +3,10 @@ model:
     model_args: "pretrained=HuggingFaceTB/SmolLM-1.7B,revision=main,dtype=bfloat16" # pretrained=model_name,trust_remote_code=boolean,revision=revision_to_use,model_parallel=True ...
   generation:
     temperature: 0.3
+    repetition_penalty: 1.0
+    frequency_penalty: 0.0
+    presence_penalty: 0.0
+    seed: 42
+    top_k: 0
+    min_p: 0.0
+    top_p: 0.9
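
The `model_args` value is a comma-separated list of `key=value` pairs, as the inline comment in the config indicates. A minimal sketch of how such a string could be split into a dictionary follows. `parse_model_args` is a hypothetical helper written for illustration, not the project's own parser, and it leaves every value as a string rather than converting booleans or numbers.

```python
def parse_model_args(model_args: str) -> dict[str, str]:
    """Split "k1=v1,k2=v2" into {"k1": "v1", "k2": "v2"} (hypothetical helper)."""
    parsed = {}
    for pair in model_args.split(","):
        pair = pair.strip()
        if not pair:
            continue  # tolerate empty input and trailing commas
        key, sep, value = pair.partition("=")
        if not sep:
            raise ValueError(f"expected key=value, got {pair!r}")
        parsed[key.strip()] = value.strip()
    return parsed


args = parse_model_args(
    "pretrained=HuggingFaceTB/SmolLM-1.7B,revision=main,dtype=bfloat16"
)
print(args)
```

Splitting on the first `=` only (via `partition`) keeps values that themselves contain `=` intact. A value containing a comma would still break this sketch.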
