Skip to content

Commit 13f8d8a

Browse files
committed
update
1 parent 745b739 commit 13f8d8a

File tree

4 files changed

+6
-3
lines changed

4 files changed

+6
-3
lines changed

docs/assets/sample-output1.png

-75.6 KB
Loading

docs/assets/sample-output2.png

30.6 KB
Loading

docs/assets/sample-output3.png

17.2 KB
Loading

docs/examples/practice_on_vllm_simulator.md

Lines changed: 6 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -80,8 +80,11 @@ curl --request POST 'http://localhost:8000/v1/completions' \
8080

8181
#### 1.5 Download Tokenizer
8282

83-
Download Qwen/Qwen3-0.6B tokenizer.json from [Qwen/Qwen3-0.6B](https://modelscope.cn/models/Qwen/Qwen3-0.6B/files) save to local path.
84-
83+
Download Qwen/Qwen2.5-1.5B-Instruct tokenizer files from [Qwen/Qwen2.5-1.5B-Instruct](https://modelscope.cn/models/Qwen/Qwen2.5-1.5B-Instruct/files) save to local path such as ${local_path}/Qwen2.5-1.5B-Instruct
84+
```bash
85+
ls ./Qwen2.5-1.5B-Instruct
86+
merges.txt tokenizer.json tokenizer_config.json vocab.json
87+
```
8588
______________________________________________________________________
8689

8790
## 🚀 2. Running Benchmarks
@@ -90,7 +93,7 @@ ______________________________________________________________________
9093
guidellm benchmark \
9194
--target "http://localhost:8000/" \
9295
--model "tweet-summary-0" \
93-
--processor "${local_path}/Qwen3-0.6B" \
96+
--processor "${local_path}/Qwen2.5-1.5B-Instruct" \
9497
--rate-type sweep \
9598
--max-seconds 10 \
9699
--max-requests 10 \

0 commit comments

Comments
 (0)