demos/continuous_batching/agentic_ai
1 file changed: +2 -2 lines changed

@@ -40,13 +40,13 @@ Run `export_model.py` script to download and quantize the model:
:::{tab-item} CPU
``` console
- python export_model.py text_generation --source_model Qwen/Qwen3-8B --weight-format int8 --config_file_path models/config.json --model_repository_path models --tools_model_type qwen3 --overwrite_models --enable_prefix_caching
+ python export_model.py text_generation --source_model Qwen/Qwen3-8B --weight-format int8 --config_file_path models/config.json --model_repository_path models --tool_parser qwen3 --overwrite_models --enable_prefix_caching
```
:::
:::{tab-item} GPU
``` console
- python export_model.py text_generation --source_model Qwen/Qwen3-8B --weight-format int8 --config_file_path models/config.json --model_repository_path models --tools_model_type qwen3 --target_device GPU --enable_prefix_caching --cache_size 2
+ python export_model.py text_generation --source_model Qwen/Qwen3-8B --weight-format int8 --config_file_path models/config.json --model_repository_path models --tool_parser qwen3 --target_device GPU --enable_prefix_caching --cache_size 2
```
:::
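
Both hunks make the same change: the flag `--tools_model_type` becomes `--tool_parser`, and the value `qwen3` stays the same. Local scripts that still pass the old flag may need the same update. The following is a minimal sketch, assuming the rename is a pure flag-name swap as this diff suggests; `migrate_flag` is a hypothetical helper for illustration and is not part of `export_model.py`.

```shell
# Hypothetical helper (not part of export_model.py): read text on stdin and
# rewrite the old flag name to the new one. The flag is only matched when it is
# followed by a space or "=", so longer flag names sharing the prefix are untouched.
migrate_flag() {
  sed 's/--tools_model_type\([ =]\)/--tool_parser\1/g'
}

# Example: an abbreviated form of the GPU command from the diff, with the old flag.
old_cmd='python export_model.py text_generation --source_model Qwen/Qwen3-8B --tools_model_type qwen3 --target_device GPU'
new_cmd=$(printf '%s\n' "$old_cmd" | migrate_flag)
printf '%s\n' "$new_cmd"
```

The helper only rewrites text; it does not run the export. To check which flag name a local checkout of the script accepts, inspect the script's own usage output before relying on this rewrite.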