We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent c5d3e67 commit cd6b117Copy full SHA for cd6b117
catalog/qwen2-5-0-5b-instruct.yaml
@@ -32,4 +32,5 @@ spec:
32
enforce_eager: true
33
gpu_memory_utilization: 0.95
34
enable_chunked_prefill: true
35
+ tool_call_parser: hermes
36
served_model_name: Qwen/Qwen2.5-0.5B-Instruct
catalog/qwq-32b-preview.yaml
served_model_name: Qwen/QwQ-32B-Preview
0 commit comments