Skip to content
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
8 changes: 6 additions & 2 deletions Qwen3.5-122B.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -51,8 +51,12 @@ x-sglang-qwen35-122b-common: &sglang-qwen35-122b-common
sglang serve
--model-path Qwen/Qwen3.5-122B-A10B
--tp 4
--mem-fraction-static 0.80
--context-length 131072
--mem-fraction-static 0.88
--context-length 262144
--kv-cache-dtype fp8_e4m3
--chunked-prefill-size 8192
--attention-backend flashinfer
--schedule-conservativeness 0.5
--reasoning-parser qwen3
--tool-call-parser qwen3_coder
--log-requests-level 0
Expand Down