Merge sglang model docs into cookbook #141
Open
JingwenGu0829 wants to merge 1 commit into sgl-project:main from
Companion PR to sglang#18427 - consolidate documentation. Merges model-specific content from sglang/docs into cookbook pages:
- GPT-OSS: Added Responses API, Built-in Tools, EAGLE3 speculative decoding
- MiniMax-M2: Replaced placeholder with full sglang doc content
- Qwen3-Next: Added Mamba Radix Cache, EAGLE NEXTN speculative decoding
- Qwen3-VL: Added multimodal parameters, hardware-specific notes
- Llama4: Renamed from Llama4-Scout, covers Scout and Maverick
- DeepSeek-V3/V3.1/R1: Added hardware table, optimizations, multi-node, FAQ
- DeepSeek-V3.2: Added installation, launch examples, DSA context parallel
- DeepSeek-OCR: Added prompt examples, OpenAI-compatible requests
- GLM-4.5/4.6: Added EAGLE speculative decoding, Thinking Budget
- GLM-4.7: Parser updated to glm47
- GLM-4.5V/4.6V: Added hardware notes, multimodal parameters, optimized launch
Motivation
This is a companion PR to sglang#18427 - consolidate documentation. Merges model-specific content from sglang/docs into cookbook pages.
Detailed roadmap
This is a SITUATION 2 change as described in the issue, since the sglang docs contain a lot of unique content that the cookbook lacks.
Model Checklist
Models listed in sglang/docs/basic_usage/popular_model_usage.rst (toctree):
Philosophy / Priorities in Merging
Followed a strict set of priorities to keep the merge predictable and minimize risk:
- If the cookbook already has a section covering a topic, leave it as-is, even if the sglang doc phrases it differently. The cookbook is the user-facing source of truth.
- Add content from the sglang doc only when the cookbook has no equivalent section. This means every addition in this PR represents information that was previously only available in docs.sglang.io.
- When adding content, we copy it as closely as possible from the original sglang doc, preserving commands, arguments, code examples, and wording. We do not rewrite or editorialize the technical content.
- Minimal structural changes during merge. The only things we change when inserting content are:
  - #.#.# hierarchy (e.g., a new section becomes 3.4 if 3.3 already exists).
  - Llama4-Scout.md → Llama4.md since the doc covers both Scout and Maverick).
  - ```{tip}, ```{note}) to Markdown equivalents, and adjusting internal links to work within Docusaurus.
- Self-contained pages over cross-references. When one sglang doc covers multiple models (e.g., deepseek_v3.md covers V3, V3.1, R1), shared content (hardware table, optimizations, multi-node, FAQ) is duplicated into each cookbook page rather than centralizing it in one page and linking. This ensures a user setting up R1 doesn't have to navigate to the V3 page for basic setup info.
Tags:
Note on DeepSeek V3/V3.1/R1: The sglang doc deepseek_v3.md covers all three jointly.
Shared content (hardware table, optimizations, multi-node, FAQ) is duplicated into each
cookbook page so that each page is self-contained: a user setting up R1 shouldn't have
to visit the V3 page for shared setup info. The same applies to the GLM series models.
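For reviewers who want a concrete picture of the "minimal structural changes" described above, here is a rough Python sketch of two of them: picking the next number in the #.#.# hierarchy and rewriting MyST `{tip}`/`{note}` fences. It is illustrative only and is not the tooling used for this PR. The helper names `next_section_number` and `convert_admonitions` are hypothetical, and mapping the directives to Docusaurus `:::tip` / `:::note` admonitions is an assumption about what "Markdown equivalents" means here.

```python
import re

FENCE = "`" * 3  # a literal triple backtick, built indirectly for readability

# Matches a MyST directive opener such as a fence followed by {tip} or {note}.
_OPEN = re.compile(r"^`{3}\{(tip|note)\}\s*$")


def next_section_number(existing, chapter):
    """Return 'chapter.N', one past the highest existing 'chapter.x' number.

    Hypothetical helper: only two-level numbers such as '3.3' are considered.
    """
    used = []
    for number in existing:
        parts = number.split(".")
        if len(parts) == 2 and parts[0] == str(chapter):
            used.append(int(parts[1]))
    return f"{chapter}.{max(used, default=0) + 1}"


def convert_admonitions(text):
    """Rewrite MyST tip/note fences as ':::tip' / ':::note' blocks.

    Assumption: the target syntax is the Docusaurus admonition form.
    Limitation: code fences nested inside an admonition are not handled.
    """
    out = []
    inside = False
    for line in text.splitlines():
        match = _OPEN.match(line)
        if not inside and match:
            out.append(f":::{match.group(1)}")
            inside = True
        elif inside and line.strip() == FENCE:
            out.append(":::")
            inside = False
        else:
            # Ordinary lines, including regular code fences, pass through.
            out.append(line)
    return "\n".join(out)
```

With existing sections 3.1 to 3.3, `next_section_number(["3.1", "3.2", "3.3"], 3)` yields `"3.4"`, matching the example in the priorities list. Ordinary code fences outside an admonition are left untouched, so launch commands copied from the sglang doc stay verbatim.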
Details
GPT-OSS: [Merge]
MiniMax M2/M2.1: [New]
Qwen3-Next: [Merge]
- qwen3.md is about Qwen3-Next only.
Qwen3-VL: [Merge]
Llama4: [New] [Renamed]
- Llama4-Scout.md -> Llama4.md (covers both Scout and Maverick).
- sidebars.js and docs/intro.md references.
- Llama4-Scout.md and unused Llama4ScoutConfigGenerator/.
DeepSeek V3: [Merge] [Lot]
DeepSeek V3.1: [Merge] [Lot]
- deepseekv31 parser, tool_chat_template_deepseekv31.jinja).
- deepseek-ai/DeepSeek-V3.1 where applicable.
DeepSeek R1: [Merge] [Lot]
- deepseekv3 parser, tool_chat_template_deepseekr1.jinja).
- CustomLogitProcessor.
- deepseek-ai/DeepSeek-R1-0528 where applicable.
DeepSeek V3.2: [Merge] [Lot]
DeepSeek OCR: [Merge]
GLM-4.5: [Merge]
- Glm4MoeThinkingBudgetLogitProcessor.
GLM-4.6: [Merge]
GLM-4.7: No changes needed; the sglang doc has no 4.7-specific content beyond what's already in the cookbook.
GLM-4.5V: [Merge]
- Glm4MoeThinkingBudgetLogitProcessor.
GLM-4.6V: [Merge]