Skip to content

Improve docker autocalc linear recipe for long contexts (cherry-pick to 0.16.0)#1041

Open
nngokhale wants to merge 4 commits intovllm-project:releases/v0.16.0from
nngokhale:plugin-cd-0.16.0_lin32k
Open

Improve docker autocalc linear recipe for long contexts (cherry-pick to 0.16.0)#1041
nngokhale wants to merge 4 commits intovllm-project:releases/v0.16.0from
nngokhale:plugin-cd-0.16.0_lin32k

Conversation

@nngokhale
Copy link
Contributor

Improve docker autocalc linear recipe for long contexts

Add Torch Compile support / Env vars

Update to latest env var names

Add Qwen3 head_dim support

Signed-off-by: Neelesh Gokhale <neelesh.gokhale@intel.com>
Signed-off-by: Neelesh Gokhale <neelesh.gokhale@intel.com>
Signed-off-by: Neelesh Gokhale <neelesh.gokhale@intel.com>
Signed-off-by: Neelesh Gokhale <neelesh.gokhale@intel.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant