Skip to content

Optimize the inference performance of the FLA operator On Qwen3.5 Model#7597

Draft
mikequan0425 wants to merge 2 commits intovllm-project:mainfrom
mikequan0425:0.17.0rc1-FLA-fix
Draft

Optimize the inference performance of the FLA operator On Qwen3.5 Model#7597
mikequan0425 wants to merge 2 commits intovllm-project:mainfrom
mikequan0425:0.17.0rc1-FLA-fix

Commits

Commits on Mar 24, 2026