Optimize the inference performance of the FLA operator On Qwen3.5 Model#7597
Draft
mikequan0425 wants to merge 2 commits intovllm-project:mainfrom
Draft
Optimize the inference performance of the FLA operator On Qwen3.5 Model#7597mikequan0425 wants to merge 2 commits intovllm-project:mainfrom
mikequan0425 wants to merge 2 commits intovllm-project:mainfrom
Commits
Commits on Mar 24, 2026
- committed
- committed