Skip to content
This repository was archived by the owner on Sep 4, 2025. It is now read-only.

Commit cc2039c

Browse files
authored
Revert "[Kernel] changing fused moe kernel chunk size default to 32k (vllm-project#7995)" (#207)
This reverts commit 34a0e96.
1 parent 48c0cb4 commit cc2039c

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm/envs.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -404,7 +404,7 @@ def get_default_config_root():
404404
os.path.join(get_default_cache_root(), "vllm", "xla_cache"),
405405
)),
406406
"VLLM_FUSED_MOE_CHUNK_SIZE":
407-
lambda: int(os.getenv("VLLM_FUSED_MOE_CHUNK_SIZE", "32768")),
407+
lambda: int(os.getenv("VLLM_FUSED_MOE_CHUNK_SIZE", "65536")),
408408

409409
# If set, vllm will skip the deprecation warnings.
410410
"VLLM_NO_DEPRECATION_WARNING":

0 commit comments

Comments
 (0)