Commit a328124

switch to tokens_per_block=32
1 parent e224184 commit a328124

1 file changed: +1 -1 lines changed

tests/unittest/_torch/modules/test_mla_helix.py

Lines changed: 1 addition & 1 deletion
@@ -80,7 +80,7 @@ class Scenario:
     rope_original_max_position_embeddings: int = 4096
     rope_type: str = "yarn"
     model_type: str = "deepseek_v3"
-    kv_cache_tokens_per_block: int = 64
+    kv_cache_tokens_per_block: int = 32
     # TODO only 1 is supported for now here
     predicted_tokens_per_seq: int = 1
     bias: bool = False
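
For context on what the changed default affects, here is a minimal sketch of the block-count arithmetic, assuming a paged KV cache that stores tokens in fixed-size blocks. The helper name `blocks_needed` and the sample sequence length of 100 are hypothetical and are not taken from the test file; the commit itself does not state why 32 was chosen.

```python
def blocks_needed(num_tokens: int, tokens_per_block: int) -> int:
    """Return how many fixed-size blocks are needed to hold num_tokens.

    Hypothetical helper for illustration; not part of test_mla_helix.py.
    """
    if tokens_per_block <= 0:
        raise ValueError("tokens_per_block must be positive")
    # Ceiling division using integers only: a partially filled
    # block still occupies a whole block.
    return -(-num_tokens // tokens_per_block)


if __name__ == "__main__":
    # Compare the old (64) and new (32) block sizes for a sample length.
    for tokens_per_block in (64, 32):
        print(tokens_per_block, blocks_needed(100, tokens_per_block))
```

Under that assumption, halving the block size roughly doubles the number of blocks a sequence occupies while reducing the unused space in its last block.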
