Skip to content

Commit c1c003f

Browse files
committed
mla cleanup
Signed-off-by: Sage Moore <[email protected]>
1 parent 21b0f16 commit c1c003f

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm/v1/attention/backends/mla/common.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -586,7 +586,7 @@ def _build_decode(self, block_table_tensor: torch.Tensor,
586586
)
587587

588588
def build_for_cudagraph_capture(
589-
self, common_attn_metadata: CommonAttentionMetadata) -> M:
589+
self, common_attn_metadata: CommonAttentionMetadata) -> M:
590590
"""
591591
This method builds the metadata for full cudagraph capture.
592592
Currently, only decode is supported for full cudagraphs with MLA.

0 commit comments

Comments
 (0)