Skip to content

Commit 0ad18c3

Browse files
committed
refact attn metadata build
Signed-off-by: weiguihua2 <[email protected]>
1 parent 0c43220 commit 0ad18c3

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

vllm_ascend/attention/utils.py

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
from dataclasses import dataclass
2-
from typing import Any, Optional
2+
from typing import Any
33

44
import torch
55

@@ -36,7 +36,7 @@ class AscendCommonAttentionMetadata:
3636

3737
slot_mapping_cpu: torch.Tensor
3838

39-
actual_seq_lengths_q: Optional[list[int]]
39+
actual_seq_lengths_q: list[int]
4040

4141
positions: torch.Tensor = None
4242

@@ -70,7 +70,7 @@ class TorchairCommonAttentionMetadata:
7070

7171
decode_token_per_req: int
7272

73-
actual_seq_lengths_q: Optional[list[int]] = None
73+
actual_seq_lengths_q: list[int]
7474

7575
attn_mask: torch.Tensor = None
7676

0 commit comments

Comments
 (0)