Skip to content

Commit 19109e4

Browse files
committed
add clear TODO description
1 parent 8572b8a commit 19109e4

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

fastdeploy/model_executor/layers/attention/ops/append_attention.py

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -144,9 +144,9 @@ def append_attention(
144144
raise NotImplementedError
145145

146146

147-
# TODO: merge w/o output append attention after finishing developing sub-graph cudagraph capture
148-
149-
147+
# TODO: (mengyuan) merge w/o output version append attention after
148+
# finishing developing sub-graph cudagraph capture to reduce
149+
# compilation volume
150150
def append_attention_with_output(
151151
qkv: paddle.Tensor,
152152
key_cache: paddle.Tensor,

0 commit comments

Comments
 (0)