Commit 4feac9f

[FIX] Sync FlashAttention call parameters with the Hopper implementation in v2.7.4.post1
1 parent 79fcc13 commit 4feac9f

1 file changed: +1 -3 lines changed

lightllm/models/vit/triton_kernel/flashattention_nopad.py

Lines changed: 1 addition & 3 deletions
@@ -175,7 +175,6 @@ def flash_attention_v3_fwd(
     v,
     None,
     None,  # k_new, v_new
-    None,  # qv
     o,  # out
     None,
     None,
@@ -192,9 +191,8 @@ def flash_attention_v3_fwd(
     None,
     None,
     None,
-    None,
     softmax_scale,
-    causal=False,
+    False,  # causal
     window_size=(-1, -1),
     softcap=0.0,
     num_splits=1,
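
The diff above removes two positional `None` placeholders, so every argument after them moves to an earlier slot. The sketch below illustrates why a stale placeholder matters in a long positional call. The functions `fwd_with_qv` and `fwd_without_qv` are hypothetical stand-ins with shortened parameter lists, not the real flash-attn signatures, and `bind_positional` is a helper introduced only for this illustration.

```python
import inspect

# Hypothetical stand-ins (NOT the real flash-attn API): two versions of a
# kernel entry point that differ by one optional positional parameter, `qv`.
def fwd_with_qv(q, k, v, k_new, v_new, qv, out, softmax_scale, causal, softcap=0.0):
    return out

def fwd_without_qv(q, k, v, k_new, v_new, out, softmax_scale, causal, softcap=0.0):
    return out

def bind_positional(fn, args):
    """Return the parameter-name -> value mapping for a purely positional call."""
    return dict(inspect.signature(fn).bind(*args).arguments)

# Call site written for the version that still has `qv`.
stale_args = ("q", "k", "v", None, None, None, "o", 0.125, False)
# Call site after dropping the extra placeholder.
fixed_args = ("q", "k", "v", None, None, "o", 0.125, False)

if __name__ == "__main__":
    # With the stale call, the output tensor lands in `softmax_scale`
    # and `out` receives None, without any error being raised.
    print(bind_positional(fwd_without_qv, stale_args))
    # With the fixed call, each value reaches its intended parameter.
    print(bind_positional(fwd_without_qv, fixed_args))
```

Checking the binding this way before calling a compiled extension is one possible guard, since a misaligned positional argument may otherwise surface only as a type error or a wrong result inside the kernel.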
