Skip to content

Commit 3734d90

Browse files
committed
cc attention
1 parent 3226a86 commit 3734d90

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

lightllm/models/deepseek2/triton_kernel/context_flashattention_nopad_with_v.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -165,7 +165,7 @@ def context_attention_fwd_with_v(
165165
if q_nope.dtype == torch.float32:
166166
BLOCK = BLOCK // 4
167167

168-
sm_scale = softmax_scale
168+
sm_scale = softmax_scale * 1.4426950408889634
169169
batch, head = b_seq_len.shape[0], q_nope.shape[1]
170170

171171
grid = (batch, head, triton.cdiv(max_input_len, BLOCK)) # batch, head,

0 commit comments

Comments
 (0)