Skip to content

Commit ca7646e

Browse files
author
wangzaijun
committed
fix
1 parent 27940f1 commit ca7646e

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

lightllm/models/qwen2_vl/triton_kernel/rotary_pos_emb.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -29,7 +29,7 @@ def rotary_kernel(
2929
offs_d = tl.arange(0, BLOCK_D)
3030
d = pid_blk * BLOCK_D + offs_d
3131
mask = d < D
32-
for pid_l in tl.range(pid_l_start, total_len, step=tl.num_programs(axis=1)):
32+
for pid_l in tl.range(pid_l_start, total_len, step=tl.num_programs(axis=1), num_stages=3):
3333

3434
base = pid_l * stride_l + pid_h * stride_h
3535

0 commit comments

Comments
 (0)