Skip to content

Commit ccdb5b8

Browse files
authored
(temp workaround for Triton bug) (ROCm#306)
Triton's autotuner had a bug (fixed in triton-lang/triton@184fb53 ) that we happen to trigger - this PR is a temp. workaround before we patch/update Triton.
1 parent 68fdfc2 commit ccdb5b8

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm/attention/ops/triton_flash_attention.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -367,12 +367,12 @@ def attn_fwd(
367367
IS_CAUSAL: tl.constexpr,
368368
BLOCK_M: tl.constexpr,
369369
BLOCK_DMODEL: tl.constexpr,
370+
USE_FP8: tl.constexpr,
370371
BLOCK_N: tl.constexpr,
371372
PRE_LOAD_V: tl.constexpr,
372373
BIAS_TYPE: tl.constexpr,
373374
ENABLE_DROPOUT: tl.constexpr,
374375
RETURN_ENCODED_SOFTMAX: tl.constexpr,
375-
USE_FP8: tl.constexpr,
376376
):
377377
start_m = tl.program_id(0)
378378
off_h_q = tl.program_id(1)

0 commit comments

Comments
 (0)