Commit e306b03

misc: add debug flag for ncu profile (#288)
* misc: add new zhihu tech blog link
* misc: add debug flag for ncu profile
1 parent 193a5ca commit e306b03

1 file changed: +1 -0 lines changed

kernels/openai-triton/merge-attn-states/cuda_merge_attn_states.py

Lines changed: 1 addition & 0 deletions
@@ -15,6 +15,7 @@
  "-U__CUDA_NO_BFLOAT16_CONVERSIONS__",
  "--expt-relaxed-constexpr",
  "--expt-extended-lambda",
+ "--generate-line-info -g", # for NCU debugging
  # "--use_fast_math"
  ],
  extra_cflags=['-std=c++17'],
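
Note on the added flag: `--generate-line-info` asks nvcc to emit line-number information for device code, which lets Nsight Compute (NCU) correlate SASS with source lines, and `-g` adds host-side debug information. The commit enables both unconditionally. The sketch below is not part of this commit; it shows one possible way to make the flags opt-in. The helper names `build_cuda_cflags` and `ncu_debug_requested` and the environment variable `ENABLE_NCU_DEBUG` are hypothetical, and the base list contains only the three flags visible in this hunk (the real list may start earlier in the file).

```python
import os
import shlex

# Only the flags visible in the hunk above; earlier entries in the real
# list (if any) fall outside the diff context and are not reproduced here.
BASE_CUDA_CFLAGS = [
    "-U__CUDA_NO_BFLOAT16_CONVERSIONS__",
    "--expt-relaxed-constexpr",
    "--expt-extended-lambda",
]


def build_cuda_cflags(ncu_debug: bool) -> list:
    """Return the nvcc flag list, optionally with NCU-friendly debug flags."""
    flags = list(BASE_CUDA_CFLAGS)  # copy so the base list is never mutated
    if ncu_debug:
        # shlex.split turns the combined string into one list entry per flag,
        # which is also safe for build paths that pass flags as argv items.
        flags += shlex.split("--generate-line-info -g")
    return flags


def ncu_debug_requested(env=None) -> bool:
    """Hypothetical toggle: ENABLE_NCU_DEBUG=1 turns the debug flags on."""
    env = os.environ if env is None else env
    return env.get("ENABLE_NCU_DEBUG", "0") == "1"


if __name__ == "__main__":
    # The result would be passed as extra_cuda_cflags to the extension loader.
    print(build_cuda_cflags(ncu_debug_requested()))
```

With this shape, a regular build keeps the original flags, and a profiling session would be started with `ENABLE_NCU_DEBUG=1` set in the environment.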
