Shaoclee/compare ck #788

k50112113 · 2025-05-02T14:14:09Z

A new file fwd_decode_splitk_kvcache-tunning.py, duplicated from 06-attention-decode.py is added in order to benchmark/optimize forward decoder. This script comes with additional heuristics in handling large M edge cases. This script also provides an additional option to bypass using do_bench() function for runtime estimation in order to incorporate with rocprof to accurately estimate the runtime of short running kernels. This script is also ready to provide comparisons with CK once their kernels are implemented (so far, their function call points to an empty kernel, you can see that by doing pytest and you will see 1 failed case).

I am not making a trivial change, such as fixing a typo in a comment.
I have written a PR description following these
rules.
I have run pre-commit run --from-ref origin/main --to-ref HEAD.
Select one of the following.
- I have added tests.
  - /test for lit tests
  - /unittest for C++ tests
  - /python/test for end-to-end tests
- This PR does not need a test because this is only a script for tunning fwd decode kernel.
Select one of the following.
- I have not added any lit tests.
- The lit tests I have added follow these best practices,
  including the "tests should be minimal" section. (Usually running Python code
  and using the instructions it generates is not minimal.)

k50112113 added 9 commits April 29, 2025 21:13

add a custom do_bench func

7a48c50

add a script to compare with ck from 06-attention-decode

fc08c3b

add Hkv=64 case

f0b1451

clean up

9231d89

large M ad hoc BLOCK_M and BLOCK_N

3305647

clean up

9adaac5

clean up

31b4760

clean up

58174a1

clean up

0b8442c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Shaoclee/compare ck #788

Shaoclee/compare ck #788

Uh oh!

k50112113 commented May 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Shaoclee/compare ck #788

Are you sure you want to change the base?

Shaoclee/compare ck #788

Uh oh!

Conversation

k50112113 commented May 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants