Skip to content

feat: masked layout fp4 gemm using cute-dsl #1331

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 37 commits into from
Aug 13, 2025
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
37 commits
Select commit Hold shift + click to select a range
c908eec
init
yzh119 Jul 25, 2025
a9c9d8d
upd
yzh119 Jul 25, 2025
e8d7e94
Merge branch 'main' of github.com:flashinfer-ai/flashinfer into cute-…
yyihuang Jul 26, 2025
36f2073
draft python interface and test
yyihuang Jul 26, 2025
a65a0eb
upd default plan params
yyihuang Jul 26, 2025
05fec34
to fix: convert torch to cute tensor
yyihuang Jul 27, 2025
dd06f20
upd test tensor init
yyihuang Jul 28, 2025
e1edff7
stash todo
yyihuang Jul 28, 2025
ee41387
init dlpack utils
yyihuang Jul 30, 2025
ef10daa
upd dlpack init by shape and stride
yyihuang Jul 30, 2025
2963db5
ckpt: add tensor print
yyihuang Jul 30, 2025
8185aef
add test cases and IR print cleanup todo
yyihuang Jul 30, 2025
5a18102
ckpt: to fix float8 not supported by dlpack
yyihuang Jul 31, 2025
c8d9989
ckpt: workaround fp8 with int8, add sync between tests, cannot exceed…
yyihuang Aug 1, 2025
5a31f52
Merge branch 'main' of github.com:flashinfer-ai/flashinfer into cute-…
yyihuang Aug 4, 2025
93186eb
Merge branch 'main' of github.com:flashinfer-ai/flashinfer into cute-…
yyihuang Aug 7, 2025
adbf041
Merge remote-tracking branch 'origin/main' into cute-dsl-fp4-masked-l…
yzh119 Aug 7, 2025
73d5ca4
remove prints
yyihuang Aug 9, 2025
40f4edb
upd
yyihuang Aug 11, 2025
ffe730a
upd
yzh119 Aug 11, 2025
8c378b1
upd
yzh119 Aug 11, 2025
3020648
upd
yyihuang Aug 12, 2025
6cd81c5
Merge branch 'cute-dsl-fp4-masked-layout' of github.com:yzh119/flashi…
yyihuang Aug 12, 2025
5fd30df
upd
yyihuang Aug 12, 2025
5eaab4d
stash
yyihuang Aug 12, 2025
fce9c07
upd
yyihuang Aug 12, 2025
f847545
remove unused ut
yzh119 Aug 13, 2025
f33da08
upd
yyihuang Aug 13, 2025
058fa2d
upd
yyihuang Aug 13, 2025
0991de2
add test ref
yyihuang Aug 13, 2025
ac2b2b1
fix
yzh119 Aug 13, 2025
7db7cdd
Merge remote-tracking branch 'origin/main' into cute-dsl-fp4-masked-l…
yzh119 Aug 13, 2025
0184f90
lint
yzh119 Aug 13, 2025
344bb13
add cute_dsl blockscaled gemm to blackwell ut
yzh119 Aug 13, 2025
445bb46
fix
yzh119 Aug 13, 2025
470e4aa
fix random seed
yzh119 Aug 13, 2025
5868824
ruff
yzh119 Aug 13, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Loading