Skip to content
Merged
Show file tree
Hide file tree
Changes from 24 commits
Commits
Show all changes
37 commits
Select commit Hold shift + click to select a range
bf3ebc2
code drop
pggPL Dec 10, 2025
76293d4
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Dec 10, 2025
296d773
Add FP8 scale support and fix alignment for grouped GEMM
pggPL Dec 10, 2025
785df34
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Dec 10, 2025
1329b37
fix
pggPL Dec 10, 2025
47c58be
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Dec 10, 2025
a155a8a
Grouped GEMM: code cleanup and NULL C support
pggPL Dec 11, 2025
3b2fcdf
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Dec 11, 2025
5b0582b
Grouped GEMM: per-matrix alpha/beta support
pggPL Dec 11, 2025
101766b
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Dec 11, 2025
1167f75
Fix alpha/beta numel - use SimpleTensor::numel()
pggPL Dec 11, 2025
e4a80a3
Refactor: move grouped GEMM to separate file and cleanup API
pggPL Dec 19, 2025
db1e177
Merge branch 'main' into grouped_gemm
pggPL Dec 19, 2025
047a9f9
fix
pggPL Dec 19, 2025
c490e06
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Dec 19, 2025
59145cc
fix
pggPL Dec 22, 2025
77b422a
Require Blackwell (SM100) and cuBLAS 13.1+ for grouped GEMM
pggPL Dec 22, 2025
9c8158e
fix
pggPL Dec 22, 2025
ed540c8
fixes
pggPL Dec 30, 2025
359a9f5
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Dec 30, 2025
a702426
fixes
pggPL Dec 30, 2025
fb027d0
fix
pggPL Dec 30, 2025
ae85415
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Dec 30, 2025
43f7e60
Update transformer_engine/common/gemm/config.h
pggPL Jan 7, 2026
30468af
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jan 7, 2026
2ccaee5
changed
pggPL Jan 7, 2026
bd8fa30
suggestions
pggPL Jan 7, 2026
f0df80e
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jan 7, 2026
301874d
fix
pggPL Jan 7, 2026
6c5d969
fix
pggPL Jan 9, 2026
c91cd8f
fix
pggPL Jan 9, 2026
0319e79
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jan 9, 2026
a14d5bc
refactored hopper tensor selection
pggPL Jan 13, 2026
ee8f3ef
Merge remote-tracking branch 'origin/grouped_gemm' into grouped_gemm
pggPL Jan 13, 2026
c5c2fbf
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jan 13, 2026
1a58653
Merge remote-tracking branch 'upstream/main' into grouped_gemm
pggPL Jan 22, 2026
3047134
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Jan 22, 2026
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions tests/cpp/operator/CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -30,6 +30,7 @@ add_executable(test_operator
test_causal_softmax.cu
test_swizzle.cu
test_swap_first_dims.cu
test_grouped_gemm.cu
../test_common.cu)

# Find required packages
Expand Down
Loading
Loading