Skip to content

[hipblaslt] Update gfx12 F8BS TN SAB Vector Gridbase yaml for ROCm 7.0#1057

Merged
vamovsik merged 7 commits intorelease/rocm-rel-7.0from
users/hyoon1/navi4_sabv_7.0
Sep 22, 2025
Merged

[hipblaslt] Update gfx12 F8BS TN SAB Vector Gridbase yaml for ROCm 7.0#1057
vamovsik merged 7 commits intorelease/rocm-rel-7.0from
users/hyoon1/navi4_sabv_7.0

Conversation

@hyoon1
Copy link
Contributor

@hyoon1 hyoon1 commented Aug 5, 2025

Following up of #1054 ROCm/hipBLASLt#1941
These configs are required to support DeepSeek-R1-Distill FP8 models on gfx12.
Previously, it was only applied to release-6.4 branch. To keep further support for these models, minimized kernels to add, and it generally performs similar or better than before.

Conducted hipblaslt-test on both gfx1200 and gfx1201, and it passed all tests.

[----------] Global test environment tear-down
[==========] 39066 tests from 11 test suites ran. (364139 ms total)
[ PASSED ] 39066 tests.
hipBLASLt version: 100100
hipBLASLt git version: 879361da5cbf79495ff43db977eada9cfc7db360-dirty
command line: hipblaslt-test

@hyoon1 hyoon1 force-pushed the users/hyoon1/navi4_sabv_7.0 branch from 1be2387 to 0202d3f Compare August 5, 2025 08:22
@hyoon1 hyoon1 requested a review from cmingch August 7, 2025 04:02
@fjankovi
Copy link
Contributor

@ROCm/hipblaslt-reviewers Can we get another review/approval?

@bnemanich
Copy link
Contributor

Please re-run tests and verify that they all pass. Right now the precheckin tests are failing on gfx 12, which this PR is changing.

@vamovsik vamovsik merged commit 738b6d2 into release/rocm-rel-7.0 Sep 22, 2025
6 of 9 checks passed
@vamovsik vamovsik deleted the users/hyoon1/navi4_sabv_7.0 branch September 22, 2025 13:51
assistant-librarian bot pushed a commit to ROCm/hipBLASLt that referenced this pull request Sep 22, 2025
[hipblaslt] Update gfx12 F8BS TN SAB Vector Gridbase yaml for
 ROCm 7.0 (#1057)

Following up of ROCm/rocm-libraries#1054
#1941
These configs are required to support DeepSeek-R1-Distill FP8 models on
gfx12.
Previously, it was only applied to release-6.4 branch. To keep further
support for these models, minimized kernels to add, and it generally
performs similar or better than before.

Conducted hipblaslt-test on both gfx1200 and gfx1201, and it passed all
tests.

[----------] Global test environment tear-down
[==========] 39066 tests from 11 test suites ran. (364139 ms total)
[ PASSED ] 39066 tests.
hipBLASLt version: 100100
hipBLASLt git version:
ROCm/rocm-libraries@879361da5cbf79495ff43db977eada9cfc7db360-dirty
command line: hipblaslt-test

Co-authored-by: Val Movsik <160653499+vamovsik@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants