Skip to content

Conversation

@jataylo
Copy link

@jataylo jataylo commented Oct 17, 2025

These changes are currently in progress of being upstreamed. Bring into release 2.9 for customer model perf improvement

jataylo and others added 23 commits October 16, 2025 14:45
(cherry picked from commit 5d4455f)
(cherry picked from commit 2fc7525)
(cherry picked from commit d5c71f0)
(cherry picked from commit 262a33e)
(cherry picked from commit 6c3f540)
(cherry picked from commit 9f19754)
(cherry picked from commit dc58de9)
removed the (erroneous?) check that disables autotuning for pointwise
kernels

(cherry picked from commit e3b8e25)
(cherry picked from commit 10af207)
Added two nice grid configs for the 2d pointwise kernel cases for WRT5
workload.
Confirmed that they were picked up when using max autotune.

(cherry picked from commit f1eac49)
(cherry picked from commit 2e79001)
This config improves the performance of a 1D pointwise kernel by 20% as
measured on MI350.

(cherry picked from commit a7bac0a)
(cherry picked from commit 0bdb796)
(cherry picked from commit 16e8266)
(cherry picked from commit dfc1579)
(cherry picked from commit 666e81b)
(cherry picked from commit f97c7a9)
(cherry picked from commit db49466)
(cherry picked from commit 6e9b4ee)
(cherry picked from commit 0c52d01)
(cherry picked from commit dd990a3)
(cherry picked from commit 291ee06)
Reorganized slightly the adding of hard-coded autotuning configs.
Fixed wrt1 configs.
Added wrt2 & 3 configs.

(cherry picked from commit e3e9a17)
@rocm-repo-management-api
Copy link

rocm-repo-management-api bot commented Oct 17, 2025

Jenkins build for 6534df0b317e7780b2119fc780869a872126c9b1 commit finished as FAILURE
Links: Blue Ocean view / Build artifacts

@pruthvistony pruthvistony merged commit 1735e04 into ROCm:release/2.9 Oct 17, 2025
1 of 3 checks passed
jeffdaily pushed a commit that referenced this pull request Nov 17, 2025
These changes are currently in progress of being upstreamed. Bring into
release 2.9 for customer model perf improvement

---------

Co-authored-by: Nichols A. Romero <[email protected]>
Co-authored-by: Sampsa Riikonen <[email protected]>
Co-authored-by: Nichols A. Romero <[email protected]>
Co-authored-by: AmdSampsa <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants