jataylo and others added 11 commits December 5, 2024 12:05
Enable new arch target since backend support has been added.

(cherry picked from commit 134b3eb)
triton-lang#5064)

Bump LLVM to include a loop unroller fix:
llvm/llvm-project#114573. This is needed for
subsequent loop unroller upstreaming work.

(cherry picked from commit 3c296ab)
This pulls in llvm/llvm-project@bd9145c8c213
to enable ASan on the AMD backend.

(cherry picked from commit 0bd30a2)
This pulls in the AMDGPU backend support for the
gfx950 target.

We need to fix the rewrites in `Combine.td` given that
llvm/llvm-project#112700 adds
a new attribute for denorm mode for `arith.addf`.

---------

Co-authored-by: Lei Zhang <[email protected]>
(cherry picked from commit 1d5e9a2)
For 16-bit float operands of tt::AtomicRMWOp, construct
a single LLVM::AtomicRMWOp that operates on a vector of elements.
This allows packed intrinsics to be generated, processing two
elements at once.
Added a lit test for the vectorized f16 case.

(cherry picked from commit 78c8054)
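
To illustrate the packing idea in the commit above, here is a minimal Python sketch of a single read-modify-write that updates two f16 values stored in one 32-bit word. It is only a model of the concept, not the Triton lowering or the actual intrinsic; the helper names `pack_f16x2`, `unpack_f16x2`, and `packed_atomic_fadd` are hypothetical, and real atomicity is not modeled.

```python
import struct
from typing import List, Tuple


def pack_f16x2(lo: float, hi: float) -> int:
    """Pack two half-precision floats into one 32-bit word (hypothetical helper)."""
    return int.from_bytes(struct.pack("<2e", lo, hi), "little")


def unpack_f16x2(word: int) -> Tuple[float, float]:
    """Unpack a 32-bit word into its two half-precision lanes."""
    return struct.unpack("<2e", word.to_bytes(4, "little"))


def packed_atomic_fadd(memory: List[int], index: int, operand: int) -> int:
    """Model one read-modify-write on a word holding two f16 lanes.

    Returns the old word, as an atomic RMW would. This is a plain Python
    model; it does not provide any real atomicity guarantee.
    """
    old = memory[index]
    a0, a1 = unpack_f16x2(old)
    b0, b1 = unpack_f16x2(operand)
    # Both lanes are updated by the same operation, instead of issuing
    # one operation per f16 element.
    memory[index] = pack_f16x2(a0 + b0, a1 + b1)
    return old


memory = [pack_f16x2(1.0, 2.0)]
old = packed_atomic_fadd(memory, 0, pack_f16x2(0.5, 0.25))
print(unpack_f16x2(old), unpack_f16x2(memory[0]))
```

The point of the sketch is only that one operation covers two elements; how the backend selects the packed instruction is not shown here.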
TritonAMDGPUTransforms now depends on it.

(cherry picked from commit 0b443ce)
This commit adds support for warp-level reduction
with DPP instructions, which can improve performance.

See https://gpuopen.com/learn/amd-gcn-assembly-cross-lane-operations/

(cherry picked from commit 21119e3)
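
As a rough illustration of what a warp-level reduction built on cross-lane operations does, the following Python sketch simulates a generic butterfly (XOR) reduction over the lanes of one wavefront. It does not model the actual DPP controls or the code this commit generates; the function name `butterfly_reduce` and the 64-lane width in the usage line are assumptions made for the example.

```python
from operator import add
from typing import Callable, List, Sequence


def butterfly_reduce(
    lanes: Sequence[float], combine: Callable[[float, float], float]
) -> List[float]:
    """Simulate a cross-lane butterfly reduction (hypothetical helper).

    At each step, lane i combines its value with the value held by
    lane i ^ offset. After log2(n) steps every lane holds the result.
    """
    n = len(lanes)
    if n == 0 or n & (n - 1):
        raise ValueError("lane count must be a nonzero power of two")
    values = list(lanes)
    offset = n // 2
    while offset >= 1:
        # All lanes read their partner's old value, then update together,
        # mimicking a lockstep cross-lane exchange.
        values = [combine(values[i], values[i ^ offset]) for i in range(n)]
        offset //= 2
    return values


# Assumed 64-lane wavefront; each lane starts with its own lane id.
result = butterfly_reduce(list(range(64)), add)
print(result[0])
```

The exchange happens between registers of neighboring lanes, so no shared-memory round trip is needed inside the wavefront, which is where the performance benefit mentioned in the commit would come from.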
antiagainst merged commit 7e401df into triton-lang:rc/3.2.x on Dec 5, 2024
6 of 7 checks passed
jataylo added a commit to jataylo/triton that referenced this pull request Dec 11, 2024
jataylo added a commit to jataylo/triton that referenced this pull request Dec 12, 2024
antiagainst added a commit that referenced this pull request Dec 13, 2024
This PR brings in required LLVM bumps and additional targets for gfx950
support.
- #5040
- #5064
- #5180
- #5242
- #5392

Note this PR reverts the last two PRs to only focus on the LLVM upgrade
- #5347 
- #5191

---------

Co-authored-by: peterbell10 <[email protected]>
Co-authored-by: Hongtao Yu <[email protected]>
Co-authored-by: Lei Zhang <[email protected]>
Co-authored-by: Jungwook Park <[email protected]>
jataylo added a commit to jataylo/triton that referenced this pull request Dec 18, 2024
This PR brings in required LLVM bumps and additional targets for gfx950
support.
- triton-lang#5040
- triton-lang#5064
- triton-lang#5180
- triton-lang#5242
- triton-lang#5392

Note this PR reverts the last two PRs to only focus on the LLVM upgrade
- triton-lang#5347
- triton-lang#5191

---------

Co-authored-by: peterbell10 <[email protected]>
Co-authored-by: Hongtao Yu <[email protected]>
Co-authored-by: Lei Zhang <[email protected]>
Co-authored-by: Jungwook Park <[email protected]>
(cherry picked from commit f11c5ba)
jataylo added a commit to jataylo/triton that referenced this pull request Dec 18, 2024
Reverts triton-lang#5191 due to MLIR errors in PyTorch unit tests

Smaller set of cherry picks:
- triton-lang#5308 (and previous LLVM upgrades)
- triton-lang#5281
- triton-lang#4925
- triton-lang#5053
- triton-lang#5019
- triton-lang#4998

---------

Co-authored-by: Jungwook Park <[email protected]>
Co-authored-by: peterbell10 <[email protected]>
Co-authored-by: Hongtao Yu <[email protected]>
Co-authored-by: Lei Zhang <[email protected]>
Co-authored-by: Ilya V <[email protected]>
Co-authored-by: Kyle Wang <[email protected]>
(cherry picked from commit 7e401df)
atalman pushed a commit that referenced this pull request Dec 19, 2024
This PR brings in required LLVM bumps and additional targets for gfx950
support.
- #5040
- #5064
- #5180
- #5242
- #5392

Reverts:
- #5347 
- #5191
bertmaher pushed a commit that referenced this pull request Dec 19, 2024
Reverts #5191 due to MLIR errors in PyTorch unit tests

Smaller set of cherry picks:
- #5308 (and previous LLVM upgrades)
- #5281 
- #4925 
- #5053 
- #5019 
- #4998

---------

Co-authored-by: Jungwook Park <[email protected]>
Co-authored-by: peterbell10 <[email protected]>
Co-authored-by: Hongtao Yu <[email protected]>
Co-authored-by: Lei Zhang <[email protected]>
Co-authored-by: Ilya V <[email protected]>
Co-authored-by: Kyle Wang <[email protected]>
bertmaher pushed a commit that referenced this pull request Dec 19, 2024
This PR brings in required LLVM bumps and additional targets for gfx950
support.
- #5040
- #5064
- #5180
- #5242
- #5392

Note this PR reverts the last two PRs to only focus on the LLVM upgrade
- #5347 
- #5191

---------

Co-authored-by: peterbell10 <[email protected]>
Co-authored-by: Hongtao Yu <[email protected]>
Co-authored-by: Lei Zhang <[email protected]>
Co-authored-by: Jungwook Park <[email protected]>