
Conversation


whitneywhtsang commented Dec 10, 2024

This PR changes the Triton base from 89c0b0a to 4d2e9e5 (Dec 9).
Pass rate: 99.84%->99.82% (#2980)

Please do not squash and merge this PR.

antiagainst and others added 11 commits December 9, 2024 00:32
…5362)

This relands triton-lang/triton#5139:

Adding a shortcut case for fp8 MFMA to dot operand layout conversion
that avoids using shared memory, to speed up FP8 attention kernels.

---------

Co-authored-by: ilia-cher <[email protected]>
…ls on (#5286)

This pull request updates all tutorials except `09-persistent-matmul.py`, which relies heavily on CUDA-specific functions.

---------

Signed-off-by: Anatoly Myachev <[email protected]>
All deleted libraries are either in `${triton_libs}` or in
`${conversion_libs}`.

Signed-off-by: Anatoly Myachev <[email protected]>
`local_load` should be in the same stage as the `subview` that it uses.
1. Fix the problem where [m, k, n] instead of [m, n, k] is returned on the NVIDIA backend
2. Check both int8 and float8
3. Add a new compiler error test
4. Fix the dtype check in the AMD backend
whitneywhtsang self-assigned this Dec 10, 2024
whitneywhtsang marked this pull request as ready for review December 10, 2024 05:22
whitneywhtsang merged commit 3f4fdd1 into main Dec 10, 2024
6 checks passed
whitneywhtsang deleted the whitneywhtsang/merge branch December 10, 2024 06:09