
Conversation

@etiotto
Contributor

@etiotto etiotto commented Nov 22, 2024

This PR decomposes a tt.dot_scaled operation into a tt.dot operation where the scaled operand (e.g., A) is upcast using the triton_gpu.upcast_mxfp operation.

Note: The upcast_mxfp operation is not lowered to LLVM IR in this PR.
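
For readers new to this kind of transform, a minimal sketch of the decomposition as an MLIR rewrite pattern follows. Every op class and accessor name in it (DotScaledOp, UpcastMXFPOp, DotOp, getAScale, getAElemType, ...) is an assumption made for illustration; this is not the PR's actual code.

// Hedged sketch only: op classes and their accessors are hypothetical.
#include "mlir/IR/PatternMatch.h"

namespace {
struct DecomposeScaledDot : mlir::OpRewritePattern<DotScaledOp> {
  using OpRewritePattern::OpRewritePattern;

  mlir::LogicalResult
  matchAndRewrite(DotScaledOp op,
                  mlir::PatternRewriter &rewriter) const override {
    // Apply the per-block scales to the scaled operand (A here) by
    // upcasting it to a plain floating-point tensor...
    mlir::Value a = rewriter.create<UpcastMXFPOp>(
        op.getLoc(), op.getA(), op.getAScale(), op.getAElemType());
    // ...then replace the scaled dot with an ordinary tt.dot.
    rewriter.replaceOpWithNewOp<DotOp>(op, op.getD().getType(), a,
                                       op.getB(), op.getC());
    return mlir::success();
  }
};
} // anonymous namespace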

@etiotto etiotto self-assigned this Nov 22, 2024
@etiotto etiotto linked an issue Nov 22, 2024 that may be closed by this pull request
@etiotto etiotto marked this pull request as ready for review November 25, 2024 14:25
@anmyachev
Contributor

@etiotto in order to test these changes, we need to unskip test_scaled_dot:

if is_xpu():
    pytest.skip("scaled_dot isn't supported on XPU")

Did you plan to do this here?

@etiotto
Contributor Author

etiotto commented Nov 26, 2024

@etiotto in order to test these changes, we need to unskip test_scaled_dot:

if is_xpu():
    pytest.skip("scaled_dot isn't supported on XPU")

Did you plan to do this here?

Yes, we need to do that; however, the code is not yet fully functional because the lowering for the triton_gpu.upcast_mxfp operation is not working yet. I will remove that part of the PR and have this PR deal only with decomposing the dot_scaled operation.

@etiotto etiotto marked this pull request as draft November 26, 2024 00:38
Signed-off-by: Tiotto, Ettore <[email protected]>
Signed-off-by: Tiotto, Ettore <[email protected]>
Signed-off-by: Tiotto, Ettore <[email protected]>
@etiotto etiotto marked this pull request as ready for review November 26, 2024 21:15
@victor-eds
Contributor

PR approach LGTM. Just some NITs.

@whitneywhtsang
Contributor

PR approach LGTM. Just some NITs.

@victor-eds Maybe you forgot to submit the NITs?

Contributor

@leonling-ll leonling-ll left a comment


LGTM.

newShape[kIdx] *= 2;
retTy = RankedTensorType::get(newShape, FloatType::getBF16(ctx),
                              newVEncoding);
Type elemType = FloatType::getBF16(ctx);
Contributor


Can we define this inside the if statement below?

Contributor Author


I'd rather have it here because elemType is used after the if/else at line 147.
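
To illustrate the scoping argument with a generic C++ sketch (names such as useNewEncoding are made up; this is not the code under review): a value consumed after an if/else must be declared before it.

// Generic illustration of why elemType is hoisted above the if/else.
Type elemType = FloatType::getBF16(ctx); // needed past the if/else
RankedTensorType retTy;
if (useNewEncoding) {
  retTy = RankedTensorType::get(newShape, elemType, newVEncoding);
} else {
  retTy = RankedTensorType::get(newShape, elemType, oldEncoding);
}
// elemType is referenced again below this point, so it cannot be
// scoped inside either branch.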

Comment on lines +6 to +8
namespace mlir {
class ModuleOp;
}
Contributor


I'd bet we don't need this

Contributor Author


ModuleOp is used in "intel/include/Dialect/TritonIntelGPU/IR/TritonIntelGPUAttrDefs.h.inc" now:

static DPASCapability getDPASCapability(mlir::ModuleOp mod);

That is why I put the forward declaration here.
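
As a general C++ note (not specific to this repository): an incomplete type is sufficient for a function declaration, even when the parameter is passed by value; the complete definition is only required where the function is defined or called. A hedged sketch:

// Standard C++: a forward-declared (incomplete) type may appear in a
// function *declaration*, even by value.
namespace mlir {
class ModuleOp; // incomplete type here
}

struct DPASCapability {};

// Legal with only the forward declaration above; the full definition
// of mlir::ModuleOp is needed only at the definition and call sites.
DPASCapability getDPASCapability(mlir::ModuleOp mod);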

Signed-off-by: Tiotto, Ettore <[email protected]>
@etiotto etiotto enabled auto-merge (squash) November 29, 2024 17:18
@etiotto etiotto merged commit 0c70ca3 into main Nov 29, 2024
5 checks passed
@etiotto etiotto deleted the etiotto.add_support_for_scaled_dot branch November 29, 2024 17:48


Development

Successfully merging this pull request may close these issues.

Implement support for the tt.dot_scaled operation on XPU
