[XPU][TritonGPUToLLVM] Use `llvm.func` attributes to express kernels ND-ranges #2770

victor-eds · 2024-11-20T11:47:33Z

Use llvm.func intel_reqd_sub_group_size to express sub-group size instead of triton_gen attributes that are later translated.

Replace triton_gen.max_work_group_size value type with dense array.

…ND-ranges Use `llvm.func` `reqd_work_group_size` and `intel_reqd_sub_group_size` to express ND-range dimensions instead of `triton_gen` attributes that are later translated. Signed-off-by: victor-eds <[email protected]>

third_party/intel/lib/TritonIntelGPUToLLVM/PipelineManager.h

chengjunlu · 2024-11-20T12:19:08Z

The changes LGTM. But I am not sure about the reason of the old code. Let @whitneywhtsang to approve.

whitneywhtsang · 2024-11-20T15:19:09Z

There are some test_reduce failures, maybe due to changing from max_work_group_size to reqd_work_group_size?

victor-eds · 2024-11-20T16:21:12Z

There are some test_reduce failures, maybe due to changing from max_work_group_size to reqd_work_group_size?

Interesting. I'll look into that tomorrow.

whitneywhtsang · 2024-11-20T22:44:16Z

There are some test_reduce failures, maybe due to changing from max_work_group_size to reqd_work_group_size?

Interesting. I'll look into that tomorrow.

Confirmed it is due to changing from max_work_group_size to reqd_work_group_size. I modified main branch to use reqd_work_group_size, and I can observe the test_reduce failures.

victor-eds · 2024-11-21T09:41:12Z

Confirmed it is due to changing from max_work_group_size to reqd_work_group_size. I modified main branch to use reqd_work_group_size, and I can observe the test_reduce failures.

Interesting. I'll take a look.

victor-eds · 2024-11-21T12:55:47Z

Interesting. I'll take a look.

@whitneywhtsang @etiotto Apparently not setting anything at all fixes crashes. I'd go with this for now, get this merged to get going and open an investigation ticket to tackle ASAP. The reqd_work_group_size attribute may be helpful for codegen and improve performance after all and I'd say we want to use that.

My guess is we're modifying the number of warps or warp size at some point during this lowering process and this mismatch leads to crashes.

Does this course of action sound good?

victor-eds · 2024-11-22T09:26:10Z

I'll restore back max_work_group_size and file a ticket to use reqd_work_group_size when we find out what's the matter here

victor-eds · 2024-11-22T09:56:16Z

I will keep the dense array specification for max_work_group_size as this has no impact, I had already done it and it will make the change to llvm.func's reqd_work_group_size easier.

third_party/intel/lib/Target/LLVMIR/Dialect/TritonGEN/TritonGENToLLVMIRTranslation.cpp

victor-eds · 2024-11-26T12:08:09Z

@etiotto are we OK with this now that we're keeping the max_work_group_size code?

etiotto · 2024-11-26T13:28:59Z

@etiotto are we OK with this now that we're keeping the max_work_group_size code?

yes

victor-eds requested review from a team, chengjunlu and whitneywhtsang November 20, 2024 11:47

victor-eds self-assigned this Nov 20, 2024

victor-eds commented Nov 20, 2024

View reviewed changes

third_party/intel/lib/TritonIntelGPUToLLVM/PipelineManager.h Show resolved Hide resolved

Merge branch 'main' into use-llvm-func-attrs

27f8636

victor-eds added 2 commits November 20, 2024 12:44

Drop old tests

824fbc1

Do not assert

7c192f6

Merge branch 'main' into use-llvm-func-attrs

a8dcad0

Add back attribute

f0a2fcc

victor-eds force-pushed the use-llvm-func-attrs branch from 7cb8d7d to f0a2fcc Compare November 22, 2024 09:54

victor-eds requested a review from etiotto November 22, 2024 09:55

whitneywhtsang approved these changes Nov 22, 2024

View reviewed changes

third_party/intel/lib/Target/LLVMIR/Dialect/TritonGEN/TritonGENToLLVMIRTranslation.cpp Outdated Show resolved Hide resolved

victor-eds mentioned this pull request Nov 22, 2024

Use llvm.func's reqd_work_group_size to specify static local size #2798

Closed

victor-eds added 2 commits November 22, 2024 10:04

Drop comment

7907234

Fix attribute type

bf8fc32

etiotto merged commit 553b997 into intel:main Nov 26, 2024
5 checks passed

victor-eds deleted the use-llvm-func-attrs branch November 26, 2024 14:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[XPU][TritonGPUToLLVM] Use `llvm.func` attributes to express kernels ND-ranges #2770

[XPU][TritonGPUToLLVM] Use `llvm.func` attributes to express kernels ND-ranges #2770

Uh oh!

victor-eds commented Nov 20, 2024 •

edited

Loading

Uh oh!

Uh oh!

chengjunlu commented Nov 20, 2024

Uh oh!

whitneywhtsang commented Nov 20, 2024

Uh oh!

victor-eds commented Nov 20, 2024

Uh oh!

whitneywhtsang commented Nov 20, 2024

Uh oh!

victor-eds commented Nov 21, 2024

Uh oh!

victor-eds commented Nov 21, 2024

Uh oh!

victor-eds commented Nov 22, 2024

Uh oh!

victor-eds commented Nov 22, 2024

Uh oh!

Uh oh!

victor-eds commented Nov 26, 2024

Uh oh!

etiotto commented Nov 26, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[XPU][TritonGPUToLLVM] Use llvm.func attributes to express kernels ND-ranges #2770

[XPU][TritonGPUToLLVM] Use llvm.func attributes to express kernels ND-ranges #2770

Uh oh!

Conversation

victor-eds commented Nov 20, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

chengjunlu commented Nov 20, 2024

Uh oh!

whitneywhtsang commented Nov 20, 2024

Uh oh!

victor-eds commented Nov 20, 2024

Uh oh!

whitneywhtsang commented Nov 20, 2024

Uh oh!

victor-eds commented Nov 21, 2024

Uh oh!

victor-eds commented Nov 21, 2024

Uh oh!

victor-eds commented Nov 22, 2024

Uh oh!

victor-eds commented Nov 22, 2024

Uh oh!

Uh oh!

victor-eds commented Nov 26, 2024

Uh oh!

etiotto commented Nov 26, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[XPU][TritonGPUToLLVM] Use `llvm.func` attributes to express kernels ND-ranges #2770

[XPU][TritonGPUToLLVM] Use `llvm.func` attributes to express kernels ND-ranges #2770

victor-eds commented Nov 20, 2024 •

edited

Loading