Siwasaki/pr/loadstorescalar by shintaro-iwasaki · Pull Request #16 · shintaro-iwasaki/triton

shintaro-iwasaki · 2022-10-17T20:54:42Z

No description provided.

WIP but should work int t…he cases we need so far

…ang#785) Correct the Load/Store Op's vector size with the mask's alignment correctly considered. Some cases: ```mlir // num_warp = 2 // block_size = 128 func @vecadd_mask_align_16(%a_ptr: !tt.ptr<f32> {tt.divisibility = 16 : i32}, %b_ptr: !tt.ptr<f32> {tt.divisibility = 16 : i32}, %out_ptr: !tt.ptr<f32> {tt.divisibility = 16 : i32}, %n_elements: i32 {tt.divisibility = 16 : i32}) { // mask = make_range(128) < n_element } ``` This should get the vec=2 `ld`/`st` instructions. While the following example ```mlir // num_warp = 2 // block_size = 128 func @vecadd_mask_align_16(%a_ptr: !tt.ptr<f32> {tt.divisibility = 16 : i32}, %b_ptr: !tt.ptr<f32> {tt.divisibility = 16 : i32}, %out_ptr: !tt.ptr<f32> {tt.divisibility = 16 : i32}, %n_elements: i32) { // mask = make_range(128) < n_element } ``` it should get the vec=1 `ld`/`st` instructions.

shintaro-iwasaki and others added 3 commits October 13, 2022 18:53

[Triton-IR] Fix LoadOp definition (triton-lang#771) (triton-lang#777)

5898352

[Triton-MLIR] fix a tiny bug in coalesce pass (triton-lang#782)

e948a61

[OPTIMIZER] Updated TritonGPU-combine pass (triton-lang#784)

38a8066

WIP but should work int t…he cases we need so far

shintaro-iwasaki force-pushed the siwasaki/pr/loadstorescalar branch 6 times, most recently from 9770dc1 to a95a4f5 Compare October 17, 2022 23:24

Superjomn and others added 4 commits October 18, 2022 11:43

[TritonIR] Update Load/StoreOps to support scalar values

6f1611f

[TritonIR] tl.reduce returns a scalar value if input is a 1D tensor

db1c304

[Tests] Add tests to check type inference

07c440e

shintaro-iwasaki force-pushed the siwasaki/pr/loadstorescalar branch from a95a4f5 to 07c440e Compare October 18, 2022 15:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Siwasaki/pr/loadstorescalar#16

Siwasaki/pr/loadstorescalar#16
shintaro-iwasaki wants to merge 7 commits intotriton-mlirfrom
siwasaki/pr/loadstorescalar

shintaro-iwasaki commented Oct 17, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

shintaro-iwasaki commented Oct 17, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants