[SYCL][COMPAT][cuda] Add "ptr_to_integer" syclcompat functions. #14283

JackAKirk · 2024-06-25T14:26:48Z

Add "ptr_to_integer" (generic address space to .shared) syclcompat functions.

These functions are commonly required in optimized libraries that use inline PTX. The standard naming convention of removing "__" from the corresponding CUDA builtins has been applied. See the README and the accompanying test-e2e for example usage.

sycl/include/syclcompat/memory.hpp

sycl/doc/syclcompat/README.md

sycl/test-e2e/syclcompat/memory/local_memory_ptr_to_integer.cpp

joeatodd

Since these functions do the same thing aside from casting to int/size_t, can we not implement them as a single templated function?

sycl/doc/syclcompat/README.md

JackAKirk · 2024-06-27T11:36:13Z

> Since these functions do the same thing aside from casting to int/size_t, can we not implement them as a single templated function?

Uncertainty around this is the reason I put them in experimental. It's a bit messy, since the CUDA versions of these APIs require different CUDA toolkit versions (10.1 for the uint32_t one and 11 for the size_t one, I think), but this does not affect the translated syclcompat versions. I was just told to translate them this way so that the CUTLASS SYCL path can have APIs corresponding to the CUDA runtime path. I don't think I really have the context to make a decision beyond this. It is probably best to ask @aacostadiaz.

JackAKirk · 2024-07-09T09:57:18Z

> Since these functions do the same thing aside from casting to int/size_t, can we not implement them as a single templated function?

@aacostadiaz wants them to be two separate functions, so I'll leave it as it is.

JackAKirk · 2024-07-10T17:22:48Z

@Alcpz @joeatodd Any more reviews for this?

Thanks

joeatodd

As discussed offline, these should be a single function with a template parameter describing the return type.
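
For readers following the design point, here is a minimal host-only sketch of what "a single function with a template parameter describing the return type" can look like. It is not the syclcompat implementation: `ptr_to_int_sketch` and `ptr_to_int_sketch_demo` are hypothetical names, the sketch does no address-space conversion, and it only assumes standard C++.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <type_traits>

// Hypothetical host-only stand-in, NOT the syclcompat implementation.
// The caller picks the integer width through T, so one template replaces
// a pair of functions that differ only in their return type.
template <typename T>
T ptr_to_int_sketch(const void *ptr) {
  static_assert(std::is_integral<T>::value, "T must be an integral type");
  // Convert to an integer wide enough for any pointer, then cast to T.
  // Casting to a narrower T keeps only the low bits.
  return static_cast<T>(reinterpret_cast<std::uintptr_t>(ptr));
}

// Usage: the same call site serves both widths.
inline void ptr_to_int_sketch_demo() {
  int value = 0;
  const std::uintptr_t raw = reinterpret_cast<std::uintptr_t>(&value);
  assert(ptr_to_int_sketch<std::size_t>(&value) ==
         static_cast<std::size_t>(raw));
  assert(ptr_to_int_sketch<std::uint32_t>(&value) ==
         static_cast<std::uint32_t>(raw));
}
```

The trade-off discussed in this thread is visible here: one template keeps the API surface small, while two named functions would mirror the CUDA builtins one-to-one.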

npmiller · 2024-08-29T14:47:10Z

Closing this after further discussions offline

joeatodd

Thanks for this @JackAKirk. Just a couple of formatting requests. Cheers!

sycl/include/syclcompat/memory.hpp

sycl/doc/syclcompat/README.md

joeatodd · 2024-10-08T09:22:23Z

sycl/doc/syclcompat/README.md

+``` c++
+  half *data = syclcompat::local_mem<half[NUM_ELEMENTS]>();
+  // ...
+  // ...
+  T addr =
+              syclcompat::ptr_to_int<T>(reinterpret_cast<char *>(data) + (id % 8) * 16);
+
+uint32_t fragment;
+#if defined(__NVPTX__)
+  asm volatile("ldmatrix.sync.aligned.m8n8.x1.shared.b16 {%0}, [%1];\n"
+                : "=r"(fragment)
+                : "r"(addr));
+#endif
+```


Could you fix the formatting of this code section? Thanks

I did clang-format it already using dpc++ format.

Possibly it's not running on code sections in markdown? I'd expect uint32_t fragment to align with T addr on the line above? And the line split on lines 975-976 looks pretty wacky? If I dump this code into a cpp file and autoformat this, I get:

    half *data = syclcompat::local_mem<half[NUM_ELEMENTS]>();
    // ...
    // ...
    T addr = syclcompat::ptr_to_int<T>(reinterpret_cast<char *>(data) +
                                       (id % 8) * 16);
    uint32_t fragment;
    #if defined(__NVPTX__)
    asm volatile("ldmatrix.sync.aligned.m8n8.x1.shared.b16 {%0}, [%1];\n"
                 : "=r"(fragment)
                 : "r"(addr));
    #endif

I can see if that passes clang-format (in the test where it is used). The existing version passes the clang-format on the clang-format CI.

I don't think clang-format runs on the README tbh.

Yeah, I use the same code in the test-e2e

I've updated the README with this suggestion now
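
As a side note on the snippet under review, the `(id % 8) * 16` expression can be read as selecting one 16-byte row per work-item. The sketch below is only an illustration of that arithmetic on the host. The constant and function names are hypothetical, and the "eight rows of eight 16-bit elements" reading is my assumption based on the `.m8n8` and `.b16` qualifiers in the instruction name.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical names; the constants encode my reading of ".m8n8 ... .b16".
constexpr std::size_t kRows = 8;      // eight rows in an 8x8 tile
constexpr std::size_t kRowBytes = 16; // eight 16-bit elements per row

// Byte offset produced by `(id % 8) * 16` for a given work-item id.
constexpr std::size_t row_byte_offset(std::size_t id) {
  return (id % kRows) * kRowBytes;
}

// Ids that differ by a multiple of 8 map to the same row.
static_assert(row_byte_offset(3) == row_byte_offset(11), "wraps every 8 ids");

inline void row_byte_offset_demo() {
  assert(row_byte_offset(0) == 0);
  assert(row_byte_offset(7) == 112); // last row starts at 7 * 16 bytes
}
```

The cast to `char *` in the reviewed snippet is what makes the offset count bytes rather than `half` elements.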

JackAKirk · 2024-10-09T13:58:37Z

@Alcpz is this OK now?
Thanks

Alcpz · 2024-10-09T14:49:25Z

> @Alcpz is this OK now? Thanks

Yes. I agree with @joeatodd's review.
Accepting your changes, assuming that you will finish addressing his suggestions. Sorry for missing this.

JackAKirk · 2024-10-09T14:51:49Z

> > @Alcpz is this OK now? Thanks
>
> Yes. I agree with @joeatodd review. Accepting your changes, assuming that you will finalize addressing his suggestions. Sorry for missing this.

Yes, I've updated the formatting now, thanks.

joeatodd

LGTM

JackAKirk · 2024-10-09T15:02:27Z

@intel/llvm-gatekeepers Please merge this.

Thanks

JackAKirk added 2 commits June 25, 2024 07:19

Added ptr_to_integer syclcompat functions.

9e77065

These functions are commonly required in optimized libraries that use inline ptx. The standard naming convention of removing "__" from corresponding cuda builtins has been applied. Signed-off-by: JackAKirk <[email protected]>

Add missing eof space.

3dcd427

Signed-off-by: JackAKirk <[email protected]>

JackAKirk requested a review from a team as a code owner June 25, 2024 14:26

Fix test.

e5e3183

Signed-off-by: JackAKirk <[email protected]>

JackAKirk temporarily deployed to WindowsCILock June 25, 2024 14:44 — with GitHub Actions Inactive

JackAKirk temporarily deployed to WindowsCILock June 25, 2024 15:15 — with GitHub Actions Inactive

joeatodd changed the title from `[syclcompat][cuda] Add "ptr_to_integer" syclcompat functions.` to `[SYCL][COMPAT][cuda] Add "ptr_to_integer" syclcompat functions.` on Jun 26, 2024

Alcpz suggested changes Jun 27, 2024

Apply suggestions from code review

a054077

ptx -> PTX removed ptx doc link as requested. Co-authored-by: Alberto Cabrera Pérez <[email protected]>

JackAKirk temporarily deployed to WindowsCILock June 27, 2024 10:46 — with GitHub Actions Inactive

JackAKirk temporarily deployed to WindowsCILock June 27, 2024 11:27 — with GitHub Actions Inactive

joeatodd suggested changes Jun 27, 2024

sycl/doc/syclcompat/README.md (outdated)

Update sycl/doc/syclcompat/README.md

5b8b643

Co-authored-by: Joe Todd <[email protected]>

JackAKirk temporarily deployed to WindowsCILock July 9, 2024 09:56 — with GitHub Actions Inactive

JackAKirk had a problem deploying to WindowsCILock July 9, 2024 10:23 — with GitHub Actions Error

Address review feedback.

c2d2a50

Signed-off-by: JackAKirk <[email protected]>

JackAKirk temporarily deployed to WindowsCILock July 9, 2024 10:42 — with GitHub Actions Inactive

JackAKirk had a problem deploying to WindowsCILock July 9, 2024 11:25 — with GitHub Actions Failure

include defs.hpp in memory.hpp

054e90e

Signed-off-by: JackAKirk <[email protected]>

JackAKirk had a problem deploying to WindowsCILock July 9, 2024 14:08 — with GitHub Actions Error

JackAKirk requested review from Alcpz and joeatodd July 9, 2024 14:18

Use sycl 2020 exceptions.

054baf7

Signed-off-by: JackAKirk <[email protected]>

JackAKirk temporarily deployed to WindowsCILock July 9, 2024 15:22 — with GitHub Actions Inactive

JackAKirk temporarily deployed to WindowsCILock July 9, 2024 17:28 — with GitHub Actions Inactive

joeatodd suggested changes Aug 23, 2024

npmiller closed this Aug 29, 2024

Switch to templated function.

ea085b3

Merge branch 'sycl' into cuda-nvvm_get_smem_pointer Signed-off-by: JackAKirk <[email protected]>

JackAKirk reopened this Oct 7, 2024

JackAKirk had a problem deploying to WindowsCILock October 7, 2024 16:04 — with GitHub Actions Error

Update docs to reflect change to ptr_to_int.

0d2064a

A single templated function is preferred. Signed-off-by: JackAKirk <[email protected]>

JackAKirk had a problem deploying to WindowsCILock October 7, 2024 16:16 — with GitHub Actions Error

Move docs to correct location.

18137d4

Signed-off-by: JackAKirk <[email protected]>

JackAKirk temporarily deployed to WindowsCILock October 7, 2024 16:21 — with GitHub Actions Inactive

JackAKirk had a problem deploying to WindowsCILock October 8, 2024 08:03 — with GitHub Actions Error

joeatodd self-requested a review October 8, 2024 09:22

joeatodd suggested changes Oct 8, 2024

Address reviewer comments.

7ea41d8

Signed-off-by: JackAKirk <[email protected]>

JackAKirk had a problem deploying to WindowsCILock October 8, 2024 10:31 — with GitHub Actions Error

Update format in readme.

888f0d5

Signed-off-by: JackAKirk <[email protected]>

JackAKirk temporarily deployed to WindowsCILock October 8, 2024 11:35 — with GitHub Actions Inactive

JackAKirk temporarily deployed to WindowsCILock October 9, 2024 04:57 — with GitHub Actions Inactive

Alcpz approved these changes Oct 9, 2024

JackAKirk requested a review from joeatodd October 9, 2024 14:52

joeatodd approved these changes Oct 9, 2024

martygrant merged commit 3ba29f3 into intel:sycl Oct 10, 2024
13 checks passed
