2 files changed: +16 -0
@@ -21,6 +21,14 @@ CUDA device images can be generated by a CUDA-capable compiler toolchain. Most
 CUDA compiler toolchains are capable of generating PTX, SASS and/or bundles of
 PTX and SASS.

+When generating device code to be launched using Unified Runtime, it is
+recommended to use a programming model with explicit kernel parameters, such as
+OpenCL or CUDA. This is because kernels generated by a programming model with
+implicit kernel parameters, such as SYCL, cannot guarantee any specific number
+or ordering of kernel parameters. It has been observed that kernel signatures
+for the same SYCL kernel may vary significantly when compiled for different
+architectures.
+
 PTX
 ---

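The recommendation in the added paragraph can be illustrated with a host-only C++ sketch. It assumes a launcher that binds arguments purely by position, which is the layout the `kernelParams` array of `cuLaunchKernel` uses. The names `saxpy` and `launch_saxpy` are hypothetical stand-ins and are not part of Unified Runtime or CUDA; the "kernel" here is an ordinary function so the sketch runs without a GPU.

```cpp
#include <cassert>
#include <cstddef>

// Host-only stand-in for a device kernel with an explicit signature, e.g.
//   extern "C" __global__ void saxpy(float a, const float *x, float *y, int n);
// (hypothetical kernel, used for illustration only).
void saxpy(float a, const float *x, float *y, int n) {
    for (int i = 0; i < n; ++i) {
        y[i] = a * x[i] + y[i];
    }
}

// Hypothetical launcher: args[i] points at the value of the i-th parameter.
// Binding is purely positional, so it is only correct when the number and
// order of parameters are known in advance.
void launch_saxpy(void **args, std::size_t num_args) {
    assert(num_args == 4 && "argument count must match the signature");
    saxpy(*static_cast<float *>(args[0]),
          *static_cast<const float **>(args[1]),
          *static_cast<float **>(args[2]),
          *static_cast<int *>(args[3]));
}
```

With an explicit signature the caller can build `args` directly from the source declaration. If the compiler were free to add, drop, or reorder parameters, the index-to-parameter mapping above would no longer be knowable from the source alone.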
@@ -27,6 +27,14 @@ HIPCC can generate device code for a particular arch using the ``--genco`` flag

     $ hipcc --genco hello.cu --amdgpu-target=gfx906 -o hello.hsaco

+When generating device code to be launched using Unified Runtime, it is
+recommended to use a programming model with explicit kernel parameters, such as
+OpenCL or HIP. This is because kernels generated by a programming model with
+implicit kernel parameters, such as SYCL, cannot guarantee any specific number
+or ordering of kernel parameters. It has been observed that kernel signatures
+for the same SYCL kernel may vary significantly when compiled for different
+architectures.
+
 UR Programs
 ===========

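The failure mode described in the added paragraph (the same kernel exposing different signatures on different architectures) can be guarded against with a simple comparison. The following is a minimal C++ sketch under stated assumptions: `ParamDesc` and `signature_matches` are hypothetical helpers, and how the actual parameter list of a compiled kernel is obtained is deliberately left out, since the source does not describe it.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical description of one kernel parameter: only its size in bytes.
struct ParamDesc {
    std::size_t size;
};

// Returns true only when both parameter lists have the same length and the
// same size at every index, i.e. positional argument binding would be safe.
bool signature_matches(const std::vector<ParamDesc> &expected,
                       const std::vector<ParamDesc> &actual) {
    if (expected.size() != actual.size()) {
        return false;
    }
    for (std::size_t i = 0; i < expected.size(); ++i) {
        if (expected[i].size != actual[i].size) {
            return false;
        }
    }
    return true;
}
```

For a kernel written with explicit parameters, `expected` can be derived from the source declaration once and reused for every target. A size-only comparison cannot detect two same-sized parameters being swapped, so it is a sanity check rather than a guarantee.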