@@ -7548,11 +7548,12 @@ A managed variable can be accessed in both device and host code.
75487548def CUDAClusterDimsAttrDoc : Documentation {
75497549 let Category = DocCatDecl;
75507550 let Content = [{
7551- In CUDA/HIP programming, the ``__cluster_dims__`` attribute can be applied to a kernel function
7552- to set the dimensions of a thread block cluster, which is an optional level of hierarchy and made
7553- up of thread blocks. ``__cluster_dims__`` defines the cluster size as ``(X, Y, Z)``, where each value
7554- is the number of thread blocks in that dimension. The ``__cluster_dims__`` and `__no_cluster__``
7555- attributes are mutually exclusive.
7551+ In CUDA/HIP programming, the ``cluster_dims`` attribute, conventionally exposed as
7552+ ``__cluster_dims__`` macro, can be applied to a kernel function to set the dimensions of a
7553+ thread block cluster, which is an optional level of hierarchy and made up of thread blocks.
7554+ ``__cluster_dims__`` defines the cluster size as ``(X, Y, Z)``, where each value is the number
7555+ of thread blocks in that dimension. The ``__cluster_dims__`` and `__no_cluster__`` attributes
7556+ are mutually exclusive.
75567557
75577558.. code::
75587559
@@ -7568,12 +7569,12 @@ def CUDANoClusterAttrDoc : Documentation {
75687569 let Content = [{
75697570In CUDA/HIP programming, a kernel function can still be launched with the cluster feature
75707571enabled at runtime, even without the ``__cluster_dims__`` attribute. The LLVM/Clang-exclusive
7571- ``__no_cluster__ `` attribute can be applied to a kernel function to explicitly indicate that
7572- the cluster feature will not be enabled either at compile time or at kernel launch time. This
7573- allows the compiler to apply certain optimizations without assuming that clustering could be
7574- enabled at runtime. It is undefined behavior to launch a kernel with the ``__no_cluster__``
7575- attribute if the cluster feature is enabled at runtime. The ``__cluster_dims__`` and
7576- ``__no_cluster__`` attributes are mutually exclusive.
7572+ ``no_cluster `` attribute, conventionally exposed as ``__no_cluster__`` macro, can be applied
7573+ to a kernel function to explicitly indicate that the cluster feature will not be enabled either
7574+ at compile time or at kernel launch time. This allows the compiler to apply certain optimizations
7575+ without assuming that clustering could be enabled at runtime. It is undefined behavior to launch
7576+ a kernel with the ``__no_cluster__`` attribute if the cluster feature is enabled at runtime.
7577+ The ``__cluster_dims__`` and ``__no_cluster__`` attributes are mutually exclusive.
75777578
75787579.. code::
75797580
0 commit comments