
Commit f78db30

improve documentation

1 parent 7a22848 commit f78db30

File tree: 1 file changed (+17 −18 lines changed)

docs/libcudacxx/extended_api/mdspan/mdspan_to_dlpack.rst

Lines changed: 17 additions & 18 deletions
@@ -3,7 +3,7 @@
 ``mdspan`` to DLPack
 ====================
 
-This functionality provides a conversion from ``cuda::host_mdspan``, ``cuda::device_mdspan``, and ``cuda::managed_mdspan`` to a `DLPack <https://dmlc.github.io/dlpack/latest/>`__ ``DLTensor`` view.
+This functionality provides a conversion from ``cuda::host_mdspan``, ``cuda::device_mdspan``, and ``cuda::managed_mdspan`` to `DLPack <https://dmlc.github.io/dlpack/latest/>`__ ``DLTensor`` view.
 
 Defined in the ``<cuda/mdspan>`` header.

@@ -48,17 +48,17 @@ Types
 
       DLTensor& get() noexcept;
       const DLTensor& get() const noexcept;
-   };
+   };
 
    } // namespace cuda
 
 ``cuda::dlpack_tensor`` stores a ``DLTensor`` and owns the backing storage for its ``shape`` and ``strides`` pointers. The class does not use any heap allocation.
 
-.. note:: Lifetime
+.. note:: **Lifetime**
 
    The ``DLTensor`` associated with ``cuda::dlpack_tensor`` must not outlive the wrapper. If the wrapper is destroyed, the returned ``DLTensor::shape`` and ``DLTensor::strides`` pointers will dangle.
 
-.. note:: Const-correctness
+.. note:: **Const-correctness**
 
    ``DLTensor::data`` points at ``mdspan.data_handle()`` (or is ``nullptr`` if ``mdspan.size() == 0``). If ``T`` is ``const``, the pointer is ``const_cast``'d because ``DLTensor::data`` is unqualified.

@@ -79,27 +79,27 @@ The conversion produces a non-owning DLPack view of the ``mdspan`` data and metadata
 
 Element types are mapped to ``DLDataType`` according to the DLPack conventions, including:
 
+- ``bool``.
 - Signed and unsigned integers.
-- IEEE-754 Floating-point and extended precision floating-point, including ``__half``, ``__nv_bfloat16``, FP8, FP6, FP4 when available.
+- IEEE-754 floating-point and extended-precision floating-point, including ``__half``, ``__nv_bfloat16``, ``__float128``, FP8, FP6, and FP4 when available.
 - Complex: ``cuda::std::complex<__half>``, ``cuda::std::complex<float>``, and ``cuda::std::complex<double>``.
-- `CUDA built-in vector types <https://docs.nvidia.com/cuda/cuda-programming-guide/05-appendices/cpp-language-extensions.html#built-in-types>`__, such as ``int2``, ``float4``, etc..
+- `CUDA built-in vector types <https://docs.nvidia.com/cuda/cuda-programming-guide/05-appendices/cpp-language-extensions.html#built-in-types>`__, such as ``int2``, ``float4``, etc.
+- Vector types for extended floating-point, such as ``__half2``, ``__nv_fp8x4_e4m3``, etc.
 
-Constraints and errors
-----------------------
-
-**Constraints**
+Constraints
+-----------
 
 - The accessor ``data_handle_type`` must be a pointer type.
 
-**Runtime errors**
+Runtime errors
+--------------
 
 - If any ``extent(i)`` or ``stride(i)`` cannot be represented in ``int64_t``, the conversion raises an exception.
 
 Availability notes
 ------------------
 
-- This API is available only when DLPack headers are present (``<dlpack/dlpack.h>`` is found in the include path).
-* ``dlpack/dlpack.h`` (`DLPack v1 <https://github.com/dmlc/dlpack>`__) must be discoverable at compile time, namely available in the include path.
+- This API is available only when the DLPack header is present, i.e. ``<dlpack/dlpack.h>`` is found in the include path.

 
 References
 ----------
@@ -111,14 +111,13 @@ Example
 
 .. code:: cuda
 
-   #include <cuda/mdspan>
-
    #include <dlpack/dlpack.h>
-   #include <cassert>
-   #include <cstdint>
+   #include <cuda/mdspan>
+   #include <cuda/std/cassert>
+   #include <cuda/std/cstdint>
 
    int main() {
-     using extents_t = cuda::std::extents<std::size_t, 2, 3>;
+     using extents_t = cuda::std::extents<size_t, 2, 3>;
 
      int data[6] = {0, 1, 2, 3, 4, 5};
      cuda::host_mdspan<int, extents_t> md{data, extents_t{}};
