Commit bf2bdb2

Fix CDMM known issues item (#260)
1 parent 5cf4fee commit bf2bdb2

1 file changed, +13 -6 lines changed

gpu-operator/release-notes.rst

Lines changed: 13 additions & 6 deletions
@@ -103,11 +103,18 @@ New Features
 Known Issues
 ------------
 
-* The 580.65.06 driver container comes with Coherent Driver Memory Management (CDMM) enabled by default to support GB200 on Kubernetes.
-For more details on CDMM, refer to the `release notes <https://docs.nvidia.com/datacenter/tesla/tesla-release-notes-580-65-06/index.html#hardware-software-support>`__.
-Note that the 580.65.06 driver container does not support MIG on GB200 when CDMM is enabled.
-Additionally, CDMM enablement is only applicable to Grace-based systems like GH200 and GB200, so it is ignored on other GPU machines.
-NVIDIA strongly recommends keeping CDMM enabled on these systems to avoid memory over-reporting and unrestricted GPU memory access issues.
+* Starting with version **580.65.06**, the driver container has **Coherent Driver Memory Management (CDMM)** enabled by default to support **GB200** on Kubernetes.
+For more information about CDMM, refer to the `release notes <https://docs.nvidia.com/datacenter/tesla/tesla-release-notes-580-65-06/index.html#hardware-software-support>`__.
+
+.. note::
+
+Currently, CDMM is not compatible with the **Multi-Instance GPUs (MIG)** sharing.
+CDMM is also not compatible with **GPU Direct Storage**.
+CDMM support for these features is planned for future driver updates.
+However, these limitations will remain in place until a future driver update removes them.
+
+CDMM enablement applies only to **Grace-based systems** such as **GH200** and **GB200** and is ignored on other GPU platforms.
+NVIDIA strongly recommends keeping CDMM enabled with Kubernetes on supported systems to prevent memory over-reporting and uncontrolled GPU memory access.
 
 * For drivers 570.124.06, 570.133.20, 570.148.08, and 570.158.01,
 GPU workloads cannot be scheduled on nodes that have a mix of MIG slices and full GPUs.
@@ -127,7 +134,7 @@ Fixed Issues
 ------------
 
 * Fixed security vulnerabilities in NVIDIA Container Toolkit and related components.
-This release addresses CVE-2025-23266 (Critical) and CVE-2025-23267 (High) which could allow
+This release addresses CVE-2025-23266 (Critical) and CVE-2025-23267 (High) that could allow
 arbitrary code execution and link following attacks in container environments.
 For complete details, refer to the `NVIDIA Security Bulletin <https://nvidia.custhelp.com/app/answers/detail/a_id/5659>`__.
