Skip to content

Commit f6612c2

Browse files
authored
GPU Operator Release 25.3.2 (#219)
* Initial 25.3.2 changes Signed-off-by: Andrew Chen <[email protected]> * Add @mikemckiernan feedback Signed-off-by: Andrew Chen <[email protected]> * accept feedback Signed-off-by: Andrew Chen <[email protected]> --------- Signed-off-by: Andrew Chen <[email protected]>
1 parent 548a3c0 commit f6612c2

File tree

5 files changed

+43
-11
lines changed

5 files changed

+43
-11
lines changed

gpu-operator/cdi.rst

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -55,7 +55,10 @@ Both the ``single`` and ``mixed`` strategies are supported.
5555
Limitations and Restrictions
5656
============================
5757

58-
Enabling CDI is not supported with Red Hat OpenShift Container Platform.
58+
* Enabling CDI is not supported with Red Hat OpenShift Container Platform.
59+
60+
* Enabling CDI is not supported with Rancher Kubernetes Engine 2 (RKE2).
61+
5962
Refer to the :ref:`Supported Operating Systems and Kubernetes Platforms`.
6063

6164

gpu-operator/life-cycle-policy.rst

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -104,7 +104,7 @@ Refer to :ref:`Upgrading the NVIDIA GPU Operator` for more information.
104104
- `1.17.8 <https://github.com/NVIDIA/nvidia-container-toolkit/releases>`__
105105

106106
* - NVIDIA Kubernetes Device Plugin
107-
- `0.17.2 <https://github.com/NVIDIA/k8s-device-plugin/releases>`__
107+
- `0.17.3 <https://github.com/NVIDIA/k8s-device-plugin/releases>`__
108108

109109
* - DCGM Exporter
110110
- `4.2.3-4.1.3 <https://github.com/NVIDIA/dcgm-exporter/releases>`__
@@ -114,10 +114,10 @@ Refer to :ref:`Upgrading the NVIDIA GPU Operator` for more information.
114114

115115
* - | NVIDIA GPU Feature Discovery
116116
| for Kubernetes
117-
- `0.17.2 <https://github.com/NVIDIA/k8s-device-plugin/releases>`__
117+
- `0.17.3 <https://github.com/NVIDIA/k8s-device-plugin/releases>`__
118118

119119
* - NVIDIA MIG Manager for Kubernetes
120-
- `0.12.1 <https://github.com/NVIDIA/mig-parted/tree/main/deployments/gpu-operator>`__
120+
- `0.12.2 <https://github.com/NVIDIA/mig-parted/tree/main/deployments/gpu-operator>`__
121121

122122
* - DCGM
123123
- `4.2.3 <https://docs.nvidia.com/datacenter/dcgm/latest/release-notes/changelog.html>`__

gpu-operator/release-notes.rst

Lines changed: 29 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -33,6 +33,35 @@ See the :ref:`GPU Operator Component Matrix` for a list of software components a
3333

3434
----
3535

36+
.. _v25.3.2:
37+
38+
25.3.2
39+
======
40+
41+
.. _v25.3.2-new-features:
42+
43+
New Features
44+
------------
45+
46+
* Added support for the following software component versions:
47+
48+
- NVIDIA Kubernetes Device Plugin/NVIDIA GPU Feature Discovery v0.17.3
49+
- NVIDIA MIG Manager for Kubernetes v0.12.2
50+
51+
.. _v25.3.2-known-issues:
52+
53+
Known Issues
54+
------------
55+
56+
* For drivers 570.124.06, 570.133.20, 570.148.08, and 570.158.01,
57+
GPU workloads cannot be scheduled on nodes that have a mix of MIG slices and full GPUs.
58+
This manifests as GPU pods getting stuck indefinitely in the ``Pending`` state.
59+
NVIDIA recommends that you downgrade the driver to version 570.86.15 to work around this issue.
60+
For more detailed information, see GitHub issue #1361 <https://github.com/NVIDIA/gpu-operator/issue/1361>__.
61+
62+
* Configuring the Operator to enable CDI is not supported on Rancher Kubernetes Engine 2 (RKE2).
63+
64+
3665
.. _v25.3.1:
3766

3867
25.3.1

gpu-operator/versions1.json

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,10 @@
11
[
22
{
33
"preferred": "true",
4+
"url": "../25.3.2",
5+
"version": "25.3.2"
6+
},
7+
{
48
"url": "../25.3.1",
59
"version": "25.3.1"
610
},
@@ -19,9 +23,5 @@
1923
{
2024
"url": "../24.9.0",
2125
"version": "24.9.0"
22-
},
23-
{
24-
"url": "../24.6.2",
25-
"version": "24.6.2"
2626
}
2727
]

repo.toml

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -166,8 +166,8 @@ output_format = "linkcheck"
166166
docs_root = "${root}/gpu-operator"
167167
project = "gpu-operator"
168168
name = "NVIDIA GPU Operator"
169-
version = "25.3.1"
170-
source_substitutions = { version = "v25.3.1", recommended = "570.148.08" }
169+
version = "25.3.2"
170+
source_substitutions = { version = "v25.3.2", recommended = "570.148.08" }
171171
copyright_start = 2020
172172
sphinx_exclude_patterns = [
173173
"life-cycle-policy.rst",
@@ -225,7 +225,7 @@ output_format = "linkcheck"
225225
docs_root = "${root}/openshift"
226226
project = "gpu-operator-openshift"
227227
name = "NVIDIA GPU Operator on Red Hat OpenShift Container Platform"
228-
version = "25.3.1"
228+
version = "25.3.2"
229229
copyright_start = 2020
230230
sphinx_exclude_patterns = [
231231
"get-entitlement.rst",

0 commit comments

Comments
 (0)