9 changes: 4 additions & 5 deletions gpu-operator/life-cycle-policy.rst
@@ -91,21 +91,20 @@ Refer to :ref:`Upgrading the NVIDIA GPU Operator` for more information.
* - NVIDIA GPU Driver
- | `565.57.01 <https://docs.nvidia.com/datacenter/tesla/tesla-release-notes-565-57-01/index.html>`_
| `560.35.03 <https://docs.nvidia.com/datacenter/tesla/tesla-release-notes-560-35-03/index.html>`_
| `550.127.08 <https://docs.nvidia.com/datacenter/tesla/tesla-release-notes-550-127-08/index.html>`_ (recommended),
| `550.127.05 <https://docs.nvidia.com/datacenter/tesla/tesla-release-notes-550-127-05/index.html>`_ (default),
| `550.127.08 <https://docs.nvidia.com/datacenter/tesla/tesla-release-notes-550-127-08/index.html>`_ (default),
| `535.216.03 <https://docs.nvidia.com/datacenter/tesla/tesla-release-notes-535-216-03/index.html>`_

* - NVIDIA Driver Manager for Kubernetes
- `v0.7.0 <https://ngc.nvidia.com/catalog/containers/nvidia:cloud-native:k8s-driver-manager>`__

* - NVIDIA Container Toolkit
- `1.17.0 <https://github.com/NVIDIA/nvidia-container-toolkit/releases>`__
- `1.17.3 <https://github.com/NVIDIA/nvidia-container-toolkit/releases>`__

* - NVIDIA Kubernetes Device Plugin
- `0.17.0 <https://github.com/NVIDIA/k8s-device-plugin/releases>`__

* - DCGM Exporter
- `3.3.8-3.6.0 <https://github.com/NVIDIA/dcgm-exporter/releases>`__
- `3.3.9-3.6.1 <https://github.com/NVIDIA/dcgm-exporter/releases>`__

* - Node Feature Discovery
- v0.16.6
@@ -118,7 +117,7 @@ Refer to :ref:`Upgrading the NVIDIA GPU Operator` for more information.
- `0.10.0 <https://github.com/NVIDIA/mig-parted/tree/main/deployments/gpu-operator>`__

* - DCGM
- `3.3.8-1 <https://docs.nvidia.com/datacenter/dcgm/latest/release-notes/changelog.html>`__
- `3.3.9-1 <https://docs.nvidia.com/datacenter/dcgm/latest/release-notes/changelog.html>`__

* - Validator for NVIDIA GPU Operator
- ${version}
4 changes: 2 additions & 2 deletions gpu-operator/platform-support.rst
@@ -471,7 +471,7 @@ Support for GPUDirect RDMA

Supported operating systems and NVIDIA GPU Drivers with GPUDirect RDMA.

- Ubuntu 20.04 and 22.04 LTS with Network Operator 24.7.0
- Ubuntu 20.04 and 22.04 LTS with Network Operator 24.10.0
- Red Hat OpenShift 4.12 and higher with Network Operator 23.10.0

For information about configuring GPUDirect RDMA, refer to :doc:`gpu-operator-rdma`.
@@ -482,7 +482,7 @@ Support for GPUDirect Storage

Supported operating systems and NVIDIA GPU Drivers with GPUDirect Storage.

- Ubuntu 20.04 and 22.04 LTS with Network Operator 24.7.0
- Ubuntu 20.04 and 22.04 LTS with Network Operator 24.10.0
- Red Hat OpenShift Container Platform 4.12 and higher

.. note::
40 changes: 40 additions & 0 deletions gpu-operator/release-notes.rst
@@ -34,6 +34,46 @@ See the :ref:`GPU Operator Component Matrix` for a list of software components a

----

.. _v24.9.1:

24.9.1
======

.. _v24.9.1-new-features:

New Features
------------

* Added support for the NVIDIA Data Center GPU Driver versions 550.127.08 and 535.216.03.
Refer to the :ref:`GPU Operator Component Matrix`
on the platform support page.

* Added support for the following software component versions:

- NVIDIA Container Toolkit v1.17.3
- NVIDIA DCGM v3.3.9-1
- NVIDIA DCGM Exporter v3.3.9-3.6.1

* Added support for NVIDIA Network Operator v24.10.0.
Refer to :ref:`Support for GPUDirect RDMA` and :ref:`Support for GPUDirect Storage`.

* Added an ``all-balanced`` MIG profile for H200 NVL which creates the following GPU instances:

* ``1g.18gb`` :math:`\times` 2
* ``2g.35gb`` :math:`\times` 1
* ``3g.71gb`` :math:`\times` 1

.. _v24.9.1-fixed-issues:

Fixed Issues
------------

* Fixed an issue where NVIDIA Container Toolkit would fail to start on Rancher RKE2, K3s, and Canonical MicroK8s.
Refer to GitHub `issue #1109 <https://github.com/NVIDIA/gpu-operator/issues/1109>`__ for more details.

* Fixed an issue where events were not being generated by the NVIDIA driver upgrade controller.
Refer to GitHub `issue #1101 <https://github.com/NVIDIA/gpu-operator/issues/1101>`__ for more details.

.. _v24.9.0:

24.9.0
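
The ``all-balanced`` profile added in the 24.9.1 release notes above can be sanity-checked with a little arithmetic. The sketch below is illustrative only and is not part of this change: the helper names are hypothetical, the instance counts are copied from the release note, and the seven-slice limit is an assumption about MIG-capable GPUs. The memory total is just the sum of the nominal labels in the profile names, not a statement about physical capacity.

```python
import re

# Instance counts copied from the release note above.
ALL_BALANCED_H200_NVL = {"1g.18gb": 2, "2g.35gb": 1, "3g.71gb": 1}

# Assumption: a GPU exposes at most seven MIG compute slices.
MAX_COMPUTE_SLICES = 7


def parse_profile(name):
    """Split a profile name such as '2g.35gb' into (compute_slices, memory_gb)."""
    match = re.fullmatch(r"(\d+)g\.(\d+)gb", name)
    if match is None:
        raise ValueError(f"unrecognized MIG profile name: {name!r}")
    return int(match.group(1)), int(match.group(2))


def layout_totals(layout):
    """Return (total compute slices, sum of nominal memory labels in GB)."""
    slices = 0
    memory_gb = 0
    for name, count in layout.items():
        per_instance_slices, per_instance_gb = parse_profile(name)
        slices += per_instance_slices * count
        memory_gb += per_instance_gb * count
    return slices, memory_gb


slices, memory_gb = layout_totals(ALL_BALANCED_H200_NVL)
print(slices, memory_gb)  # prints "7 142"
assert slices <= MAX_COMPUTE_SLICES
```

Under these assumptions the four instances use all seven compute slices, which is consistent with the profile being described as balanced across the whole GPU.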
8 changes: 4 additions & 4 deletions gpu-operator/versions.json
@@ -1,7 +1,10 @@
{
"latest": "24.9.0",
"latest": "24.9.1",
"versions":
[
{
"version": "24.9.1"
},
{
"version": "24.9.0"
},
@@ -16,9 +19,6 @@
},
{
"version": "24.3.0"
},
{
"version": "23.9.2"
}
]
}
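
The ``versions.json`` change above moves ``latest`` to 24.9.1 and adds a matching entry at the top of ``versions``. A minimal sketch of a consistency check for that invariant follows. It is not part of this change: the ``check_versions`` helper is hypothetical, and the sample data is abbreviated because the collapsed hunk hides some entries, which are omitted here rather than guessed.

```python
import json

# Abbreviated sample mirroring the post-change file. Entries hidden by the
# collapsed diff hunk are left out on purpose.
SAMPLE = """
{
  "latest": "24.9.1",
  "versions": [
    {"version": "24.9.1"},
    {"version": "24.9.0"},
    {"version": "24.3.0"}
  ]
}
"""


def check_versions(doc):
    """Return a list of problems; an empty list means the data looks consistent."""
    problems = []
    listed = [entry["version"] for entry in doc["versions"]]
    if doc["latest"] not in listed:
        problems.append(f"latest {doc['latest']!r} is not in versions")
    elif listed[0] != doc["latest"]:
        problems.append("latest is not the first entry")
    if len(set(listed)) != len(listed):
        problems.append("duplicate version entries")
    return problems


problems = check_versions(json.loads(SAMPLE))
print(problems or "ok")  # prints "ok"
```

The same check could be pointed at ``gpu-operator/versions.json`` and ``openshift/versions.json`` by loading each file with ``json.load`` instead of the inline sample.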
8 changes: 4 additions & 4 deletions openshift/versions.json
@@ -1,7 +1,10 @@
{
"latest": "24.9.0",
"latest": "24.9.1",
"versions":
[
{
"version": "24.9.1"
},
{
"version": "24.9.0"
},
@@ -13,9 +16,6 @@
},
{
"version": "24.3.0"
},
{
"version": "23.9.2"
}
]
}
6 changes: 3 additions & 3 deletions repo.toml
@@ -142,8 +142,8 @@ output_format = "linkcheck"
docs_root = "${root}/gpu-operator"
project = "gpu-operator"
name = "NVIDIA GPU Operator"
version = "24.9.0"
source_substitutions = { version = "v24.9.0", recommended = "550.127.08" }
version = "24.9.1"
source_substitutions = { version = "v24.9.1", recommended = "550.127.08" }
copyright_start = 2020
sphinx_exclude_patterns = [
"life-cycle-policy.rst",
@@ -201,7 +201,7 @@ output_format = "linkcheck"
docs_root = "${root}/openshift"
project = "gpu-operator-openshift"
name = "NVIDIA GPU Operator on Red Hat OpenShift Container Platform"
version = "24.9.0"
version = "24.9.1"
copyright_start = 2020
sphinx_exclude_patterns = [
"get-entitlement.rst",