Skip to content

Commit 84b449f

Browse files
authored
Merge pull request #64465 from empovit/nvidia-gpu-admin-dashboard
TELCODOCS-1571: Update NVIDIA GPU dashboard
2 parents 28ba771 + 5d6f8e2 commit 84b449f

8 files changed

+4
-229
lines changed

_topic_maps/_topic_map.yml

Lines changed: 0 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -2589,8 +2589,6 @@ Topics:
25892589
File: managing-alerts
25902590
- Name: Reviewing monitoring dashboards
25912591
File: reviewing-monitoring-dashboards
2592-
- Name: The NVIDIA GPU administration dashboard
2593-
File: nvidia-gpu-admin-dashboard
25942592
- Name: Monitoring bare-metal events
25952593
File: using-rfhe
25962594
- Name: Accessing third-party monitoring APIs

architecture/nvidia-gpu-architecture-overview.adoc

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -57,10 +57,9 @@ include::modules/nvidia-gpu-features.adoc[leveloffset=+1]
5757
.Additional resources
5858
5959
* link:https://docs.nvidia.com/ngc/ngc-deploy-on-premises/nvidia-certified-systems/index.html[NVIDIA-Certified Systems]
60-
* link:https://access.redhat.com/documentation/en-us/openshift_container_platform/4.13/html/monitoring/nvidia-gpu-admin-dashboard#doc-wrapper[The NVIDIA GPU administration dashboard]
6160
* link:https://docs.nvidia.com/datacenter/cloud-native/gpu-operator/latest/openshift/nvaie-with-ocp.html[NVIDIA AI Enterprise with OpenShift]
6261
* link:https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/overview.html#[NVIDIA Container Toolkit]
63-
* link:https://developer.nvidia.com/dcgm[NVIDIA DCGM]
62+
* link:https://docs.nvidia.com/datacenter/cloud-native/gpu-operator/latest/openshift/enable-gpu-monitoring-dashboard.html[Enabling the GPU Monitoring Dashboard]
6463
* link:https://docs.nvidia.com/datacenter/cloud-native/gpu-operator/openshift/mig-ocp.html[MIG Support in OpenShift Container Platform]
6564
* link:https://docs.nvidia.com/datacenter/cloud-native/gpu-operator/openshift/time-slicing-gpus-in-openshift.html[Time-slicing NVIDIA GPUs in OpenShift]
6665
* link:https://docs.nvidia.com/datacenter/cloud-native/gpu-operator/openshift/mirror-gpu-ocp-disconnected.html[Deploy GPU Operators in a disconnected or airgapped environment]

modules/nvidia-gpu-admin-dashboard-installing.adoc

Lines changed: 0 additions & 136 deletions
This file was deleted.

modules/nvidia-gpu-admin-dashboard-introduction.adoc

Lines changed: 0 additions & 14 deletions
This file was deleted.

modules/nvidia-gpu-admin-dashboard-using.adoc

Lines changed: 0 additions & 59 deletions
This file was deleted.

modules/nvidia-gpu-csps.adoc

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@
66
[id="nvidia-gpu-csps_{context}"]
77
= GPUs and CSPs
88

9-
You can deploy {product title} to one of the major cloud service providers (CSPs): Amazon Web Services (AWS), Google Cloud Platform (GCP), or Microsoft Azure.
9+
You can deploy {product-title} to one of the major cloud service providers (CSPs): Amazon Web Services (AWS), Google Cloud Platform (GCP), or Microsoft Azure.
1010

1111
Two modes of operation are available: a fully managed deployment and a self-managed deployment.
1212

modules/nvidia-gpu-features.adoc

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -54,5 +54,5 @@ Up until this point, the GPU Operator only provisioned worker nodes to run GPU-a
5454
+
5555
You can configure the GPU Operator to deploy different software components to worker nodes depending on which GPU workload is configured to run on those nodes.
5656

57-
GPU Operator dashboard::
58-
You can install a console plugin to display GPU usage information on the cluster utilization screen in the {product title} web console. GPU utilization information includes the number of available GPUs, power consumption (in watts) for each GPU and the percentage of GPU workload used for video encoding and decoding.
57+
GPU Monitoring dashboard::
58+
You can install a monitoring dashboard to display GPU usage information on the cluster *Observe* page in the {product-title} web console. GPU utilization information includes the number of available GPUs, power consumption (in watts), temperature (in degrees Celsius), utilization (in percent), and other metrics for each GPU.

monitoring/nvidia-gpu-admin-dashboard.adoc

Lines changed: 0 additions & 13 deletions
This file was deleted.

0 commit comments

Comments
 (0)