content/en/docs/measuring/metric-providers/gpu-energy-nvidia-nvml-component.md
Lines changed: 19 additions & 16 deletions
@@ -1,6 +1,6 @@
1
1
---
2
-
title: "GPU energy - NVIDIA SMI - Component"
3
-
description: "Documentation for GpuEnergyNvidiaSmiComponentProvider of the Green Metrics Tool"
2
+
title: "GPU energy - NVIDIA NVML - Component"
3
+
description: "Documentation for GpuEnergyNvidiaNvmlComponentProvider of the Green Metrics Tool"
4
4
lead: ""
5
5
date: 2024-01-06T08:49:15+00:00
6
6
draft: false
@@ -14,27 +14,25 @@ This metric provider gets the current GPU power draw from the NVIDIA SMI softwar
14
14
15
15
### Classname
16
16
17
-
- `GpuEnergyNvidiaSmiComponentProvider`
17
+
- `GpuEnergyNvidiaNvmlComponentProvider`
18
18
19
19
### Metric Name
20
20
21
-
- `gpu_energy_nvidia_smi_component`
21
+
- `gpu_energy_nvidia_nvml_component`
22
22
23
23
### Prerequisites & Installation
24
24
25
-
You must first install the *CUDA Toolkit* from *NVIDIA* so that the metric provider has the libraries and binaries it needs. The URL at the time of writing is: [https://developer.nvidia.com/cuda-downloads](https://developer.nvidia.com/cuda-downloads)
25
+
We assume that the NVIDIA graphics card and the associated drivers are installed on your system.
26
26
27
-
You need both:
28
-
- Base Installer
29
-
- Driver Installer
27
+
If you still need to install them, please refer to the [NVIDIA Docs](https://developer.nvidia.com).
30
28
31
-
To check if the installation has succeeded you can run:
32
-
```console
33
-
$ nvidia-smi -q
34
-
```
29
+
GMT will try to install the needed C header and development files for the *Metrics Provider* to compile.
30
+
31
+
You can trigger this by adding `--nvidia-gpu` to the install script. If the installation fails, please refer to your OS documentation, e.g. the [NVIDIA Linux docs](https://docs.nvidia.com/cuda/cuda-installation-guide-linux).
35
32
36
-
After the installation your system can use language bindings for your matching *CUDA* version.
37
-
Please check on our [Measurement Cluster]({{< relref "/docs/measuring/measurement-cluster" >}}) page which *CUDA* version is installed.
33
+
### Running your code on our hosted service
34
+
35
+
Please check on our [Measurement Cluster]({{< relref "/docs/measuring/measurement-cluster" >}}) page which *CUDA* version is installed. You must use the same CUDA version if you have compiled artifacts in your containers.
38
36
39
37
#### Debugging
40
38
@@ -61,7 +59,7 @@ Check in `sudo dmesg` if the kernel module could correctly be loaded and then v
61
59
By default the measurement interval is 100 ms.
62
60
63
61
```bash
64
-
./metric-provider-nvidia-smi-wrapper.sh -i 100
62
+
./metric-provider-binary -i 100
65
63
```
66
64
67
65
### Output
@@ -73,10 +71,15 @@ This metric provider prints to Stdout a continuous stream of data. The format of
73
71
Where:
74
72
- `TIMESTAMP`: Unix timestamp, in microseconds
75
73
- `READING`: The power drawn by the GPU in milliwatts (e.g. 12230 for 12.23 W)
74
+
- `CARD NAME`: The name of the graphics card as reported by the driver
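
The units above can be made concrete with a small sketch. It is not part of the Green Metrics Tool: the function names `milliwatts_to_watts` and `energy_millijoules` are hypothetical, and it assumes you have already parsed the stream into `(TIMESTAMP, READING)` pairs. It also assumes each reading holds until the next sample arrives, which is only one possible way to estimate energy from power samples.

```python
from typing import List, Tuple


def milliwatts_to_watts(reading_mw: int) -> float:
    """Convert a READING value (milliwatts) to watts."""
    return reading_mw / 1000


def energy_millijoules(samples: List[Tuple[int, int]]) -> float:
    """Estimate energy from (timestamp_us, power_mw) samples ordered by time.

    Assumption: each power reading is held constant until the next sample.
    Since mW * s = mJ, the result is in millijoules.
    """
    total_mj = 0.0
    for (t0_us, p0_mw), (t1_us, _next_mw) in zip(samples, samples[1:]):
        dt_s = (t1_us - t0_us) / 1_000_000  # microseconds -> seconds
        total_mj += p0_mw * dt_s
    return total_mj


if __name__ == "__main__":
    # Hypothetical samples taken 100 ms apart (the default interval).
    samples = [(0, 10000), (100_000, 12000), (200_000, 8000)]
    print(milliwatts_to_watts(12230))
    print(energy_millijoules(samples))
```

The last sample contributes no energy here because there is no following timestamp to bound its interval.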