Skip to content

Commit bf9430a

Browse files
authored
RHOAIENG-26247 - AMD Grafana metrics (#873)
1 parent 73c2c9c commit bf9430a

File tree

1 file changed

+8
-5
lines changed

1 file changed

+8
-5
lines changed

modules/deploying-vllm-gpu-metrics-dashboard-grafana.adoc

Lines changed: 8 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -20,7 +20,9 @@ endif::[]
2020
.Procedure
2121

2222
. Define a `GrafanaDashboard` object in a YAML file, similar to the following examples:
23-
.. To monitor accelerator metrics, see link:https://github.com/rh-aiservices-bu/rhoai-uwm/tree/main/rhoai-uwm-grafana/overlays/rhoai-uwm-user-grafana-app/nvidia-vllm-dashboard.yaml[`nvidia-vllm-dashboard.yaml`].
23+
.. To monitor NVIDIA accelerator metrics, see link:https://github.com/rh-aiservices-bu/rhoai-uwm/tree/main/rhoai-uwm-grafana/overlays/rhoai-uwm-user-grafana-app/nvidia-vllm-dashboard.yaml[`nvidia-vllm-dashboard.yaml`].
24+
.. To monitor AMD accelerator metrics, see link:https://github.com/rh-aiservices-bu/rhoai-uwm/tree/main/rhoai-uwm-grafana/overlays/rhoai-uwm-user-grafana-app/amd-vllm-dashboard.yaml[`amd-vllm-dashboard.yaml`].
25+
.. To monitor Intel accelerator metrics, see link:https://github.com/rh-aiservices-bu/rhoai-uwm/tree/main/rhoai-uwm-grafana/overlays/rhoai-uwm-user-grafana-app/gaudi-vllm-dashboard.yaml[`gaudi-vllm-dashboard.yaml`].
2426
.. To monitor vLLM metrics, see link:https://github.com/rh-aiservices-bu/rhoai-uwm/tree/main/rhoai-uwm-grafana/overlays/rhoai-uwm-user-grafana-app/grafana-vllm-dashboard.yaml[`grafana-vllm-dashboard.yaml`].
2527

2628
. Create an `inputs.env` file similar to the following example. Replace the `NAMESPACE` and `MODEL_NAME` parameters with your own values:
@@ -33,19 +35,20 @@ MODEL_NAME=<model-name> <2>
3335
<1> **NAMESPACE** is the target namespace where the model will be deployed.
3436
<2> **MODEL_NAME** is the model name as defined in your InferenceService. The model name is also used to filter the pod name in the Grafana dashboard.
3537

36-
. Replace the `NAMESPACE` and `MODEL_NAME` parameters in your YAML file with the values from the `input.env` file by performing the following actions:
38+
. Replace the `NAMESPACE` and `MODEL_NAME` parameters in your YAML file with the values from the `inputs.env` file by performing the following actions:
3739

3840
.. Export the parameters described in the `inputs.env` as environment variables:
3941
+
4042
[source]
4143
----
4244
export $(cat inputs.env | xargs)
4345
----
44-
.. Replace the `$NAMESPACE` and `${MODEL_NAME)` variables in the YAML file with the values of the exported environment variables:
46+
47+
.. Update the following YAML file, replacing the `${NAMESPACE}` and `${MODEL_NAME}` variables with the values of the exported environment variables, and `dashboard_template.yaml` with the name of the `GrafanaDashboard` object YAML file that you created earlier:
4548
+
4649
[source]
4750
----
48-
envsubst '${NAMESPACE} ${MODEL_NAME}' < nvidia-vllm-dashboard.yaml > nvidia-vllm-dashboard-replaced.yaml
51+
envsubst '${NAMESPACE} ${MODEL_NAME}' < dashboard_template.yaml > dashboard_template-replaced.yaml
4952
----
5053

5154
. Confirm that your YAML file contains updated values.
@@ -54,7 +57,7 @@ envsubst '${NAMESPACE} ${MODEL_NAME}' < nvidia-vllm-dashboard.yaml > nvidia-vllm
5457
+
5558
[source]
5659
----
57-
oc create -f nvidia-vllm-dashboard-replaced.yaml
60+
oc create -f dashboard_template-replaced.yaml
5861
----
5962

6063
.Verification

0 commit comments

Comments
 (0)