Helm chart values and customization instructions can be found [in the official documentation](https://github.com/IBM/autopilot/blob/main/helm-charts/autopilot/README.md). As-is, Autopilot will run on GPU nodes.
setup.RHOAI-v2.16/CLUSTER-SETUP.md
27 lines changed: 27 additions & 0 deletions
@@ -76,6 +76,33 @@ AI configuration as follows:
+## Autopilot
+
+Helm chart values and customization instructions can be found [in the official documentation](https://github.com/IBM/autopilot/blob/main/helm-charts/autopilot/README.md). As-is, Autopilot will run on GPU nodes.
setup.RHOAI-v2.17/CLUSTER-SETUP.md
27 lines changed: 27 additions & 0 deletions
@@ -76,6 +76,33 @@ AI configuration as follows:
+## Autopilot
+
+Helm chart values and customization instructions can be found [in the official documentation](https://github.com/IBM/autopilot/blob/main/helm-charts/autopilot/README.md). As-is, Autopilot will run on GPU nodes.
setup.k8s/CLUSTER-SETUP.md
29 lines changed: 29 additions & 0 deletions
@@ -7,6 +7,7 @@ The cluster setup installs and configures the following components:
 + Kueue
 + AppWrappers
 + Cluster roles and priority classes
++ Autopilot
 
 ## Priorities
@@ -73,6 +74,34 @@ operators as follows:
 - `queueName` is set to `default-queue`,
 - pod priorities, resource requests and limits have been adjusted.
 
+## Autopilot
+
+Helm chart values and customization instructions can be found [in the official documentation](https://github.com/IBM/autopilot/blob/main/helm-charts/autopilot/README.md). As-is, Autopilot will run on GPU nodes.
+The `ServiceMonitor` object enables Prometheus to scrape the metrics produced by Autopilot.
+For Prometheus to find the right objects, the `ServiceMonitor` needs to be labeled with the Prometheus release name. This is usually `prometheus`, which is the default set in the Autopilot release.
+If that is not the case in your cluster, the correct release label can be found by checking the `ServiceMonitor` of Prometheus itself, or the name of the Prometheus Helm chart.
+Then, Autopilot's `ServiceMonitor` can be labeled with the following command:
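
Before labeling, the release label that Prometheus expects can be looked up. The following is a minimal sketch and not part of the PR: it assumes the Prometheus Operator CRDs are installed, and `release_from_labels` is a hypothetical helper that extracts the `release` value from a comma-separated label list such as the one printed by `kubectl get --show-labels`.

```shell
#!/bin/sh
# Hypothetical helper (not part of Autopilot): print the value of the
# `release` label from a comma-separated label list, e.g. the LABELS
# column produced by `kubectl get ... --show-labels`.
release_from_labels() {
  printf '%s\n' "$1" | tr ',' '\n' | sed -n 's/^release=//p'
}

# Example with a made-up label list:
release_from_labels 'app=kube-prometheus-stack,release=my-prom'

# Against a live cluster (requires kubectl and helm access; commented out here):
#   kubectl get servicemonitors.monitoring.coreos.com -A --show-labels
#   helm list -A
```

The helper prints nothing when no `release` label is present, which is a hint to check the Helm release name instead.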
setup.tmpl/CLUSTER-SETUP.md.tmpl
39 lines changed: 39 additions & 0 deletions
@@ -12,6 +12,7 @@ The cluster setup installs and configures the following components:
 + Kueue
 + AppWrappers
 + Cluster roles and priority classes
++ Autopilot
 
 {{- end }}
@@ -154,6 +155,44 @@ operators as follows:
 {{- end }}
 
+## Autopilot
+
+Helm chart values and customization instructions can be found [in the official documentation](https://github.com/IBM/autopilot/blob/main/helm-charts/autopilot/README.md). As-is, Autopilot will run on GPU nodes.
+The `ServiceMonitor` object enables Prometheus to scrape the metrics produced by Autopilot.
+For Prometheus to find the right objects, the `ServiceMonitor` needs to be labeled with the Prometheus release name. This is usually `prometheus`, which is the default set in the Autopilot release.
+If that is not the case in your cluster, the correct release label can be found by checking the `ServiceMonitor` of Prometheus itself, or the name of the Prometheus Helm chart.
+Then, Autopilot's `ServiceMonitor` can be labeled with the following command:
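
A labeling command along these lines might look like the sketch below. It is not copied from the PR: the namespace `autopilot`, the `ServiceMonitor` name `autopilot-metrics-monitor`, and the release value are assumptions for illustration and should be checked against your cluster.

```shell
#!/bin/sh
# All three values are assumptions for illustration; adjust them to your cluster.
NAMESPACE="autopilot"                  # assumed namespace of the Autopilot release
MONITOR="autopilot-metrics-monitor"    # assumed name of Autopilot's ServiceMonitor
RELEASE="prometheus"                   # replace with your Prometheus release name

# Build the command as a string first so it can be reviewed before it runs.
# --overwrite replaces an existing `release` label instead of failing.
LABEL_CMD="kubectl label servicemonitors.monitoring.coreos.com -n ${NAMESPACE} ${MONITOR} release=${RELEASE} --overwrite"
echo "${LABEL_CMD}"

# Once the values look right, run it against the cluster:
#   eval "${LABEL_CMD}"
```

Printing the command before running it keeps the sketch safe to try without cluster access.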