Skip to content

Commit 39a943b

Browse files
committed
move autopilot to provider.gke.
1 parent f43ae48 commit 39a943b

File tree

3 files changed

+14
-5
lines changed

3 files changed

+14
-5
lines changed

config/charts/inferencepool/README.md

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -135,6 +135,8 @@ inferenceExtension:
135135

136136
For GKE environments, monitoring is enabled by setting `provider.name` to `gke` and `inferenceExtension.monitoring.gke.enabled` to `true`. This will create the necessary `PodMonitoring` and RBAC resources for metrics collection.
137137

138+
If you are using a GKE Autopilot cluster, you also need to set `provider.gke.autopilot` to `true`.
139+
138140
Then apply it with:
139141

140142
```txt
@@ -174,10 +176,10 @@ The following table list the configurable parameters of the chart.
174176
| `inferenceExtension.monitoring.interval` | Metrics scraping interval for monitoring. Defaults to `10s`. |
175177
| `inferenceExtension.monitoring.secret.name` | Name of the service account token secret for metrics authentication. Defaults to `inference-gateway-sa-metrics-reader-secret`. |
176178
| `inferenceExtension.monitoring.prometheus.enabled` | Enable Prometheus ServiceMonitor creation for EPP metrics collection. Defaults to `false`. |
177-
| `inferenceExtension.monitoring.gke.enabled` | Enable GKE monitoring resources (`PodMonitoring` and RBAC). Defaults to `false`. |
178-
| `inferenceExtension.monitoring.gke.autopilot` | Set to `true` if the cluster is a GKE Autopilot cluster. This ensures the correct `gke-gmp-system` namespace is used for the GMP collector. Defaults to `false`. |
179+
| `inferenceExtension.monitoring.gke.enabled` | Enable GKE monitoring resources (`PodMonitoring` and RBAC). Defaults to `false`. |
179180
| `inferenceExtension.pluginsCustomConfig` | Custom config that is passed to EPP as inline yaml. |
180181
| `provider.name` | Name of the Inference Gateway implementation being used. Possible values: `gke`. Defaults to `none`. |
182+
| `provider.gke.autopilot` | Set to `true` if the cluster is a GKE Autopilot cluster. This is only used if `provider.name` is `gke`. Defaults to `false`. |
181183

182184
## Notes
183185

config/charts/inferencepool/templates/gke.yaml

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -42,7 +42,11 @@ spec:
4242
{{- $metricsReadRoleBindingName := printf "%s-%s-metrics-reader-role-binding" .Release.Namespace .Release.Name -}}
4343
{{- $secretReadRoleName := printf "%s-metrics-reader-secret-read" .Release.Name -}}
4444
{{- $gmpNamespace := "gmp-system" -}}
45-
{{- if .Values.inferenceExtension.monitoring.gke.autopilot -}}
45+
{{- $isAutopilot := false -}}
46+
{{- with .Values.provider.gke }}
47+
{{- $isAutopilot = .autopilot | default false -}}
48+
{{- end }}
49+
{{- if $isAutopilot -}}
4650
{{- $gmpNamespace = "gke-gmp-system" -}}
4751
{{- end -}}
4852
{{- $gmpCollectorRoleBindingName := printf "%s:collector:%s-%s-metrics-reader-secret-read" $gmpNamespace .Release.Namespace .Release.Name -}}

config/charts/inferencepool/values.yaml

Lines changed: 5 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -53,8 +53,6 @@ inferenceExtension:
5353

5454
gke:
5555
enabled: false
56-
# Set to true if the cluster is an Autopilot cluster.
57-
autopilot: false
5856

5957
inferencePool:
6058
targetPorts:
@@ -72,3 +70,8 @@ inferencePool:
7270
provider:
7371
name: none
7472

73+
# GKE-specific configuration.
74+
# This block is only used if name is "gke".
75+
gke:
76+
# Set to true if the cluster is an Autopilot cluster.
77+
autopilot: false

0 commit comments

Comments
 (0)