Skip to content

Commit 520b679

Browse files
committed
update
1 parent 3925675 commit 520b679

File tree

1 file changed

+6
-5
lines changed

1 file changed

+6
-5
lines changed

config/charts/inferencepool/README.md

Lines changed: 6 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -137,14 +137,16 @@ inferenceExtension:
137137
monitoring:
138138
interval: "10s"
139139
prometheus:
140-
enabled: true
141-
secret:
142-
name: inference-gateway-sa-metrics-reader-secret
140+
enabled: false
141+
auth:
142+
enabled: true
143+
secretName: inference-gateway-sa-metrics-reader-secret
144+
extraLabels: {}
143145
```
144146

145147
**Note:** Prometheus monitoring requires the Prometheus Operator and ServiceMonitor CRD to be installed in the cluster.
146148

147-
For GKE environments, monitoring is enabled by setting `provider.name` to `gke` and `inferenceExtension.monitoring.gke.enabled` to `true`. This will create the necessary `PodMonitoring` and RBAC resources for metrics collection.
149+
For GKE environments, monitoring is enabled by setting `provider.name` to `gke`. This will create the necessary `PodMonitoring` and RBAC resources for metrics collection.
148150

149151
If you are using a GKE Autopilot cluster, you also need to set `provider.gke.autopilot` to `true`.
150152

@@ -186,7 +188,6 @@ The following table list the configurable parameters of the chart.
186188
| `inferenceExtension.monitoring.interval` | Metrics scraping interval for monitoring. Defaults to `10s`. |
187189
| `inferenceExtension.monitoring.secret.name` | Name of the service account token secret for metrics authentication. Defaults to `inference-gateway-sa-metrics-reader-secret`. |
188190
| `inferenceExtension.monitoring.prometheus.enabled` | Enable Prometheus ServiceMonitor creation for EPP metrics collection. Defaults to `false`. |
189-
| `inferenceExtension.monitoring.gke.enabled` | Enable GKE monitoring resources (`PodMonitoring` and RBAC). Defaults to `false`. |
190191
| `inferenceExtension.pluginsCustomConfig` | Custom config that is passed to EPP as inline yaml. |
191192
| `provider.name` | Name of the Inference Gateway implementation being used. Possible values: [`none`, `gke`, or `istio`]. Defaults to `none`. |
192193
| `provider.gke.autopilot` | Set to `true` if the cluster is a GKE Autopilot cluster. This is only used if `provider.name` is `gke`. Defaults to `false`. |

0 commit comments

Comments
 (0)