You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
To deploy the EndpointPicker in a high-availability (HA) active-passive configuration, you can enable leader election. When enabled, the EPP deployment will have multiple replicas, but only one "leader" replica will be active and ready to process traffic at any given time. If the leader pod fails, another pod will be elected as the new leader, ensuring service continuity.
85
85
86
-
To enable HA, set `inferenceExtension.flags.has-enable-leader-election` to `true` and increase the number of replicas in your `values.yaml` file:
86
+
To enable HA, set `inferenceExtension.enableLeaderElection` to `true`.
@@ -150,7 +161,8 @@ The following table list the configurable parameters of the chart.
150
161
| `inferenceExtension.extraContainerPorts` | List of additional container ports to expose. Defaults to `[]`. |
151
162
| `inferenceExtension.extraServicePorts` | List of additional service ports to expose. Defaults to `[]`. |
152
163
| `inferenceExtension.flags` | List of flags which are passed through to endpoint picker. Example flags, enable-pprof, grpc-port etc. Refer [runner.go](https://github.com/kubernetes-sigs/gateway-api-inference-extension/blob/main/cmd/epp/runner/runner.go) for complete list. |
153
-
| `inferenceExtension.flags.has-enable-leader-election` | Enable leader election for high availability. When enabled, only one EPP pod (the leader) will be ready to serve traffic. |
164
+
| `inferenceExtension.affinity` | Affinity for the endpoint picker. Defaults to `{}`. |
165
+
| `inferenceExtension.tolerations` | Tolerations for the endpoint picker. Defaults to `[]`. | |
154
166
| `inferenceExtension.monitoring.interval` | Metrics scraping interval for monitoring. Defaults to `10s`. |
155
167
| `inferenceExtension.monitoring.secret.name` | Name of the service account token secret for metrics authentication. Defaults to `inference-gateway-sa-metrics-reader-secret`. |
156
168
| `inferenceExtension.monitoring.prometheus.enabled` | Enable Prometheus ServiceMonitor creation for EPP metrics collection. Defaults to `false`. |
0 commit comments