You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: config/charts/inferencepool/README.md
+3-4Lines changed: 3 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -119,11 +119,10 @@ The following table list the configurable parameters of the chart.
119
119
|`inferenceExtension.image.hub`| Registry URL where the endpoint picker image is hosted. |
120
120
|`inferenceExtension.image.tag`| Image tag of the endpoint picker. |
121
121
|`inferenceExtension.image.pullPolicy`| Image pull policy for the container. Possible values: `Always`, `IfNotPresent`, or `Never`. Defaults to `Always`. |
122
-
|`inferenceExtension.extProcPort`| Port where the endpoint picker service is served for external processing. Defaults to `9002`. |
123
122
|`inferenceExtension.env`| List of environment variables to set in the endpoint picker container as free-form YAML. Defaults to `[]`. |
124
-
|`inferenceExtension.extraContainerPorts`| List of additional container ports to expose. Defaults to `[]`. |
125
-
|`inferenceExtension.extraServicePorts`| List of additional service ports to expose. Defaults to `[]`. |
126
-
|`inferenceExtension.logVerbosity`| Logging verbosity level for the endpoint picker. Defaults to `"3"`.|
123
+
|`inferenceExtension.extraContainerPorts`| List of additional container ports to expose. Defaults to `[]`. |
124
+
|`inferenceExtension.extraServicePorts`| List of additional service ports to expose. Defaults to `[]`. |
125
+
|`inferenceExtension.flags`| List of flags which are passed through to endpoint picker. |
127
126
|`provider.name`| Name of the Inference Gateway implementation being used. Possible values: `gke`. Defaults to `none`. |
128
127
|`inferenceExtension.enableLeaderElection`| Enable leader election for high availability. When enabled, only one EPP pod (the leader) will be ready to serve traffic. It is recommended to set `inferenceExtension.replicas` to a value greater than 1 when this is set to `true`. Defaults to `false`. |
0 commit comments