config/charts/inferencepool/README.md: 6 additions & 6 deletions
@@ -104,18 +104,18 @@ The following table list the configurable parameters of the chart.
 |`inferenceExtension.extProcPort`| Port where the endpoint picker service is served for external processing. Defaults to `9002`. |
 |`inferenceExtension.env`| List of environment variables to set in the endpoint picker container as free-form YAML. Defaults to `[]`. |
 |`inferenceExtension.enablePprof`| Enables pprof for profiling and debugging |
-|`inferenceExtension.modelServerMetricsPath`|Flag to have model server metrics|
-|`inferenceExtension.modelServerMetricsScheme`|Flag to have model server metrics scheme |
-|`inferenceExtension.modelServerMetricsPort`|Flag for have model server metrics port |
+|`inferenceExtension.modelServerMetricsPath`|Path to scrape metrics from pods|
+|`inferenceExtension.modelServerMetricsScheme`|Scheme to scrape metrics from pods|
+|`inferenceExtension.modelServerMetricsPort`|Port to scrape metrics from pods. Default value will be set to InferencePool.Spec.TargetPortNumber if not set.|
 |`inferenceExtension.modelServerMetricsHttpsInsecureSkipVerify`| When using 'https' scheme for 'model-server-metrics-scheme', configure 'InsecureSkipVerify' (default to true) |
 |`inferenceExtension.secureServing`| Enables secure serving. Defaults to true. |
-|`inferenceExtension.healthChecking`| Enables health checking |
+|`inferenceExtension.healthChecking`| Enables health checking. Defaults to false.|
 |`inferenceExtension.certPath`| The path to the certificate for secure serving. The certificate and private key files are assumed to be named tls.crt and tls.key, respectively. If not set, and secureServing is enabled, then a self-signed certificate is used. |
 |`inferenceExtension.refreshMetricsInterval`| Interval to refresh metrics |
 |`inferenceExtension.refreshPrometheusMetricsInterval`| Interval to flush prometheus metrics |
-|`inferenceExtension.metricsStalenessThreshold`| Duration after which metrics are considered stale. This is used to determine if a pod's metrics are fresh enough. |
+|`inferenceExtension.metricsStalenessThreshold`| Duration after which pod's metrics are considered stale (invalid).|
 |`inferenceExtension.totalQueuedRequestsMetric`| Prometheus metric for the number of queued requests. |
-|`inferenceExtension.extraContainerPorts`| List of additional container ports to expose. Defaults to `[]`.|
+|`inferenceExtension.extraContainerPorts`| List of additional container ports to expose for endpoint picker. Defaults to `[]`. |
 |`inferenceExtension.extraServicePorts`| List of additional service ports to expose. Defaults to `[]`. |
 |`inferenceExtension.logVerbosity`| Logging verbosity level for the endpoint picker. Defaults to `"1"`. |
 |`provider.name`| Name of the Inference Gateway implementation being used. Possible values: `gke`. Defaults to `none`. |
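
For readers who want to see how the reworded parameters might be set together, here is a minimal sketch of a Helm values override. The keys come from the table above. The concrete values (`/metrics`, `http`, `8000`, `2s`) are illustrative assumptions, not defaults confirmed by this change.

```yaml
# Hypothetical values override for the inferencepool chart.
# Keys are taken from the README table; values are assumed examples.
inferenceExtension:
  # Path, scheme, and port used to scrape metrics from model server pods.
  modelServerMetricsPath: /metrics    # assumed example path
  modelServerMetricsScheme: http      # assumed example scheme
  # If omitted, the README says the port falls back to
  # InferencePool.Spec.TargetPortNumber.
  modelServerMetricsPort: 8000        # assumed example port
  # Documented default is false.
  healthChecking: false
  # Duration after which a pod's metrics are treated as stale.
  metricsStalenessThreshold: 2s       # assumed example duration
  # Documented default is an empty list.
  extraContainerPorts: []
```

Leaving `modelServerMetricsPort` unset is the simpler choice when metrics are served on the same port as inference traffic, since the documented fallback already points at the pool's target port.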