You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
|`servingEngineSpec.modelSpec[].replicaCount`| integer |`1`| The number of replicas for the model |
90
+
|`servingEngineSpec.modelSpec[].pdb.enabled`| boolean |`false`| Whether to create a PodDisruptionBudget for the model |
91
+
|`servingEngineSpec.modelSpec[].pdb.labels`| map |`{}`| Labels to add to the PodDisruptionBudget |
92
+
|`servingEngineSpec.modelSpec[].pdb.annotations`| map |`{}`| Annotations to add to the PodDisruptionBudget |
93
+
|`servingEngineSpec.modelSpec[].pdb.minAvailable`| string |`""`| Number of pods that are available after eviction as number or percentage (eg.: 50%) |
94
+
|`servingEngineSpec.modelSpec[].pdb.maxUnavailable`| string |`""`| Number of pods that are unavailable after eviction as number or percentage (eg.: 50%). |
90
95
|`servingEngineSpec.modelSpec[].resources`| object |`{}`| Standard Kubernetes resources block (requests/limits). If specified, this takes priority over and ignores simplified resource fields (requestCPU, requestMemory, requestGPU, etc.) |
91
96
|`servingEngineSpec.modelSpec[].requestCPU`| integer |`0`| The number of CPUs requested for the model |
92
97
|`servingEngineSpec.modelSpec[].requestMemory`| string |`""`| The amount of memory requested for the model, e.g., "16Gi" |
@@ -196,12 +201,17 @@ This table documents all available configuration values for the Production Stack
196
201
197
202
| Field | Type | Default | Description |
198
203
|-------|------|---------|-------------|
204
+
|`routerSpec.enableRouter`| boolean |`true`| Whether to enable the router service |
199
205
|`routerSpec.repository`| string |`"lmcache/lmstack-router"`| Docker image repository for the router |
200
206
|`routerSpec.tag`| string |`"latest"`| Docker image tag for the router |
201
207
|`routerSpec.imagePullPolicy`| string |`"Always"`| Image pull policy for the router |
202
208
|`routerSpec.imagePullSecrets`| list |`[]`| Image pull secrets for private container registries |
203
-
|`routerSpec.enableRouter`| boolean |`true`| Whether to enable the router service |
204
209
|`routerSpec.replicaCount`| integer |`1`| Number of replicas for the router pod |
210
+
|`routerSpec.pdb.enabled`| boolean |`false`| Whether to create a PodDisruptionBudget for the model |
211
+
|`routerSpec.pdb.labels`| map |`{}`| Labels to add to the PodDisruptionBudget |
212
+
|`routerSpec.pdb.annotations`| map |`{}`| Annotations to add to the PodDisruptionBudget |
213
+
|`routerSpec.pdb.minAvailable`| string |`""`| Number of pods that are available after eviction as number or percentage (eg.: 50%) |
214
+
|`routerSpec.pdb.maxUnavailable`| string |`""`| Number of pods that are unavailable after eviction as number or percentage (eg.: 50%). |
205
215
|`routerSpec.priorityClassName`| string |`""`| Priority class for router |
206
216
|`routerSpec.containerPort`| integer |`8000`| Port the router container is listening on |
207
217
|`routerSpec.serviceType`| string |`"ClusterIP"`| Kubernetes service type for the router |
@@ -325,7 +335,10 @@ This table documents all available configuration values for the Production Stack
325
335
|`loraController.image.tag`| string |`"latest"`| Docker image tag |
0 commit comments