Commit f3e25c0

Add RollingUpdate example comment
1 parent ad40723 commit f3e25c0

1 file changed: +6 -2 lines changed

charts/azimuth-llm/values.yaml

Lines changed: 6 additions & 2 deletions
@@ -66,8 +66,12 @@ api:
   gpus: 1
   # The update strategy to use for the deployment
   # See https://kubernetes.io/docs/concepts/workloads/controllers/deployment/#updating-a-deployment
-  # NOTE: Changing this has implications for the number of additional GPU worker nodes required
-  # to preform a rolling zero-downtime update
+  # NOTE: The following RollingUpdate strategy offers a zero-downtime update but requires additional GPU worker nodes.
+  # updateStrategy:
+  #   type: RollingUpdate
+  #   rollingUpdate:
+  #     maxSurge: 1
+  #     maxUnavailable: 0
   updateStrategy:
     type: Recreate
   # The value of the vLLM backend's max_model_len argument (if the model's default is not suitable)
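
To illustrate the NOTE in this hunk: with `maxSurge: 1` and `maxUnavailable: 0`, Kubernetes starts one replacement pod before terminating an old one, so the Deployment briefly runs one pod more than its desired replica count. Each of those pods requests its own GPUs, which is why spare GPU capacity is needed for the duration of the update. A minimal sketch of a values override that opts in to the commented-out strategy follows. It assumes `updateStrategy` sits under the `api:` key (as the hunk header suggests) and that the chart passes it through to the Deployment's `spec.strategy`; the file name is hypothetical.

```yaml
# values-override.yaml (hypothetical file name, for illustration only)
# Assumes the chart maps api.updateStrategy onto the Deployment's spec.strategy.
api:
  updateStrategy:
    type: RollingUpdate
    rollingUpdate:
      # Allow one pod above the desired replica count while updating.
      maxSurge: 1
      # Keep every existing pod running until its replacement is ready.
      maxUnavailable: 0
```

The default `Recreate` strategy avoids the extra capacity requirement by terminating the old pods before starting new ones, at the cost of downtime during the update.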
