Skip to content

Commit bdf8cf5

Browse files
committed
Add pod resources option for inference model deployment
1 parent dbb65cc commit bdf8cf5

File tree

2 files changed

+5
-0
lines changed

2 files changed

+5
-0
lines changed

deployment/helm/charts/danswer/templates/inference-model-deployment.yaml

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -25,6 +25,10 @@ spec:
2525
image: "{{ .Values.inferenceCapability.deployment.image.repository }}:{{ .Values.inferenceCapability.deployment.image.tag | default .Values.appVersionOverride | default .Chart.AppVersion }}"
2626
imagePullPolicy: {{ .Values.inferenceCapability.deployment.image.pullPolicy }}
2727
command: {{ toYaml .Values.inferenceCapability.deployment.command | nindent 14 }}
28+
{{- if .Values.inferenceCapability.deployment.resources }}
29+
resources:
30+
{{- toYaml .Values.inferenceCapability.deployment.resources | nindent 10 }}
31+
{{- end }}
2832
ports:
2933
- containerPort: {{ .Values.inferenceCapability.service.port }}
3034
envFrom:

deployment/helm/charts/danswer/values.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -39,6 +39,7 @@ inferenceCapability:
3939
tag:
4040
pullPolicy: IfNotPresent
4141
command: ["uvicorn", "model_server.main:app", "--host", "0.0.0.0", "--port", "9000"]
42+
resources:
4243
port: 9000
4344
volumeMounts:
4445
- name: inference-model-storage

0 commit comments

Comments
 (0)