Skip to content

Commit cf7a121

Browse files
committed
Various minor updates to deployment manifests
1 parent df205a9 commit cf7a121

File tree

4 files changed

+2
-17
lines changed

4 files changed

+2
-17
lines changed

deployment/models/flan-ul2-tp/kustomization.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -13,6 +13,7 @@ patchesStrategicMerge:
1313
- ../../base/patches/gpus/2-gpu.yaml
1414
- ../../base/patches/gpus/a100-80g.yaml
1515
- ../../base/patches/limits/t5-xxl.yaml
16+
- ../../base/patches/shm.yaml
1617
- |-
1718
apiVersion: apps/v1
1819
kind: Deployment

deployment/models/gpt-neox/kustomization.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -33,7 +33,7 @@ patchesStrategicMerge:
3333
- name: MAX_CONCURRENT_REQUESTS
3434
value: "320"
3535
- name: MAX_BATCH_WEIGHT
36-
value: "9900"
36+
value: "9200"
3737
- name: MAX_SEQUENCE_LENGTH
3838
value: "8192"
3939
- name: MAX_PREFILL_WEIGHT

deployment/models/ibm-mpt-7b-instruct/kustomization.yaml

Lines changed: 0 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -40,11 +40,3 @@ patchesStrategicMerge:
4040
value: "2000000"
4141
- name: MAX_BATCH_WEIGHT
4242
value: "200000000"
43-
44-
startupProbe:
45-
httpGet:
46-
port: http
47-
path: /health
48-
# MPT-7B currently takes more than 15 mins to load :-(
49-
failureThreshold: 52
50-
periodSeconds: 30

deployment/models/mpt-7b/kustomization.yaml

Lines changed: 0 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -42,11 +42,3 @@ patchesStrategicMerge:
4242
value: "2000000"
4343
- name: MAX_BATCH_WEIGHT
4444
value: "200000000"
45-
46-
startupProbe:
47-
httpGet:
48-
port: http
49-
path: /health
50-
# MPT-7B currently takes more than 15 mins to load :-(
51-
failureThreshold: 52
52-
periodSeconds: 30

0 commit comments

Comments
 (0)