Skip to content
This repository was archived by the owner on Sep 6, 2025. It is now read-only.

Commit e0c3077

Browse files
authored
perf: let midnight rose scale to zero (#86)
1 parent 7301ae2 commit e0c3077

File tree

1 file changed

+0
-1
lines changed

1 file changed

+0
-1
lines changed

modal/runner/containers/vllm_unified.py

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -176,7 +176,6 @@ def __init__(self):
176176
model_name=_midnight_rose,
177177
gpu=modal.gpu.H100(count=1),
178178
concurrent_inputs=4,
179-
keep_warm=1,
180179
max_containers=1,
181180
quantization="GPTQ",
182181
)

0 commit comments

Comments
 (0)