Skip to content

Commit 17ef2d1

Browse files
authored
Use the fp8 KV cache by default in the single node (#1883)
Use the fp8 KV cache by default in the single node
1 parent 9010b45 commit 17ef2d1

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

scripts/quickstart/start_vllm.sh

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -31,7 +31,7 @@ max_num_seqs=128
3131
host=0.0.0.0
3232
max_model_len=16384
3333

34-
KV_CACHE_DTYPE=auto
34+
KV_CACHE_DTYPE=fp8_inc
3535

3636
while getopts hw:u:p:l:b:c:sq flag; do
3737
case $flag in

0 commit comments

Comments
 (0)