Skip to content

Commit 4cbc75c

Browse files
committed
Update guide.
Signed-off-by: congw729 <[email protected]>
1 parent 5bd2584 commit 4cbc75c

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

InterVL/InterVL3.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -17,7 +17,7 @@ uv pip install -U vllm --torch-backend auto
1717
### Weights
1818
[OpenGVLab/InternVL3-8B-hf](https://huggingface.co/OpenGVLab/InternVL3-8B)
1919

20-
#### Running InternVL3-8B-hf model on A100-SXM4-40GB GPUs (2 cards) in eager mode
20+
### Running InternVL3-8B-hf model on A100-SXM4-40GB GPUs (2 cards) in eager mode
2121

2222
Launch the online inference server using TP=2:
2323
```bash
@@ -96,7 +96,7 @@ vllm bench serve \
9696
--random-input 2048 \
9797
--random-output 1024 \
9898
--max-concurrency 10 \
99-
--num-prompts 50\
99+
--num-prompts 50 \
100100
--ignore-eos
101101
```
102102
If it works successfully, you will see the following output.

0 commit comments

Comments
 (0)