1 parent 5bd2584 commit 4cbc75c
InterVL/InterVL3.md
@@ -17,7 +17,7 @@ uv pip install -U vllm --torch-backend auto
### Weights

[OpenGVLab/InternVL3-8B-hf](https://huggingface.co/OpenGVLab/InternVL3-8B)

-#### Running InternVL3-8B-hf model on A100-SXM4-40GB GPUs (2 cards) in eager mode
+### Running InternVL3-8B-hf model on A100-SXM4-40GB GPUs (2 cards) in eager mode

Launch the online inference server using TP=2:
```bash
@@ -96,7 +96,7 @@ vllm bench serve \
--random-input 2048 \
--random-output 1024 \
--max-concurrency 10 \
- --num-prompts 50\
+ --num-prompts 50 \
--ignore-eos
```
If it works successfully, you will see the following output.
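For context, the benchmark hunk above targets an already-running server, and the first hunk's "Launch the online inference server using TP=2" command is cut off by the diff. A minimal sketch of that launch, assuming the standard `vllm serve` CLI flags (`--tensor-parallel-size` for TP=2, `--enforce-eager` for eager mode) rather than the exact command from the original file, might look like:

```shell
# Hedged sketch, not the verbatim command from InterVL3.md:
# serve InternVL3-8B-hf across 2 GPUs with tensor parallelism,
# disabling CUDA graph capture (eager mode) via --enforce-eager.
vllm serve OpenGVLab/InternVL3-8B-hf \
  --tensor-parallel-size 2 \
  --enforce-eager
```

Once the server is up (it listens on port 8000 by default), the `vllm bench serve` invocation shown in the second hunk can be pointed at it; the trailing-backslash fix in that hunk matters because without the space before `\`, the shell would join `50\` with the next line and break argument parsing.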