Commit 2922fec

changed #
Signed-off-by: zRzRzRzRzRzRzR <[email protected]>
1 parent fc55b11 commit 2922fec

1 file changed: +5 -3 lines changed

_posts/2025-08-15-glm45-vllm.md

Lines changed: 5 additions & 3 deletions
@@ -5,7 +5,9 @@ author: "Yuxuan Zhang"
 image: /assets/logos/vllm-logo-text-light.png
 ---
 
-# Model Introduction
+# Use vLLM to deploy GLM-4.5 and GLM-4.5V model
+
+## Model Introduction
 
 The GLM-4.5 series models are foundation models designed for intelligent agents. GLM-4.5 has 355 billion total
 parameters with 32 billion active parameters, while GLM-4.5-Air adopts a more compact design with 106 billion total
@@ -69,7 +71,7 @@ vllm serve zai-org/GLM-4.5V \
 --media-io-kwargs '{"video": {"num_frames": -1}}'
 ```
 
-## Important Notes
+### Important Notes
 
 + The reasoning part of the model output will be wrapped in `reasoning_content`. `content` will only contain the final
 answer. To disable reasoning, add the following parameter:
@@ -86,7 +88,7 @@ vllm serve zai-org/GLM-4.5V \
 GLM-4.5V equips precise grounding capabilities. Given a prompt that requests the location of a specific object, GLM-4.5V
 is able to reasoning step-by-step and identify the bounding boxes of the target object. The query prompt supports
 complex descriptions of the target object as well as specified output formats, for example:
->
+
 > - Help me to locate <expr> in the image and give me its bounding boxes.
 > - Please pinpoint the bounding box [[x1,y1,x2,y2], …] in the image as per the given description. <expr>

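The grounding prompts quoted in the last hunk use `<expr>` as a placeholder for the object description and ask for boxes in the form `[[x1,y1,x2,y2], …]`. The sketch below shows one way a client might fill in the placeholder and pull boxes out of an answer. The helper names `build_grounding_prompt` and `parse_bounding_boxes` are hypothetical, and the parser assumes the answer contains plain bracketed numeric quadruples; the real output of GLM-4.5V may wrap boxes in extra tokens, which this sketch does not handle.

```python
import re

# Prompt templates quoted from the post; "<expr>" marks where the
# object description is substituted.
PROMPT_TEMPLATES = (
    "Help me to locate <expr> in the image and give me its bounding boxes.",
    "Please pinpoint the bounding box [[x1,y1,x2,y2], …] in the image "
    "as per the given description. <expr>",
)

# One number (integer or decimal), captured as a group.
_NUM = r"(-?\d+(?:\.\d+)?)"

# Assumption: each box appears in the answer as "[x1, y1, x2, y2]".
_BOX_PATTERN = re.compile(
    r"\[\s*" + r"\s*,\s*".join([_NUM] * 4) + r"\s*\]"
)


def build_grounding_prompt(description: str, template_index: int = 0) -> str:
    """Substitute the object description into one of the quoted templates."""
    return PROMPT_TEMPLATES[template_index].replace("<expr>", description)


def parse_bounding_boxes(answer: str) -> list[list[float]]:
    """Return every [x1, y1, x2, y2] quadruple found in the answer text."""
    return [[float(v) for v in m.groups()] for m in _BOX_PATTERN.finditer(answer)]


if __name__ == "__main__":
    print(build_grounding_prompt("the red mug"))
    print(parse_bounding_boxes("The mug is at [[10, 20, 110, 220]]."))
```

The regex approach is deliberately loose: it ignores the outer list brackets and any surrounding prose, so it tolerates answers that mix text with coordinates, at the cost of also matching unrelated four-number lists.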