Skip to content

Commit c64f548

Browse files
committed
update
Signed-off-by: yiliu30 <[email protected]>
1 parent de67e9f commit c64f548

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

_posts/2025-12-03-intel-autoround-llmc.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -120,7 +120,7 @@ vllm serve Qwen3-8B-W4A16-G128-AutoRound \
120120
--max-num-batched-tokens 8192
121121
```
122122

123-
Note: please install vLLM from this PR https://github.com/vllm-project/vllm/pull/29484/
123+
Note: Please install vLLM from PR #29484. When serving on XPU, you must run vLLM with the --enforce-eager flag.
124124

125125
### 6. Evaluate (Example: GSM8K with `lm_eval`)
126126

0 commit comments

Comments
 (0)