update

yiliu30 · yiliu30 · commit c64f54893b4c · 2025-12-04T13:50:02.000Z
Signed-off-by: yiliu30 &lt;yi4.liu@intel.com&gt;
diff --git a/_posts/2025-12-03-intel-autoround-llmc.md b/_posts/2025-12-03-intel-autoround-llmc.md
@@ -120,7 +120,7 @@ vllm serve Qwen3-8B-W4A16-G128-AutoRound \
     --max-num-batched-tokens 8192 
 ```
 
-Note: please install vLLM from this PR https://github.com/vllm-project/vllm/pull/29484/
+Note: Please install vLLM from PR #29484. When serving on XPU, you must run vLLM with the --enforce-eager flag.
 
 ### 6. Evaluate (Example: GSM8K with `lm_eval`)