Skip to content

Commit 1640c9f

Browse files
committed
minor edit for gpt-oss
Signed-off-by: Kobe Chen <[email protected]>
1 parent fa37e38 commit 1640c9f

File tree

1 file changed

+3
-1
lines changed

1 file changed

+3
-1
lines changed

_posts/2025-08-05-gpt-oss-support.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -56,6 +56,9 @@ vLLM requires nightly built PyTorch to serve GPT models. To ensure compatibility
5656
Install LMCache from source (this command may take a few minutes due to CUDA kernel compilations):
5757

5858
```bash
59+
git clone https://github.com/LMCache/lmcache.github.io.git
60+
cd lmcache
61+
5962
# In your virtual environment
6063
ENABLE_CXX11_ABI=1 uv pip install -e . --no-build-isolation
6164
```
@@ -84,7 +87,6 @@ max_local_cpu_size: 80
8487
```bash
8588
LMCACHE_CONFIG_FILE="./backend_cpu.yaml" \
8689
LMCACHE_USE_EXPERIMENTAL=True \
87-
CUDA_VISIBLE_DEVICES=6,7 \
8890
vllm serve \
8991
openai/gpt-oss-120b \
9092
--max-model-len 32768 \

0 commit comments

Comments
 (0)