We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent e6ac6f6 commit 4a0a832Copy full SHA for 4a0a832
inference/trillium/vLLM/Llama3.3/README.md
@@ -78,10 +78,10 @@ gcloud compute tpus tpu-vm ssh $TPU_NAME --project $PROJECT --zone=$ZONE
78
## Step 3: Use the vllm docker image for TPU
79
80
```bash
81
-export DOCKER_URI=vllm/vllm-tpu:nightly-20251129-28607fc-39e63de
+export DOCKER_URI=vllm/vllm-tpu:v0.12.0
82
```
83
84
-The docker image is pinged here for users to reproduce the [results below](#section-benchmarking).
+The docker image is pinned here for users to reproduce the [results below](#section-benchmarking).
85
86
To use the latest stable version, set `DOCKER_URI=vllm/vllm-tpu:latest`.
87
0 commit comments