We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent e6ac6f6 commit 83c7fc7Copy full SHA for 83c7fc7
inference/trillium/vLLM/Llama3.3/README.md
@@ -78,7 +78,7 @@ gcloud compute tpus tpu-vm ssh $TPU_NAME --project $PROJECT --zone=$ZONE
78
## Step 3: Use the vllm docker image for TPU
79
80
```bash
81
-export DOCKER_URI=vllm/vllm-tpu:nightly-20251129-28607fc-39e63de
+export DOCKER_URI=vllm/vllm-tpu:v0.12.0
82
```
83
84
The docker image is pinged here for users to reproduce the [results below](#section-benchmarking).
0 commit comments