We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
2 parents a7cae1d + f2ae25f commit 7026623Copy full SHA for 7026623
inference/trillium/vLLM/Llama3.x/README.md
@@ -80,7 +80,7 @@ gcloud compute tpus tpu-vm ssh $TPU_NAME --project $PROJECT --zone=$ZONE
80
## Step 3: Use the latest vLLM Docker image for TPU
81
82
```bash
83
-export DOCKER_URI=vllm/vllm-tpu:nightly-20251129-28607fc-39e63de
+export DOCKER_URI=vllm/vllm-tpu:v0.12.0
84
```
85
86
The docker image is pinned here for users to reproduce the [results below](#section-benchmarking).
0 commit comments