Commit 54023cc

eloycoto authored and elai-shalev committed
fix: Google vertex ai fixes in docker-compose
The access token is not valid with litellm, so this change moves to credentials supplied through docker-compose secrets, which makes it easy to use a specific service account (SA).
Signed-off-by: Eloy Coto <eloy.coto@acalustra.com>
1 parent 68b76d0 commit 54023cc

2 files changed, +14 -2 lines changed

README.md

Lines changed: 7 additions & 1 deletion
@@ -17,10 +17,16 @@ This project creates a Docker Compose setup with two main services:
 
 ### Before starting env variables:
 
+
 ```
 export VLLM_URL="http://SERVER_IP:11434/v1"
 export VERTEXAI_PROJECT="myproject"
-export VERTEXAI_ACCESS_TOKEN=$(gcloud auth print-access-token)
+```
+
+If using google, check that your google cloud account is running correctly:
+
+```
+gcloud auth list
 ```
 
 ### Using podman-compose
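
After this change the README asks for two exported variables, `VLLM_URL` and `VERTEXAI_PROJECT`. A minimal preflight sketch, assuming you want to catch a missing export before starting the stack, could look like the following. The helper name `missing_env_vars` is hypothetical and not part of this repository.

```python
import os

# Variables the README asks you to export before starting the stack.
REQUIRED_VARS = ("VLLM_URL", "VERTEXAI_PROJECT")


def missing_env_vars(env=None, required=REQUIRED_VARS):
    """Return the names in `required` that are unset or empty in `env`.

    `env` defaults to the current process environment; passing a plain
    dict makes the check easy to exercise without touching os.environ.
    """
    env = os.environ if env is None else env
    return [name for name in required if not env.get(name)]
```

Calling `missing_env_vars()` in the shell session where you ran the `export` lines returns an empty list when both variables are set, and the names of the missing ones otherwise.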

docker-compose.yaml

Lines changed: 7 additions & 1 deletion
@@ -1,3 +1,7 @@
+secrets:
+  gcp-credentials:
+    file: ~/.config/gcloud/application_default_credentials.json
+
 services:
   llama-stack:
     image: docker.io/llamastack/distribution-starter:0.2.18
@@ -10,14 +14,16 @@ services:
     entrypoint: ["python", "-m", "llama_stack.core.server.server"]
     command: ["/app-root/run.yaml", "--port", "8321"]
     environment:
-      - VERTEXAI_ACCESS_TOKEN=${VERTEXAI_ACCESS_TOKEN}
       - VERTEXAI_PROJECT=${VERTEXAI_PROJECT}
+      - GOOGLE_APPLICATION_CREDENTIALS=/run/secrets/gcp-credentials
       - LLAMA_STACK_LOG=DEBUG
       - VLLM_URL=${VLLM_URL}
       - VLLM_API_TOKEN=test
       - VLLM_MAX_TOKENS=4096
       - VLLM_TLS_VERIFY=false
       - INFERENCE_MODEL=gemma3:27b-it-qat
+    secrets:
+      - gcp-credentials
     healthcheck:
       test: ["CMD", "curl", "-f", "http://localhost:8321/v1/health"]
       interval: 5s
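
The compose change mounts the host credentials file as the `gcp-credentials` secret and points `GOOGLE_APPLICATION_CREDENTIALS` at `/run/secrets/gcp-credentials`. The sketch below is one way to confirm which kind of credentials a given file holds, either on the host or inside the container. It assumes the file is a Google credentials JSON document with a top-level `type` field (for example `authorized_user` or `service_account`). The function name `describe_credentials` is hypothetical.

```python
import json
import os
from pathlib import Path


def describe_credentials(path=None):
    """Return the "type" field of a Google credentials JSON file.

    When `path` is None, fall back to GOOGLE_APPLICATION_CREDENTIALS,
    which the compose file sets to /run/secrets/gcp-credentials.
    """
    path = path or os.environ.get("GOOGLE_APPLICATION_CREDENTIALS")
    if not path:
        raise ValueError("GOOGLE_APPLICATION_CREDENTIALS is not set")
    cred_file = Path(path).expanduser()
    if not cred_file.is_file():
        raise FileNotFoundError(f"credentials file not found: {cred_file}")
    # Parse the file; a malformed file raises json.JSONDecodeError here.
    data = json.loads(cred_file.read_text())
    return data.get("type", "unknown")
```

On the host you might call `describe_credentials("~/.config/gcloud/application_default_credentials.json")` before starting the stack. Inside the container, calling it with no argument reads the path from the environment variable set in the compose file.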