Skip to content

Commit f509a20

Browse files
authored
[DOC] Update production-stack.md (vllm-project#26177)
Signed-off-by: Elieser Pereira <[email protected]>
1 parent 60bc25e commit f509a20

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

docs/deployment/integrations/production-stack.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -55,7 +55,7 @@ sudo kubectl port-forward svc/vllm-router-service 30080:80
5555
And then you can send out a query to the OpenAI-compatible API to check the available models:
5656

5757
```bash
58-
curl -o- http://localhost:30080/models
58+
curl -o- http://localhost:30080/v1/models
5959
```
6060

6161
??? console "Output"
@@ -78,7 +78,7 @@ curl -o- http://localhost:30080/models
7878
To send an actual chatting request, you can issue a curl request to the OpenAI `/completion` endpoint:
7979

8080
```bash
81-
curl -X POST http://localhost:30080/completions \
81+
curl -X POST http://localhost:30080/v1/completions \
8282
-H "Content-Type: application/json" \
8383
-d '{
8484
"model": "facebook/opt-125m",

0 commit comments

Comments
 (0)