Skip to content

Commit 6633c20

Browse files
yuezhu1Liav Weiss
authored andcommitted
Merge pull request vllm-project#741 from samzong/docs/improve-install-guide
docs(installation): update model_config examples and clarify vLLM backend setup
2 parents 519d4d8 + b62ba97 commit 6633c20

File tree

13 files changed

+1709
-45
lines changed

13 files changed

+1709
-45
lines changed

.github/workflows/integration-test-k8s.yml

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -10,6 +10,7 @@ on:
1010
push:
1111
branches:
1212
- main
13+
- feature/production-stack-profile
1314
paths-ignore:
1415
- 'website/**'
1516
- '**/*.md'
@@ -22,7 +23,7 @@ jobs:
2223
strategy:
2324
fail-fast: false # Continue testing other profiles even if one fails
2425
matrix:
25-
profile: [ai-gateway, aibrix, routing-strategies, llm-d]
26+
profile: [production-stack]
2627

2728
steps:
2829
- name: Check out the repo

e2e/README.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -15,7 +15,7 @@ The framework follows a **separation of concerns** design:
1515
- **ai-gateway**: Tests Semantic Router with Envoy AI Gateway integration
1616
- **aibrix**: Tests Semantic Router with vLLM AIBrix integration
1717
- **istio**: Tests Semantic Router with Istio Gateway (future)
18-
- **production-stack**: Tests vLLM Production Stack configurations (future)
18+
- **production-stack**: Tests vLLM Production Stack configurations
1919
- **llm-d**: Tests Semantic Router with LLM-D distributed inference
2020
- **dynamo**: Tests with Nvidia Dynamo (future)
2121

@@ -120,6 +120,7 @@ make e2e-test
120120

121121
```bash
122122
make e2e-test E2E_PROFILE=ai-gateway
123+
make e2e-test E2E_PROFILE=production-stack
123124
```
124125

125126
### Run specific test cases

e2e/cmd/e2e/main.go

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -13,12 +13,14 @@ import (
1313
aibrix "github.com/vllm-project/semantic-router/e2e/profiles/aibrix"
1414
dynamicconfig "github.com/vllm-project/semantic-router/e2e/profiles/dynamic-config"
1515
llmd "github.com/vllm-project/semantic-router/e2e/profiles/llm-d"
16+
productionstack "github.com/vllm-project/semantic-router/e2e/profiles/production-stack"
1617
routingstrategies "github.com/vllm-project/semantic-router/e2e/profiles/routing-strategies"
1718

1819
// Import profiles to register test cases
1920
_ "github.com/vllm-project/semantic-router/e2e/profiles/ai-gateway"
2021
_ "github.com/vllm-project/semantic-router/e2e/profiles/aibrix"
2122
_ "github.com/vllm-project/semantic-router/e2e/profiles/llm-d"
23+
_ "github.com/vllm-project/semantic-router/e2e/profiles/production-stack"
2224
_ "github.com/vllm-project/semantic-router/e2e/profiles/routing-strategies"
2325
)
2426

@@ -109,6 +111,8 @@ func getProfile(name string) (framework.Profile, error) {
109111
return aibrix.NewProfile(), nil
110112
case "llm-d":
111113
return llmd.NewProfile(), nil
114+
case "production-stack":
115+
return productionstack.NewProfile(), nil
112116
case "routing-strategies":
113117
return routingstrategies.NewProfile(), nil
114118
// Add more profiles here as they are implemented

0 commit comments

Comments
 (0)