You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: deploy/kubernetes/README.md
+53-15Lines changed: 53 additions & 15 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,6 +1,6 @@
1
1
# Semantic Router Kubernetes Deployment
2
2
3
-
This directory contains Kubernetes manifests for deploying the Semantic Router using Kustomize.
3
+
Kustomize manifests for deploying the Semantic Router and its observability stack (Prometheus, Grafana, Dashboard, optional Open WebUI + Pipelines) on Kubernetes.
4
4
5
5
## Architecture
6
6
@@ -12,8 +12,9 @@ The deployment consists of:
12
12
-**Init Container**: Downloads/copies model files to persistent volume
13
13
-**Main Container**: Runs the semantic router service
14
14
-**Services**:
15
-
- Main service exposing gRPC port (50051), Classification API (8080), and metrics port (9190)
16
-
- Separate metrics service for monitoring
15
+
- Main service exposing gRPC (50051), Classification API (8080), and metrics (9190)
16
+
- Separate metrics service for monitoring (`semantic-router-metrics`)
17
+
- Observability services (Grafana, Prometheus, Dashboard, optional Open WebUI)
17
18
18
19
## Ports
19
20
@@ -23,17 +24,40 @@ The deployment consists of:
23
24
24
25
## Quick Start
25
26
26
-
### Standard Kubernetes Deployment
27
+
### Deploy Core (Router)
27
28
28
29
```bash
29
30
kubectl apply -k deploy/kubernetes/
30
31
31
32
# Check deployment status
32
-
kubectl get pods -l app=semantic-router -n semantic-router
33
-
kubectl get services -l app=semantic-router -n semantic-router
33
+
kubectl get pods -l app=semantic-router -n vllm-semantic-router-system
34
+
kubectl get services -l app=semantic-router -n vllm-semantic-router-system
0 commit comments