Commit 970f349
feat(deployment): add startupProbe for slow model loading (#809)
Add startupProbe configuration to handle slow container startup
caused by ML model loading (embeddings, classifiers, LoRA).
Total startup time can reach ~60 seconds. Without startupProbe,
liveness/readiness probes start checking at 30s and may kill the
pod before it becomes ready.
Changes:
- Add startupProbe to Kubernetes deployment manifest
- Add startupProbe to Helm chart (values.yaml + template)
- Increase memory limit to 7Gi for full model set
The startupProbe allows up to 300 seconds for startup before
liveness/readiness probes begin checking.
Fixes #784
Signed-off-by: noalimoy <[email protected]>
Co-authored-by: Huamin Chen <[email protected]>1 parent 249812f commit 970f349
File tree
3 files changed
+35
-6
lines changed- deploy
- helm/semantic-router
- templates
- kubernetes/ai-gateway/semantic-router
3 files changed
+35
-6
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
110 | 110 | | |
111 | 111 | | |
112 | 112 | | |
| 113 | + | |
| 114 | + | |
| 115 | + | |
| 116 | + | |
| 117 | + | |
| 118 | + | |
| 119 | + | |
| 120 | + | |
| 121 | + | |
113 | 122 | | |
114 | 123 | | |
115 | 124 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
121 | 121 | | |
122 | 122 | | |
123 | 123 | | |
124 | | - | |
| 124 | + | |
125 | 125 | | |
126 | 126 | | |
127 | 127 | | |
| |||
202 | 202 | | |
203 | 203 | | |
204 | 204 | | |
| 205 | + | |
| 206 | + | |
| 207 | + | |
| 208 | + | |
| 209 | + | |
| 210 | + | |
| 211 | + | |
| 212 | + | |
| 213 | + | |
| 214 | + | |
| 215 | + | |
| 216 | + | |
| 217 | + | |
205 | 218 | | |
206 | 219 | | |
207 | 220 | | |
| |||
Lines changed: 12 additions & 5 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
137 | 137 | | |
138 | 138 | | |
139 | 139 | | |
| 140 | + | |
| 141 | + | |
| 142 | + | |
| 143 | + | |
| 144 | + | |
| 145 | + | |
| 146 | + | |
| 147 | + | |
140 | 148 | | |
141 | 149 | | |
142 | 150 | | |
| |||
152 | 160 | | |
153 | 161 | | |
154 | 162 | | |
155 | | - | |
| 163 | + | |
156 | 164 | | |
157 | 165 | | |
158 | | - | |
159 | | - | |
| 166 | + | |
| 167 | + | |
160 | 168 | | |
161 | | - | |
162 | | - | |
| 169 | + | |
163 | 170 | | |
164 | 171 | | |
165 | 172 | | |
| |||
0 commit comments