You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
when an inference request is received. If the health check fails, the entire
324
+
model will be unloaded, so it becomes NOT Ready at the server.
325
+
326
+
The Health Check is disabled by default. To enable it, set the following
327
+
parameter on the model config to true
328
+
```
329
+
parameters: {
330
+
key: "ENABLE_VLLM_HEALTH_CHECK"
331
+
value: { string_value: "true" }
332
+
}
333
+
```
334
+
and select
335
+
[Model Control Mode EXPLICIT](https://github.com/triton-inference-server/server/blob/main/docs/user_guide/model_management.md#model-control-mode-explicit)
0 commit comments