Difference between model and per-request timeouts #9206
Unanswered
jamesleeht
asked this question in
Q&A
Replies: 2 comments
-
Request timeout overrides model timeout (more dynamic) |
Beta Was this translation helpful? Give feedback.
0 replies
-
It is run on each fallback - so it'll try each fallback for the request timeout value |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
What's the difference between setting timeout per model vs timeout per request?
https://docs.litellm.ai/docs/proxy/timeout
Does the request timeout override the model timeout? Or does a request timeout span multiple fallbacks (e.g. if I set timeout for model to say 5 seconds, then i set request to 12 seconds, does that mean the router can only retry twice before being cancelled?)
Beta Was this translation helpful? Give feedback.
All reactions