RFC: Upstream a Tier Based Rate Limiter. #7247
michaelfeil
started this conversation in
General
Replies: 1 comment 1 reply
-
|
@michaelfeil can you share a bit more on the conceptual requirements, for example:
|
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
I would like to have a per model, frontend set-able.
I require to expose this function to python.
We would prefer this metric would be a global. For now, setting this in every frontend.
We would like to use the routers information on requests in flight, so exposing it in python gives the user a lot of freedom. We are running the code from the frontend (our frontend is in python).
I want to be able to make soft-rate limits this way, e.g. allowing 100% traffic under normal hours, or 200% during off-peak hours. Idea is e.g. that you can serve two user groups (free and paid) and only run free tier in some hours.
Beta Was this translation helpful? Give feedback.
All reactions