Rate Limit seems to be global, not per config profile, but many models have different rate limits #2287

### App Version

3.11.5

### API Provider

Google Gemini

### Model Used

Gemini 2.0 and Gemini 2.5 Pro.

### Actual vs. Expected Behavior

When I set up different profiles for different LLMs, I was disappointed to see that the rate limit (under advanced options) was a global value.

A single global value doesn't make sense here; the rate limit should be tied to the model configuration profile.
Gemini 2.0 has a (free) rate limit of 15 requests per minute (1 every 4 seconds), but 2.5 Pro has 5 (was 2) requests per minute (1 every 12 seconds). I would like to set a different rate limit on each profile, matching the model that profile uses. That makes a lot more sense than having to change the global config every time I switch to a different model.
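
To make the request concrete, here is a rough sketch of what per-profile rate limiting could look like. It is only an illustration, not how the app is actually implemented: `Profile`, `requests_per_minute`, `min_interval_seconds`, and `ProfileRateLimiter` are all hypothetical names I made up for this example, and the numbers are the free-tier limits quoted above.

```python
from dataclasses import dataclass


@dataclass
class Profile:
    """Hypothetical config profile that carries its own rate limit."""
    name: str
    requests_per_minute: int


def min_interval_seconds(profile: Profile) -> float:
    """Minimum spacing between requests implied by the profile's limit."""
    return 60.0 / profile.requests_per_minute


class ProfileRateLimiter:
    """Tracks the last request time separately for each profile."""

    def __init__(self) -> None:
        self._last_request: dict[str, float] = {}

    def wait_time(self, profile: Profile, now: float) -> float:
        """Seconds to wait before this profile may send another request."""
        last = self._last_request.get(profile.name)
        if last is None:
            return 0.0  # no request sent on this profile yet
        return max(0.0, last + min_interval_seconds(profile) - now)

    def record(self, profile: Profile, now: float) -> None:
        """Remember when this profile last sent a request."""
        self._last_request[profile.name] = now


# Assumed example profiles using the limits mentioned in this issue.
gemini_20 = Profile("gemini-2.0", requests_per_minute=15)
gemini_25_pro = Profile("gemini-2.5-pro", requests_per_minute=5)
```

With something like this, switching from the 2.0 profile to the 2.5 Pro profile would automatically move the delay from 4 seconds to 12 seconds, with no change to any global setting.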

