-
Notifications
You must be signed in to change notification settings - Fork 2.1k
Open
Labels
questionFurther information is requestedFurther information is requested
Description
Prerequisites
- I will write this issue in English (see our Language Policy)
- I have searched existing issues and discussions
- I have read the documentation or asked an AI coding agent with this project's GitHub URL loaded and couldn't find the answer
- This is a question (not a bug report or feature request)
Question
When using Oh My OpenCode Gateway, if a model API (OpenAI / Gemini / OpenRouter / NVIDIA, etc.) returns a rate‑limit error such as:
This request would exceed your account's rate limit. Please try again later.
[retrying in 3h 39m attempt #1]
Gateway enters a multi‑hour retry loop and does not switch to any other available model, even if multiple models are configured.
This results in:
- The entire request pipeline being blocked
- All subsequent requests stuck behind the retry queue
- No automatic fallback
- User forced to wait hours until the rate‑limit window resets
From a user perspective, Gateway becomes effectively “locked” by a single model’s 429.
Expected Behavior
When a model returns 429 / rate limit exceeded, Gateway should:
- Automatically fallback to the next available model
- Or allow users to configure a fallback order
- Or at least avoid multi‑hour retry queues
- Or provide a config option to disable long retry behavior
Any of these would prevent the system from becoming unusable.
Actual Behavior
- Gateway receives a 429
- Gateway enters a long retry window (1–4 hours)
- Gateway does not switch to other models
- All requests remain blocked until the retry window ends
Steps to Reproduce
- Configure multiple models (e.g., OpenAI + Gemini + Claude)
- Use a model with insufficient quota or strict RPM limits
- Trigger a rate‑limit error
- Gateway outputs:
This request would exceed your account's rate limit. Please try again later.
[retrying in 3h 39m attempt #1]
- Gateway does not fallback and becomes stuck in retry mode
Suggestions
- Add automatic model fallback
- Allow users to define a fallback priority list
- Provide a config option to disable long retry queues
- Fail fast on 429 instead of blocking the entire pipeline
Context
No response
Doctor Output (Optional)
Question Category
Configuration
Additional Information
No response
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
questionFurther information is requestedFurther information is requested