
handle the server-overloaded case #261

Merged
mtblanton merged 6 commits into main from timeout-error-handling
Apr 2, 2025

Conversation

@mtblanton
Contributor

Depends on #260

This PR handles the `server-overloaded` case from the infini-gram API and raises a relevant error for the client.
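For readers without the diff at hand, here is a minimal sketch of the general pattern described above, not the code merged in this PR. It assumes the upstream response is a JSON object whose `error` field contains the string `server-overloaded`; the field name, the `InfiniGramServerOverloadedError` class, the `raise_if_overloaded` helper, and the 503 status are all hypothetical names chosen for illustration.

```python
from typing import Any, Mapping

# Assumed marker; the real infini-gram API may use a different field or wording.
OVERLOADED_MARKER = "server-overloaded"


class InfiniGramServerOverloadedError(Exception):
    """Hypothetical error raised when the upstream API reports it is overloaded."""

    # Assumed HTTP status an API layer could return to its own client.
    status_code = 503


def raise_if_overloaded(payload: Mapping[str, Any]) -> Mapping[str, Any]:
    """Return the payload unchanged, or raise if it signals an overloaded server."""
    error = payload.get("error")
    if isinstance(error, str) and OVERLOADED_MARKER in error.lower():
        raise InfiniGramServerOverloadedError(
            "infini-gram is overloaded; please try again later"
        )
    return payload
```

A caller would pass each decoded upstream response through `raise_if_overloaded` and map the exception to a retryable error response, so the client sees an explicit "try again later" signal instead of a generic failure.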

@mtblanton mtblanton requested review from liujch1998 and yensung April 1, 2025 22:39
@mtblanton mtblanton self-assigned this Apr 1, 2025
Base automatically changed from update-infini-gram-client to main April 2, 2025 16:19
@mtblanton mtblanton merged commit 4eb0dd1 into main Apr 2, 2025
3 checks passed
@mtblanton mtblanton deleted the timeout-error-handling branch April 2, 2025 19:31
yensung added a commit that referenced this pull request Apr 4, 2025
* main:
  increase timeout to receive first token (#263)
  handle the server-overloaded case (#261)
mtblanton added a commit that referenced this pull request Jun 13, 2025
Depends on #260

This PR handles the `server-overloaded` case from the infini-gram API
and raises a relevant error for the client.

2 participants