Skip to content

Prod deployment on 1 GPU#178

Merged
vvolhejn merged 6 commits intomainfrom
vv/smaller-deployment
Mar 6, 2026
Merged

Prod deployment on 1 GPU#178
vvolhejn merged 6 commits intomainfrom
vv/smaller-deployment

Conversation

@vvolhejn
Copy link
Copy Markdown
Collaborator

@vvolhejn vvolhejn commented Mar 4, 2026

Serve the LLM using OpenRouter, use GPT-OSS-120b (it's what Cerebras has), update docs

Copy link
Copy Markdown
Collaborator

@gabrieldemarmiesse gabrieldemarmiesse left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me!

@vvolhejn vvolhejn merged commit 69d6ac6 into main Mar 6, 2026
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants