Skip to content

Conversation

@kevint-cerebras
Copy link

@kevint-cerebras kevint-cerebras commented Oct 29, 2025

Added zai-glm-4.6 as a Cerebras model. It will become available on the platform November 5th.

Pre-Submission Checklist

  • Issue Linked: This PR is linked to an approved GitHub Issue (see "Related GitHub Issue" above).
  • Scope: My changes are focused on the linked issue (one major feature/fix per PR).
  • Self-Review: I have performed a thorough self-review of my code.
  • Testing: New and/or updated tests have been added to cover my changes (if applicable).
  • Documentation Impact: I have considered if my changes require documentation updates (see "Documentation Updates" section below).
  • Contribution Guidelines: I have read and agree to the Contributor Guidelines.

Important

Add zai-glm-4.6 model to Cerebras and set it as default in cerebras.ts.

  • Models:
    • Add zai-glm-4.6 to cerebrasModels in cerebras.ts with attributes: maxTokens: 40000, contextWindow: 128000, supportsImages: false, supportsPromptCache: false, inputPrice: 0, outputPrice: 0, description: "Highly intelligent general-purpose model with ~2000 tokens/s".
    • Set cerebrasDefaultModelId to zai-glm-4.6 in cerebras.ts.

This description was created by Ellipsis for 7e7c7fe. You can customize this summary. It will automatically update as commits are pushed.

- Add zai-glm-4.6 with 128K context window and 40K max tokens
- Set zai-glm-4.6 as default Cerebras model
- Update gpt-oss-120b to 128K context and 40K max tokens
- Add zai-glm-4.6 with 128K context window and 40K max tokens
- Set zai-glm-4.6 as default Cerebras model
- Model provides ~2000 tokens/s for general-purpose tasks
@dosubot dosubot bot added size:S This PR changes 10-29 lines, ignoring generated files. enhancement New feature or request labels Oct 29, 2025
@roomote
Copy link

roomote bot commented Oct 29, 2025

PR Review Summary

All previously identified issues have been resolved:

  • Default model availability: The PR no longer sets zai-glm-4.6 as the default Cerebras model. The default remains gpt-oss-120b, preventing API errors for users. The new model is only added to the available models list and will become selectable when it launches on November 5th.

No new issues were identified in the latest changes.


Follow Along on Roo Code Cloud

@hannesrudolph hannesrudolph added the Issue/PR - Triage New issue. Needs quick review to confirm validity and assign labels. label Oct 29, 2025
export type CerebrasModelId = keyof typeof cerebrasModels

export const cerebrasDefaultModelId: CerebrasModelId = "qwen-3-coder-480b-free"
export const cerebrasDefaultModelId: CerebrasModelId = "zai-glm-4.6"
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Setting this unreleased model as the default will break the Cerebras provider for all users until November 5th. According to the PR description, zai-glm-4.6 becomes available on November 5th, but this PR is being merged on October 29th. Users who select Cerebras as their provider will encounter API errors for approximately one week because the default model doesn't exist yet. Consider keeping qwen-3-coder-480b-free as the default until the new model is actually available on the platform.

Copy link
Member

@daniel-lxs daniel-lxs left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you!

@daniel-lxs
Copy link
Member

Created this PR #8920 to fix the failing unit test

@daniel-lxs daniel-lxs closed this Oct 29, 2025
@github-project-automation github-project-automation bot moved this from Triage to Done in Roo Code Roadmap Oct 29, 2025
@github-project-automation github-project-automation bot moved this from New to Done in Roo Code Roadmap Oct 29, 2025
Copy link

@seduerr91 seduerr91 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request Issue/PR - Triage New issue. Needs quick review to confirm validity and assign labels. lgtm This PR has been approved by a maintainer size:S This PR changes 10-29 lines, ignoring generated files.

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

4 participants