@roomote roomote bot commented Oct 16, 2025

This PR updates the context window for Claude Sonnet models in Vertex AI from 200k to 1M tokens.

Changes

  • Updated claude-sonnet-4@20250514 context window from 200,000 to 1,000,000 tokens
  • Updated claude-sonnet-4-5@20250929 context window from 200,000 to 1,000,000 tokens
  • Added comprehensive test coverage for both models
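
As a rough illustration of the change listed above, the sketch below shows what the updated entries might look like. The `ModelInfoSketch` interface and the `vertexModelsSketch` name are hypothetical stand-ins; the actual type and object in the repository are not shown in this conversation and likely carry more fields (pricing, output limits, and so on). Only the model IDs and the two context window values come from the PR description.

```typescript
// Hypothetical, simplified shape: the real model info type is assumed to have more fields.
interface ModelInfoSketch {
  contextWindow: number; // maximum context size, in tokens
}

// Hypothetical stand-in for the Vertex model table touched by this PR.
const vertexModelsSketch: Record<string, ModelInfoSketch> = {
  "claude-sonnet-4@20250514": { contextWindow: 1_000_000 }, // previously 200_000
  "claude-sonnet-4-5@20250929": { contextWindow: 1_000_000 }, // previously 200_000
};
```

Numeric separators (`1_000_000`) are used here only for readability; whether the repository writes the literal this way is an assumption.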

Testing

  • All existing tests pass
  • Added new tests specifically for the 1M context window on both models
  • Tests verify correct context window values are returned
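
The checks described above could be sketched roughly as follows. This is not the test code from the PR: `contextWindowsSketch`, `getContextWindowSketch`, and `verifyContextWindowsSketch` are hypothetical names, and the sketch avoids any test framework so it stays self-contained. The real tests presumably go through the project's own handler and test runner.

```typescript
// Hypothetical lookup table standing in for the Vertex model definitions.
const contextWindowsSketch: Record<string, number> = {
  "claude-sonnet-4@20250514": 1_000_000,
  "claude-sonnet-4-5@20250929": 1_000_000,
};

// Hypothetical helper: returns the configured context window or throws for unknown IDs.
function getContextWindowSketch(modelId: string): number {
  const value = contextWindowsSketch[modelId];
  if (value === undefined) {
    throw new Error(`Unknown model: ${modelId}`);
  }
  return value;
}

// Collects a message for every model whose context window is not exactly 1,000,000 tokens.
function verifyContextWindowsSketch(): string[] {
  const expected = 1_000_000;
  const failures: string[] = [];
  for (const modelId of Object.keys(contextWindowsSketch)) {
    const actual = getContextWindowSketch(modelId);
    if (actual !== expected) {
      failures.push(`${modelId}: expected ${expected}, got ${actual}`);
    }
  }
  return failures;
}
```

Checking the exact value, rather than only that it exceeds 200,000, matches the stated intent that tests verify the correct context window values are returned.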

Context

This change aligns with the increased context window capacity now available for Claude Sonnet models on Vertex AI.

Requested via GitHub comment: "@roomote any way you can create a PR here?"


Important

Update context window for Claude Sonnet models to 1M tokens and add tests to verify changes.

  • Behavior:
    • Update contextWindow for claude-sonnet-4@20250514 and claude-sonnet-4-5@20250929 from 200,000 to 1,000,000 tokens in vertex.ts.
    • Add tests in anthropic-vertex.spec.ts to verify 1M context window for both models.
  • Testing:
    • All existing tests pass.
    • New tests ensure correct contextWindow values are returned for updated models.

This description was created by Ellipsis for eefe194.

- Update contextWindow from 200k to 1M for claude-sonnet-4@20250514
- Update contextWindow from 200k to 1M for claude-sonnet-4-5@20250929
- Add tests to verify 1M context window configuration
- Addresses issue #8671 per Google Vertex AI documentation
@roomote roomote bot requested review from cte, jr and mrubens as code owners October 16, 2025 17:32
@dosubot dosubot bot added the size:M (this PR changes 30-99 lines, ignoring generated files) and enhancement (new feature or request) labels Oct 16, 2025

roomote bot commented Oct 16, 2025

Review Summary

No issues found - This PR is ready to merge.

Changes Reviewed

  • Updated contextWindow from 200k to 1M tokens for claude-sonnet-4@20250514 and claude-sonnet-4-5@20250929
  • Added comprehensive test coverage for both models

Verification

  • ✅ Model schema correctly allows numeric context window values
  • ✅ Tests verify the exact expected values (1,000,000 tokens)
  • ✅ Changes align with Google Vertex AI documentation
  • ✅ No breaking changes introduced
  • ✅ Follows existing codebase patterns

@hannesrudolph hannesrudolph added the Issue/PR - Triage label (new issue; needs quick review to confirm validity and assign labels) Oct 16, 2025
@nickcatal

@mrubens @cte Can y'all approve this and get it in the next release? Thanks!
