feat: optimize Bedrock model configuration performance [close: #5419, #5420] #5434

KevinZhao · 2025-07-06T15:01:47Z

1. Add ModelConfigCache class for model config caching strategy
2. Remove deprecated legacy model definitions
3. Optimize cross-region inference and custom ARN handling
4. Add comprehensive test suite for cache functionality

Related GitHub Issue

Roo Code Task Context (Optional)

Description

Adds a simple cache for the model configuration to avoid repeated invocations of getModel().
Removes some legacy models and embedding models.

Test Procedure

Pre-Submission Checklist

Issue Linked: This PR is linked to an approved GitHub Issue (see "Related GitHub Issue" above).
Scope: My changes are focused on the linked issue (one major feature/fix per PR).
Self-Review: I have performed a thorough self-review of my code.
Testing: New and/or updated tests have been added to cover my changes (if applicable).
Documentation Impact: I have considered if my changes require documentation updates (see "Documentation Updates" section below).
Contribution Guidelines: I have read and agree to the Contributor Guidelines.

Screenshots / Videos

Documentation Updates

Additional Notes

Get in Touch

Important

Introduces caching for model configurations in AwsBedrockHandler, removes deprecated models, and adds tests for cache functionality.

Caching:
- Introduces ModelConfigCache class in bedrock.ts for caching model configurations.
- AwsBedrockHandler uses ModelConfigCache to cache results of getModel().
- Cache key generated based on ProviderSettings.
- Cache invalidated when configuration changes.
Model Definitions:
- Removes deprecated model definitions from bedrockModels in bedrock.ts.
Testing:
- Adds bedrock-cache-strategy.spec.ts to test caching functionality.
- Tests cover cache retrieval, cache key generation, cache invalidation, and performance benefits.
Misc:
- Optimizes cross-region inference and custom ARN handling in AwsBedrockHandler.
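
A minimal sketch of the caching points above (a key derived from settings, and invalidation when the configuration changes). The `SettingsSketch` shape, its field names, and the `SketchConfigCache` class are assumptions made for illustration; they are not the PR's actual `ProviderSettings` or `ModelConfigCache` code.

```typescript
// Hypothetical, simplified stand-in for the settings fields that affect model resolution.
interface SettingsSketch {
	apiModelId?: string
	awsRegion?: string
	awsCustomArn?: string
	awsUseCrossRegionInference?: boolean
}

// Hypothetical result type for a resolved model configuration.
interface ModelConfigSketch {
	id: string
	region: string
}

// Build a stable key: drop undefined fields and sort by name so property order does not matter.
function cacheKeyFor(settings: SettingsSketch): string {
	const entries = Object.entries(settings)
		.filter(([, value]) => value !== undefined)
		.sort(([a], [b]) => a.localeCompare(b))
	return JSON.stringify(entries)
}

// Single-entry cache: a changed key means the configuration changed, so the entry is recomputed.
class SketchConfigCache {
	private key: string | undefined
	private value: ModelConfigSketch | undefined
	public computeCount = 0

	get(settings: SettingsSketch, compute: () => ModelConfigSketch): ModelConfigSketch {
		const key = cacheKeyFor(settings)
		if (this.value === undefined || key !== this.key) {
			this.value = compute()
			this.key = key
			this.computeCount++
		}
		return this.value
	}
}
```

With this shape, two settings objects that differ only in property order share one cached entry, while any change to a relevant field forces a recompute.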

This description was created by the ellipsis-dev bot for e81ff4d. It will automatically update as commits are pushed.


daniel-lxs

Hey @KevinZhao, I reviewed your PR and left a couple of suggestions.

Let me know if you have any questions!

daniel-lxs · 2025-07-10T19:46:07Z

src/api/providers/bedrock.ts

+		logger.debug("Computed and cached new model config", {
+			ctx: "bedrock",
+			modelId: config.id,
+		})


I'm wondering about memory management for long-running applications. Since cache entries are never evicted, could this potentially lead to memory issues if many different model configurations are used over time? Would it make sense to implement a cache size limit or TTL (time-to-live) for entries?
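
One possible shape for the size limit and TTL raised here, as a rough sketch only. `BoundedTtlCache`, `maxEntries`, `ttlMs`, and the injectable `now` clock are hypothetical names, not part of the PR; the sketch assumes `maxEntries` is at least 1.

```typescript
// Hypothetical bounded cache: entries expire after ttlMs, and the least recently
// used entry is evicted once maxEntries is reached.
class BoundedTtlCache<V> {
	private entries = new Map<string, { value: V; expiresAt: number }>()
	private readonly maxEntries: number
	private readonly ttlMs: number
	private readonly now: () => number

	constructor(maxEntries: number, ttlMs: number, now: () => number = () => Date.now()) {
		this.maxEntries = maxEntries
		this.ttlMs = ttlMs
		this.now = now
	}

	get(key: string): V | undefined {
		const entry = this.entries.get(key)
		if (!entry) return undefined
		if (entry.expiresAt <= this.now()) {
			// Expired: drop the entry and report a miss.
			this.entries.delete(key)
			return undefined
		}
		// Re-insert so the Map's insertion order reflects recency of use.
		this.entries.delete(key)
		this.entries.set(key, entry)
		return entry.value
	}

	set(key: string, value: V): void {
		this.entries.delete(key)
		if (this.entries.size >= this.maxEntries) {
			// The first key in insertion order is the least recently used one.
			const oldest = this.entries.keys().next().value
			if (oldest !== undefined) this.entries.delete(oldest)
		}
		this.entries.set(key, { value, expiresAt: this.now() + this.ttlMs })
	}

	get size(): number {
		return this.entries.size
	}
}
```

Passing the clock in as a parameter keeps expiry testable without real delays.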

daniel-lxs · 2025-07-10T19:46:23Z

src/api/providers/bedrock.ts

+		return config
+	}
+
+	private computeModelConfig(): ModelConfigResult {


Should we add error handling for the cache operations? While Map operations are generally safe, if we extend this cache in the future or if there are edge cases, having try-catch blocks could prevent unexpected failures from affecting the main flow.
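
A sketch of what such guarding could look like, assuming a generic cache with `get` and `set`. The `getWithFallback` helper and its `onError` callback are hypothetical, not code from the PR.

```typescript
// Hypothetical helper: a cache failure should never break model resolution.
function getWithFallback<V>(
	cache: { get(key: string): V | undefined; set(key: string, value: V): void },
	key: string,
	compute: () => V,
	onError: (error: unknown) => void = () => {},
): V {
	try {
		const cached = cache.get(key)
		if (cached !== undefined) return cached
	} catch (error) {
		// Read failed: report it, then fall through and compute a fresh value.
		onError(error)
	}

	const value = compute()

	try {
		cache.set(key, value)
	} catch (error) {
		// Write failed: report it, but still return the fresh value.
		onError(error)
	}
	return value
}
```

Errors thrown by `compute` itself are deliberately not swallowed, so genuine configuration problems still surface to the caller.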

daniel-lxs · 2025-07-10T19:46:40Z

src/api/providers/__tests__/bedrock-cache-strategy.spec.ts

+		ConverseCommand: vi.fn(),
+	}
+})
+


Instead of using as any to access private properties, would it be cleaner to either make these properties protected for testing purposes or use a testing utility that provides type-safe access to private members? This would improve type safety in the tests.
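
For illustration, the protected-member option might look like the sketch below. `HandlerSketch` and `TestableHandlerSketch` are hypothetical stand-ins, not the real AwsBedrockHandler.

```typescript
// Hypothetical handler: the cache is protected rather than private.
class HandlerSketch {
	protected configCache = new Map<string, string>()

	getModel(id: string): string {
		let config = this.configCache.get(id)
		if (config === undefined) {
			config = `config-for-${id}`
			this.configCache.set(id, config)
		}
		return config
	}
}

// Test-only subclass that exposes read access to the cache without an `as any` cast.
class TestableHandlerSketch extends HandlerSketch {
	get cachedIds(): string[] {
		return Array.from(this.configCache.keys())
	}
}
```

A test can then construct `TestableHandlerSketch` and assert on `cachedIds` with full type checking.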

daniel-lxs · 2025-07-10T19:46:48Z

src/api/providers/__tests__/bedrock-invokedModelId.spec.ts

+		expect(getModelByIdSpy).toHaveBeenCalledWith("anthropic.claude-3-5-sonnet-20241022-v2:0", "inference-profile")

 		// Verify that getModel returns the updated model info
+		// 在这里模拟 invokedModelId 后模型的 inputPrice 被更新为 8


Could we translate these Chinese comments to English for consistency? The comments explain that this simulates the model's inputPrice being updated to 8 after invokedModelId, which is a key part of the test verification.

daniel-lxs · 2025-07-10T19:46:56Z

src/api/providers/bedrock.ts

The PR description mentions performance optimization, but I don't see any benchmarks or performance metrics. Would it be helpful to add performance tests that demonstrate the actual improvement? This would validate the optimization claims and help prevent performance regressions in the future.
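
A rough sketch of the kind of micro-benchmark this could start from. `expensiveResolve` is a made-up stand-in for the real model resolution work, and all other names here are hypothetical; the timings it produces say nothing about the actual handler.

```typescript
interface ModelSketch {
	id: string
	checksum: number
}

// Hypothetical stand-in for an expensive model resolution step.
function expensiveResolve(id: string): ModelSketch {
	let checksum = 0
	for (let i = 0; i < 20000; i++) checksum = (checksum + i * id.length) % 1000003
	return { id, checksum }
}

// Wrap a resolver with a Map-based cache and count how often the resolver really runs.
function makeCached(resolve: (id: string) => ModelSketch) {
	const cache = new Map<string, ModelSketch>()
	let misses = 0
	return {
		get(id: string): ModelSketch {
			let hit = cache.get(id)
			if (hit === undefined) {
				misses++
				hit = resolve(id)
				cache.set(id, hit)
			}
			return hit
		},
		misses: () => misses,
	}
}

// Run fn `iterations` times and return the elapsed wall-clock time in milliseconds.
function timeMs(iterations: number, fn: () => void): number {
	const start = Date.now()
	for (let i = 0; i < iterations; i++) fn()
	return Date.now() - start
}

function runBenchmark(iterations: number) {
	const cached = makeCached(expensiveResolve)
	const uncachedMs = timeMs(iterations, () => expensiveResolve("model-a"))
	const cachedMs = timeMs(iterations, () => cached.get("model-a"))
	return { uncachedMs, cachedMs, misses: cached.misses() }
}
```

Asserting on `misses` rather than on raw timings keeps a regression test deterministic; the millisecond figures are better logged than asserted.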

feat: optimize Bedrock model configuration performance

e81ff4d

1. Add ModelConfigCache class for model config caching strategy
2. Remove deprecated legacy model definitions
3. Optimize cross-region inference and custom ARN handling
4. Add comprehensive test suite for cache functionality

KevinZhao requested review from cte, jr and mrubens as code owners July 6, 2025 15:01

github-project-automation bot added this to Roo Code Roadmap and Roo Code Roadmap Jul 6, 2025

github-project-automation bot moved this to Triage in Roo Code Roadmap Jul 6, 2025

github-project-automation bot moved this to New in Roo Code Roadmap Jul 6, 2025

dosubot bot added the size:L (This PR changes 100-499 lines, ignoring generated files.) and enhancement (New feature or request) labels Jul 6, 2025

hannesrudolph added the Issue/PR - Triage (New issue. Needs quick review to confirm validity and assign labels.) label Jul 6, 2025

Fix model configuration cache issue in Bedrock handler tests

89d4063

daniel-lxs moved this from Triage to PR [Needs Prelim Review] in Roo Code Roadmap Jul 6, 2025

hannesrudolph added the PR - Needs Preliminary Review label and removed the Issue/PR - Triage (New issue. Needs quick review to confirm validity and assign labels.) label Jul 6, 2025

fix: update test to use current model ID anthropic.claude-3-5-sonnet-…

0ebbe91

…20241022-v2:0

daniel-lxs requested changes Jul 10, 2025

daniel-lxs moved this from PR [Needs Prelim Review] to PR [Changes Requested] in Roo Code Roadmap Jul 10, 2025

hannesrudolph added PR - Changes Requested and removed PR - Needs Preliminary Review labels Jul 10, 2025

hannesrudolph closed this Sep 22, 2025

github-project-automation bot moved this from New to Done in Roo Code Roadmap Sep 22, 2025

github-project-automation bot moved this from PR [Changes Requested] to Done in Roo Code Roadmap Sep 22, 2025

KevinZhao commented Jul 6, 2025 (edited by ellipsis-dev bot)

3 participants