Implement Multi-Cloud and Hybrid Cloud Routing Support #197
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This PR implements comprehensive multi-cloud and hybrid cloud routing support for the semantic router, enabling intelligent routing across multiple clusters, cloud providers, and deployment environments.
Overview
The semantic router now supports sophisticated routing decisions beyond single-cluster deployments. This enhancement enables enterprises to:
Key Features
🏗️ Configuration Extensions
inter_cluster_routing
configuration section with cluster discovery, providers, and routing strategies🎯 Intelligent Routing Strategies
Priority-based routing with sophisticated condition evaluation:
🛡️ Fault Tolerance & Reliability
⚡ Performance & Cost Optimization
Implementation Details
Configuration Example
Integration Points
handleModelRouting
with fallback to local endpointsRouterConfig
while maintaining full backward compatibilityintercluster
package with comprehensive routing logic and condition evaluationUse Cases Enabled
1. On-Premises + Cloud Hybrid
Route sensitive data to on-premises clusters while using cloud providers for general queries:
2. Multi-Region GDPR Compliance
Automatically ensure EU user data stays in EU clusters:
3. Cost Optimization
Route to the most cost-effective clusters based on token pricing:
Testing & Documentation
Files Changed
pkg/config/config.go
- Extended configuration structures for multi-cloud routingpkg/extproc/router.go
- Added inter-cluster router initializationpkg/extproc/request_handler.go
- Integrated inter-cluster routing into request processingpkg/intercluster/router.go
- New routing engine with strategy evaluationpkg/intercluster/router_test.go
- Comprehensive test coverageconfig/multi-cloud-config-example.yaml
- Complete configuration examplewebsite/docs/getting-started/multi-cloud-routing.md
- User documentationThis implementation enables enterprise-grade routing across complex, distributed LLM infrastructure while maintaining the simplicity and intelligence that makes the semantic router powerful.
Fixes #196.
✨ Let Copilot coding agent set things up for you — coding agent works faster and does higher quality work when set up for your repo.