[0.3.13]
Added
-
External MCP Agent Integration for Custom Tool Extension
- Added External MCP (Model Context Protocol) Agent support that enables integration with custom MCP servers to extend IDP capabilities
- Cross-Account Integration: Host MCP servers in separate AWS accounts or external infrastructure with secure OAuth authentication using AWS Cognito
- Dynamic Tool Discovery: Automatically discovers and integrates available tools from MCP servers through the IDP web interface
- Secure Authentication Flow: Uses AWS Cognito User Pools for OAuth bearer token authentication with proper token validation
- Configuration Management: JSON array configuration in AWS Secrets Manager supporting multiple MCP server connections with optional custom agent names and descriptions
- Real-time Integration: Tools become immediately available through the IDP web interface after configuration
-
AWS GovCloud Support with Automated Template Generation
- Added GovCloud compatibility through
scripts/generate_govcloud_template.py
script - ARN Partition Compatibility: All templates updated to use
arn:${AWS::Partition}:
for both commercial and GovCloud regions - Headless Operation: Automatically removes UI-related resources (CloudFront, AppSync, Cognito, WAF) for GovCloud deployment
- Core Functionality Preserved: All 3 processing patterns and complete 6-step pipeline (OCR, Classification, Extraction, Assessment, Summarization, Evaluation) remain fully functional
- Automated Workflow: Single script orchestrates build + GovCloud template generation + S3 upload with deployment URLs
- Enterprise Ready: Enables headless document processing for government and enterprise environments requiring GovCloud compliance
- Documentation: New
docs/govcloud-deployment.md
with deployment guide, architecture differences, and access methods
- Added GovCloud compatibility through
-
Pattern-2 and Pattern-3 Assessment now generate geometry (bounding boxes) for visualization in UI 'Visual Editor' (parity with Pattern-1)
- Added comprehensive spatial localization capabilities to both regular and granular assessment services
- Automatic Processing: When LLM provides bbox coordinates, automatically converts to UI-compatible (Visual Edit) geometry format without any configuration
- Universal Support: Works with all attribute types - simple attributes, nested group attributes (e.g., CompanyAddress.State), and list attributes
- Enhanced Prompts: Updated assessment task prompts with spatial-localization-guidelines requesting bbox coordinates in normalized 0-1000 scale
- Demo Notebooks: Assessment notebooks now showcase automatic bounding box processing
-
New Python-Based Publishing System
- Replaced
publish.sh
bash script with newpublish.py
Python script - Rich console interface with progress bars, spinners, and colored output using Rich library
- Multi-threaded artifact building and uploading for significantly improved performance
- Native support for Linux, macOS, and Windows environments
- Replaced
-
Windows Development Environment Setup Guide and Helper Script
- New
scripts/dev_setup.bat
(570 lines) for complete Windows development environment configuration
- New
-
OCR Service Default Image Sizing for Resource Optimization
- Implemented automatic default image size limits (951×1268) when no image sizing configuration is provided
- Key Benefits: Reduction in vision model token consumption, prevents OutOfMemory errors during concurrent processing, improves processing speed and reduces bandwidth usage
Changed
- Reverted to python3.12 runtime to resolve build package dependency problems
Fixed
- Improved Visual Edit bounding box position when using image zoom or pan