Release v0.3.13 · aws-solutions-library-samples/accelerated-intelligent-document-processing-on-aws

[0.3.13]

Added

External MCP Agent Integration for Custom Tool Extension
- Added External MCP (Model Context Protocol) Agent support that enables integration with custom MCP servers to extend IDP capabilities
- Cross-Account Integration: Host MCP servers in separate AWS accounts or external infrastructure with secure OAuth authentication using AWS Cognito
- Dynamic Tool Discovery: Automatically discovers and integrates available tools from MCP servers through the IDP web interface
- Secure Authentication Flow: Uses AWS Cognito User Pools for OAuth bearer token authentication with proper token validation
- Configuration Management: JSON array configuration in AWS Secrets Manager supporting multiple MCP server connections with optional custom agent names and descriptions
- Real-time Integration: Tools become immediately available through the IDP web interface after configuration
AWS GovCloud Support with Automated Template Generation
- Added GovCloud compatibility through scripts/generate_govcloud_template.py script
- ARN Partition Compatibility: All templates updated to use arn:${AWS::Partition}: for both commercial and GovCloud regions
- Headless Operation: Automatically removes UI-related resources (CloudFront, AppSync, Cognito, WAF) for GovCloud deployment
- Core Functionality Preserved: All 3 processing patterns and complete 6-step pipeline (OCR, Classification, Extraction, Assessment, Summarization, Evaluation) remain fully functional
- Automated Workflow: Single script orchestrates build + GovCloud template generation + S3 upload with deployment URLs
- Enterprise Ready: Enables headless document processing for government and enterprise environments requiring GovCloud compliance
- Documentation: New docs/govcloud-deployment.md with deployment guide, architecture differences, and access methods
Pattern-2 and Pattern-3 Assessment now generate geometry (bounding boxes) for visualization in UI 'Visual Editor' (parity with Pattern-1)
- Added comprehensive spatial localization capabilities to both regular and granular assessment services
- Automatic Processing: When LLM provides bbox coordinates, automatically converts to UI-compatible (Visual Edit) geometry format without any configuration
- Universal Support: Works with all attribute types - simple attributes, nested group attributes (e.g., CompanyAddress.State), and list attributes
- Enhanced Prompts: Updated assessment task prompts with spatial-localization-guidelines requesting bbox coordinates in normalized 0-1000 scale
- Demo Notebooks: Assessment notebooks now showcase automatic bounding box processing
New Python-Based Publishing System
- Replaced publish.sh bash script with new publish.py Python script
- Rich console interface with progress bars, spinners, and colored output using Rich library
- Multi-threaded artifact building and uploading for significantly improved performance
- Native support for Linux, macOS, and Windows environments
Windows Development Environment Setup Guide and Helper Script
- New scripts/dev_setup.bat (570 lines) for complete Windows development environment configuration
OCR Service Default Image Sizing for Resource Optimization
- Implemented automatic default image size limits (951×1268) when no image sizing configuration is provided
- Key Benefits: Reduction in vision model token consumption, prevents OutOfMemory errors during concurrent processing, improves processing speed and reduces bandwidth usage

Changed

Reverted to python3.12 runtime to resolve build package dependency problems

Fixed

Improved Visual Edit bounding box position when using image zoom or pan

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

v0.3.13

[0.3.13]

Added

Changed

Fixed

Uh oh!