Skip to content

v0.3.13

Latest
Compare
Choose a tag to compare
@rstrahan rstrahan released this 29 Aug 22:33
· 45 commits to main since this release

[0.3.13]

Added

  • External MCP Agent Integration for Custom Tool Extension

    • Added External MCP (Model Context Protocol) Agent support that enables integration with custom MCP servers to extend IDP capabilities
    • Cross-Account Integration: Host MCP servers in separate AWS accounts or external infrastructure with secure OAuth authentication using AWS Cognito
    • Dynamic Tool Discovery: Automatically discovers and integrates available tools from MCP servers through the IDP web interface
    • Secure Authentication Flow: Uses AWS Cognito User Pools for OAuth bearer token authentication with proper token validation
    • Configuration Management: JSON array configuration in AWS Secrets Manager supporting multiple MCP server connections with optional custom agent names and descriptions
    • Real-time Integration: Tools become immediately available through the IDP web interface after configuration
  • AWS GovCloud Support with Automated Template Generation

    • Added GovCloud compatibility through scripts/generate_govcloud_template.py script
    • ARN Partition Compatibility: All templates updated to use arn:${AWS::Partition}: for both commercial and GovCloud regions
    • Headless Operation: Automatically removes UI-related resources (CloudFront, AppSync, Cognito, WAF) for GovCloud deployment
    • Core Functionality Preserved: All 3 processing patterns and complete 6-step pipeline (OCR, Classification, Extraction, Assessment, Summarization, Evaluation) remain fully functional
    • Automated Workflow: Single script orchestrates build + GovCloud template generation + S3 upload with deployment URLs
    • Enterprise Ready: Enables headless document processing for government and enterprise environments requiring GovCloud compliance
    • Documentation: New docs/govcloud-deployment.md with deployment guide, architecture differences, and access methods
  • Pattern-2 and Pattern-3 Assessment now generate geometry (bounding boxes) for visualization in UI 'Visual Editor' (parity with Pattern-1)

    • Added comprehensive spatial localization capabilities to both regular and granular assessment services
    • Automatic Processing: When LLM provides bbox coordinates, automatically converts to UI-compatible (Visual Edit) geometry format without any configuration
    • Universal Support: Works with all attribute types - simple attributes, nested group attributes (e.g., CompanyAddress.State), and list attributes
    • Enhanced Prompts: Updated assessment task prompts with spatial-localization-guidelines requesting bbox coordinates in normalized 0-1000 scale
    • Demo Notebooks: Assessment notebooks now showcase automatic bounding box processing
  • New Python-Based Publishing System

    • Replaced publish.sh bash script with new publish.py Python script
    • Rich console interface with progress bars, spinners, and colored output using Rich library
    • Multi-threaded artifact building and uploading for significantly improved performance
    • Native support for Linux, macOS, and Windows environments
  • Windows Development Environment Setup Guide and Helper Script

    • New scripts/dev_setup.bat (570 lines) for complete Windows development environment configuration
  • OCR Service Default Image Sizing for Resource Optimization

    • Implemented automatic default image size limits (951×1268) when no image sizing configuration is provided
    • Key Benefits: Reduction in vision model token consumption, prevents OutOfMemory errors during concurrent processing, improves processing speed and reduces bandwidth usage

Changed

  • Reverted to python3.12 runtime to resolve build package dependency problems

Fixed

  • Improved Visual Edit bounding box position when using image zoom or pan