You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
-**Advanced Features**: Multi-model support (Nova, Claude), customizable prompts, configurable parameters, ground truth processing, schema conversion, and lifecycle management
19
+
-**Key Benefits**: Rapid new document type onboarding, reduced time-to-production, configuration optimization, and automated workflow bootstrapping
20
+
-**Use Cases**: New document exploration, configuration improvement, rapid prototyping, and document understanding
21
+
-**Documentation**: Guide in `docs/discovery.md` with architecture details, best practices, and troubleshooting
22
+
23
+
-**Optional Pattern-2 Regex-Based Classification for Enhanced Performance**
24
+
- Added support for optional regex patterns in document class definitions for performance optimization
25
+
-**Document Name Regex**: Match against document ID/name to classify all pages without LLM processing when all pages should be the same class
26
+
-**Document Page Content Regex**: Match against page text content during multi-modal page-level classification for fast page classification
27
+
-**Key Benefits**: Significant performance improvements and cost savings by bypassing LLM calls for pattern-matched documents, deterministic classification results for known document patterns, seamless fallback to existing LLM classification when regex patterns don't match
28
+
-**Configuration**: Optional `document_name_regex` and `document_page_content_regex` fields in class definitions with automatic regex compilation and validation
29
+
-**Logging**: Comprehensive info-level logging when regex patterns match for observability and debugging
30
+
-**CloudFormation Integration**: Updated Pattern-2 schema to support regex configuration through the Web UI
31
+
-**Demonstration**: New `step2_classification_with_regex.ipynb` notebook showcasing regex configuration and performance comparisons
32
+
-**Documentation**: Enhanced classification module README and main documentation with regex usage examples and best practices
33
+
34
+
-**Windows WSL Development Environment Setup Guide**
35
+
- Added WSL-based development environment setup guide for Windows developers in `docs/setup-development-env-WSL.md`
36
+
-**Key Features**: Automated setup script (`wsl_setup.sh`) for quick installation of Git, Python, Node.js, AWS CLI, and SAM CLI
37
+
-**Integrated Workflow**: Development setup combining Windows tools (VS Code, browsers) with native Linux environment
38
+
-**Target Use Cases**: Windows developers needing Linux compatibility without Docker Desktop or VM overhead
39
+
40
+
### Fixed
41
+
-**Throttling Error Detection and Retry Logic for Assessment Functions** - [GitHub Issue #45](https://github.com/aws-solutions-library-samples/accelerated-intelligent-document-processing-on-aws/issues/45)
42
+
-**Assessment Function**: Enhanced throttling detection to check for throttling errors returned in `document.errors` field in addition to thrown exceptions, raising `ThrottlingException` to trigger Step Functions retry when throttling is detected
43
+
-**Granular Assessment Task Caching**: Fixed caching logic to properly cache successful assessment tasks when there are ANY failed tasks (both exception-based and result-based failures), enabling efficient retry optimization by only reprocessing failed tasks while preserving successful results
44
+
-**Impact**: Improved resilience for throttling scenarios, reduced redundant processing during retries, and better Step Functions retry behavior
- Resolves GitHub Issue #50 - templates now deploy correctly in both standard AWS and GovCloud regions
54
+
55
+
-**Bug Fixes and Code Improvements**
56
+
- Fixed HITL processing errors in both Pattern-1 (DynamoDB validation with empty strings) and Pattern-2 (string indices error in A2I output processing)
57
+
- Fixed Step Function UI issues including auto-refresh button auto-disable and fetch failures for failed executions with datetime serialization errors
58
+
- Cleaned up unused Step Function subscription infrastructure and removed duplicate code in Pattern-2 HITL function
59
+
- Expanded UI Visual Editor bounding box size with padding for better visibility and user interaction
60
+
- Fixed bug in list of models supporting cache points - previously claude 4 sonnet and opus had been excluded.
61
+
- Validations added at the assessment step for checking valid json response. The validation fails after extraction/assessment is complete if json parsing issues are encountered.
0 commit comments