Skip to content

Commit 3fa8788

Browse files
author
Bob Strahan
committed
Merge branch 'develop' v0.4.8
2 parents b0d06c3 + 410fb0e commit 3fa8788

File tree

31 files changed

+10070
-2396
lines changed

31 files changed

+10070
-2396
lines changed

CHANGELOG.md

Lines changed: 48 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -5,6 +5,54 @@ SPDX-License-Identifier: MIT-0
55

66
## [Unreleased]
77

8+
## [0.4.8]
9+
10+
### Added
11+
12+
- **Section Data Download Feature for Document Results Export**
13+
- Added compact "Download" dropdown button in Document Sections panel for exporting section processing results
14+
- **Two Download Options**:
15+
- "Download Data" - Downloads prediction results from OutputBucket (always available)
16+
- "Download Baseline" - Downloads baseline/ground truth data from EvaluationBaselineBucket (only shown when baseline exists)
17+
18+
- **Configuration Library Import Feature for Enhanced Configuration Management**
19+
- Added Configuration Library browser enabling users to import pre-configured document processing workflows directly from the solution's configuration library
20+
- **Dual Import Options**: Users can now choose between importing from local files (existing) or from the Configuration Library (new)
21+
- **Pattern-Aware Filtering**: Automatically displays only configurations compatible with the currently deployed pattern (Pattern 1, 2, or 3)
22+
- **README Preview**: When available, displays markdown-formatted README documentation before importing to help users understand configuration purpose and features
23+
24+
- **Test Studio Interactive Charts and Document Analysis Enhancements**
25+
- **Interactive Score Distribution Charts**: Replaced CloudScape chart with native Recharts implementation featuring dual chart support (Bar Chart and Line Chart options with dropdown selector), native interactivity with built-in click events that open document details modal, and optimized layout with improved margins, labels, and space utilization
26+
- **Lowest Scoring Documents Analysis**: Enhanced TestResults with table showing documents with lowest weighted overall scores, TestComparison with cross-test comparison of problematic documents, user-configurable count dropdown (5, 10, 20, or 50 documents), side-by-side T1 vs T2 comparison format for easy analysis, and clickable document links for direct navigation to document viewer
27+
- **UI/UX Improvements**: Compact table styling with reduced spacing and improved readability, left-aligned content for better text alignment of document IDs, consistent design matching existing CloudScape design system, and responsive layout where charts adapt to container width
28+
29+
- **RealKIE-FCC-Verified Dataset Auto-Deployment for Test Studio**
30+
- Automatically deploys 75 FCC invoice documents from HuggingFace public dataset during stack deployment - zero manual steps required
31+
- Test set immediately available in Test Studio UI with complete ground truth for benchmarking extraction accuracy
32+
- Version controlled via CloudFormation property - skips re-download on stack updates unless version changes
33+
34+
### Fixed
35+
36+
- **Bedrock OCR Image Resizing Regression - Partial Dimension Configuration Support**
37+
- Fixed critical regression where configuring only `target_width` (without `target_height`) disabled all image resizing, causing Bedrock OCR to fail with "length limit exceeded" errors
38+
- **Root Cause**: OCR service used `and` condition requiring both dimensions, rejecting partial configs and sending full-resolution images that exceeded model input limits
39+
- **Solution**: Implemented aspect-ratio-preserving single-dimension resizing that calculates missing dimension from actual image aspect ratio
40+
41+
- **Test Studio Bug Fixes**
42+
- Fixed TestSets manual upload issues
43+
44+
- **Agentic Extraction Prompt Caching** - [GitHub PR #156](https://github.com/aws-solutions-library-samples/accelerated-intelligent-document-processing-on-aws/pull/156)
45+
- Removed additional cachepoints to prevent prompt caching conflicts in agentic extraction
46+
47+
- **GovCloud S3 Vectors Service Principal Deployment Failure** - [GitHub Issue #159](https://github.com/aws-solutions-library-samples/accelerated-intelligent-document-processing-on-aws/issues/159)
48+
- Fixed CloudFormation deployment failure in GovCloud regions caused by S3 Vectors service not being available
49+
- **Root Cause**: KMS key policy referenced `indexing.s3vectors.${AWS::URLSuffix}` service principal which doesn't exist in GovCloud (us-gov-west-1, us-gov-east-1)
50+
51+
### Templates
52+
- us-west-2: `https://s3.us-west-2.amazonaws.com/aws-ml-blog-us-west-2/artifacts/genai-idp/idp-main_0.4.8.yaml`
53+
- us-east-1: `https://s3.us-east-1.amazonaws.com/aws-ml-blog-us-east-1/artifacts/genai-idp/idp-main_0.4.8.yaml`
54+
- eu-central-1: `https://s3.eu-central-1.amazonaws.com/aws-ml-blog-eu-central-1/artifacts/genai-idp/idp-main_0.4.8.yaml`
55+
856
## [0.4.7]
957

1058
### Added

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -28,7 +28,7 @@ A scalable, serverless solution for automated document processing and informatio
2828

2929
https://github.com/user-attachments/assets/272b543b-e506-48ce-acc1-361422d22322
3030

31-
White-glove customization, deployment, and integration support for production use cases is also available through [AWS Professional Services](https://aws.amazon.com/professional-services/).
31+
Concierge support for customization, deployment, and integration of production use cases is available through [AWS Professional Services](https://aws.amazon.com/professional-services/).
3232

3333
## Alternative Implementations
3434

VERSION

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
0.4.7
1+
0.4.8
Lines changed: 17 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,17 @@
1+
Copyright Amazon.com, Inc. or its affiliates. All Rights Reserved.
2+
SPDX-License-Identifier: MIT-0
3+
4+
# RealKIE-FCC-Verified Configuration
5+
6+
This directory contains the default (starting point) configuration for the FCC Invoices in the RealKIE-FCC-Verified dataset used to benchmark the GenAI IDP Accelerator. This configuration is specifically designed for processing the FCC invoice samples using Test Studio.
7+
8+
## Pattern Association
9+
10+
**Pattern**: Pattern-1
11+
12+
## Validation Level
13+
14+
**Level**: 2 - Minimal Testing
15+
16+
- **Testing Evidence**: This configuration has been lightly tested with the RealKIE-FCC-Verified Dataset.
17+
- **Known Limitations**: Performance may vary - consider this configuration a starting point. We welome Pull Requests to improve the accuracy.

0 commit comments

Comments
 (0)