You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: CHANGELOG.md
+146Lines changed: 146 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -5,10 +5,156 @@ SPDX-License-Identifier: MIT-0
5
5
6
6
## [Unreleased]
7
7
8
+
## [0.4.6]
9
+
8
10
### Added
9
11
12
+
-**New State-Of-The-Art LLM Model Support**
13
+
- Added support for Amazon Nova 2 Lite model (`us.amazon.nova-2-lite-v1:0`, `eu.amazon.nova-2-lite-v1:0`)
14
+
- Added support for Claude Opus 4.5 model (`us.anthropic.claude-opus-4-5-20251101-v1:0`, `eu.anthropic.claude-opus-4-5-20251101-v1:0`)
15
+
- Added support for Qwen 3 VL model (`qwen.qwen3-vl-235b-a22b`)
16
+
- Available for configuration across all document processing steps
17
+
18
+
-**Test Studio for Comprehensive Test Management and Analysis**
19
+
- Added unified web interface for managing test sets, running tests, and analyzing results directly from the UI
20
+
-**Test Sets Tab**: Create and manage reusable test collections with three creation methods:
21
+
- Pattern-based creation with file patterns to match existing data sets (Input Bucket and Test Set Bucket)
22
+
- Zip upload with automatic extraction of `input/` and `baseline/` folder structure
23
+
-**Test Executions Tab**: Unified interface combining test execution and results management:
24
+
- Real-time status monitoring
25
+
- Multi-select comparison for side-by-side test analysis
26
+
- Integrated export and delete operations
27
+
-**Key Features**: File structure validation, progress-aware status updates, cached metrics for improved performance, dual bucket support for flexible test organization
28
+
-**Documentation**: Guide in `docs/test-studio.md` with architecture details and workflow examples
29
+
30
+
-**MCP Integration for External Application Access**
31
+
- Added MCP (Model Context Protocol) integration enabling external applications (like Amazon Quick Suite) to access IDP analytics through AWS Bedrock AgentCore Gateway with secure OAuth 2.0 authentication
32
+
- Implemented Analytics Agent with `search_genaiidp` tool for natural language queries of processed document data (statistics, trends, confidence scores, processing status)
33
+
- Controlled by `EnableMCP` parameter (default: true); provides MCPServerEndpoint and authentication outputs for external application integration; documentation in `docs/mcp-integration.md`
34
+
35
+
-**Configurable Section Splitting Strategies for Enhanced Document Segmentation Control**
36
+
- Added new `sectionSplitting` configuration option to control how classified pages are grouped into document sections
37
+
-**Three Strategies Available**:
38
+
-`disabled`: Entire document treated as single section with first detected class (simplest case)
39
+
-`page`: One section per page preventing automatic joining of same-type documents (deterministic, solves Issue #146)
-**Key Benefits**: Deterministic splitting for long documents with multiple same-type forms (e.g., multiple W-2s, multiple invoices), eliminates LLM boundary detection failures for critical government form processing, provides flexibility across simple to complex document scenarios
42
+
- Resolves #146
43
+
44
+
### Changed
45
+
46
+
-**Improved Temperature and Top_P Parameter Logic for Deterministic Output**
47
+
- Changed inference parameter selection logic to allow `temperature=0.0` for deterministic output (recommended by Anthropic and other model providers)
48
+
-**New Logic**: Uses `top_p` only when it has a positive value (> 0); otherwise uses `temperature` including `temperature=0.0`
49
+
-**Previous Logic**: Used `top_p` whenever `temperature=0.0`, preventing proper deterministic configuration
50
+
-**Key Benefits**: Enables proper deterministic output with `temperature=0.0`, more intuitive parameter behavior, aligns with model provider best practices (Anthropic recommends `temperature=0` for consistent outputs)
51
+
-**Affected Components**: Bedrock client (`lib/idp_common_pkg/idp_common/bedrock/client.py`), Agentic extraction service (`lib/idp_common_pkg/idp_common/extraction/agentic_idp.py`)
52
+
-**Configuration Guidance**: Set `top_p: 0` to use `temperature` parameter; set `top_p` to positive value to override temperature
53
+
- Set temperature to 0.0 in discovery config for deterministic discovery output (was previously set to 1.0)
54
+
- Set top_p to 0.0 in all repo config files to force use of temperature setting by default.
55
+
56
+
-**Removed page image limit entirely across all IDP services**
57
+
- removed image limits from multimodal inference steps (classification, extraction, assessment) following Amazon Bedrock API removal of image count restrictions. The system now processes all document pages without artificial truncation, with info logging to track image counts for monitoring purposes.
58
+
- Resolves #147
59
+
60
+
-**Knowledge Base Vector Store Default Changed to S3 Vectors**
61
+
- Changed default `KnowledgeBaseVectorStore` from `OPENSEARCH_SERVERLESS` to `S3_VECTORS` for cost-optimized deployments
62
+
- S3 Vectors provides 40-60% lower storage costs with sub-second latency suitable for most use cases
63
+
- OpenSearch Serverless remains available for applications requiring sub-millisecond query performance
64
+
- No action required for existing deployments - only affects new stack deployments
65
+
66
+
### Fixed
67
+
68
+
-**UI: Document Schema Editor Regex Fields Not Persisting** - Fixed issue where Document Name Regex and Page Content Regex fields were not being saved in configuration or restored after page refresh. Fixes #151
69
+
-**Document Schema Builder Enum Support** - Fixed enum value handling in schema builder to properly support enumeration constraints for attribute definitions
70
+
-**Agentic Extraction Parameter Passing** - Fixed temperature and top_p parameters now correctly passed to agentic extraction service, enabling proper model behavior control
71
+
-**Document Schema Builder UI Labels** - Enhanced field labels and formats in document schema builder for improved clarity and user experience
72
+
-**Retry Mechanism Improvements** - Enhanced retry logic for more reliable error handling and recovery across document processing workflows
73
+
-**Type Safety Enhancements** - Improved type annotations and fixed undefined items handling to prevent runtime errors
-**Document Split Classification Metrics for Evaluating Page-Level Classification and Document Segmentation**
86
+
- Added `DocSplitClassificationMetrics` class for comprehensive evaluation of document splitting and classification accuracy
87
+
-**Three Accuracy Types**: Page-level classification accuracy, split accuracy without order consideration, and split accuracy with exact page order matching
-**Automatic Integration**: Integrates with evaluation service when ground truth and predicted sections are available
90
+
-**Documentation**: Guide in `lib/idp_common_pkg/idp_common/evaluation/README.md` with usage examples, metric explanations, and best practices
91
+
92
+
-**Caching improvements to Agentic Extraction Service**
93
+
- Optimized prompt caching by caching document context (text/images) on first LLM call, reducing token costs and quota consumption
94
+
95
+
-**Enhanced Bedrock Retry Logic for Agentic Extraction**
96
+
- New `bedrock_utils.py` module with exponential backoff and comprehensive error handling
97
+
- Improves agentic extraction reliability for transient failures and rate limiting
98
+
99
+
-**Review Agent Model Configuration**
100
+
- Added `review_agent_model` parameter to enable separate model for reviewing extraction work
101
+
- Defaults to main extraction model if not specified
102
+
- Configurable through Web UI extraction settings
103
+
104
+
10
105
### Fixed
11
106
107
+
-**Evaluation Output URI Fields Lost Across All Patterns - causing (a) missing Page Text Confidence content in UI, (2) failed Assessment step when reprocessing document after editing classes (No module named 'fitz')**
108
+
- Fixed bug where `text_confidence_uri` was being set to null in evaluation output for all three patterns
109
+
- Root cause: AppSync service `_appsync_to_document()` method incorrectly mapped page URIs, and evaluation functions overwrote correct documents with corrupted AppSync responses
110
+
111
+
-**UI: Metering Data Not Displayed During Document Processing**
cd src/ui && npm ci --prefer-offline --no-audit && npm run build
120
129
121
130
commit: lint test
122
131
$(info Generating commit message...)
123
-
export COMMIT_MESSAGE="$(shell q chat --no-interactive --trust-all-tools "Understand pending local git change and changes to be committed, then infer a commit message. Return this commit message only"| tail -n 1 | sed 's/\x1b\[[0-9;]*m//g')"&&\
132
+
export COMMIT_MESSAGE="$(shell kiro-cli chat --no-interactive --trust-all-tools "Understand pending local git change and changes to be committed, then infer a commit message. Return this commit message only on a single line."| grep ">"| tail -n 1 | sed 's/\x1b\[[0-9;]*m//g')"&&\
124
133
git add .&&\
125
134
git commit -am "$${COMMIT_MESSAGE}"&&\
126
135
git push
127
136
128
137
fastcommit: fastlint
129
138
$(info Generating commit message...)
130
-
export COMMIT_MESSAGE="$(shell q chat --no-interactive --trust-all-tools "Understand pending local git change and changes to be committed, then infer a commit message. Return this commit message only"| tail -n 1 | sed 's/\x1b\[[0-9;]*m//g')"&&\
139
+
export COMMIT_MESSAGE="$(shell kiro-cli chat --no-interactive --trust-all-tools "Understand pending local git change and changes to be committed, then infer a commit message. Return this commit message only on a single line."| grep ">"| tail -n 1 | sed 's/\x1b\[[0-9;]*m//g')"&&\
Copy file name to clipboardExpand all lines: README.md
+6-2Lines changed: 6 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -34,6 +34,8 @@ White-glove customization, deployment, and integration support for production us
34
34
35
35
**Prefer AWS CDK?** This solution is also available as [GenAI IDP Accelerator for AWS CDK](https://github.com/cdklabs/genai-idp), providing the same functional capabilities through AWS CDK constructs for customers who prefer Infrastructure-as-Code with CDK.
36
36
37
+
**Prefer Terraform?** This solution is also available as [GenAI IDP Terraform](https://github.com/awslabs/genai-idp-terraform), providing the same functional capabilities as a Terraform module that integrates with existing infrastructure and supports customization through module variables.
38
+
37
39
## Key Features
38
40
39
41
-**Serverless Architecture**: Built entirely on AWS serverless technologies including Lambda, Step Functions, SQS, and DynamoDB
@@ -53,6 +55,7 @@ White-glove customization, deployment, and integration support for production us
53
55
-**Extraction Confidence Assessment**: LLM-powered assessment of extraction confidence with multimodal document analysis
54
56
-**Document Knowledge Base Query**: Ask questions about your processed documents
55
57
-**IDP Accelerator Help Chat Bot**: Ask questions about the IDP code base or features
58
+
-**MCP Integration**: Model Context Protocol integration enabling external applications like Amazon Quick Suite to access IDP data and analytics through AWS Bedrock AgentCore Gateway
0 commit comments