You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: CHANGELOG.md
+18Lines changed: 18 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -6,6 +6,23 @@ SPDX-License-Identifier: MIT-0
6
6
## [Unreleased]
7
7
8
8
### Added
9
+
10
+
## [0.3.16]
11
+
12
+
### Added
13
+
14
+
-**S3 Vectors Support for Cost-Optimized Knowledge Base Storage**
15
+
- Added S3 Vectors as alternative vector store option to OpenSearch Serverless for Bedrock Knowledge Base with lower storage costs
16
+
- Custom resource Lambda implementation for S3 vector bucket and index management (using boto3 s3vectors client) with proper IAM permissions and resource cleanup
17
+
- Unified Knowledge Base interface supporting both vector store types with automatic resource provisioning based on user selection
18
+
19
+
-**Page Limit Configuration for Classification Control**
20
+
- Added `maxPagesForClassification` configuration option to control how many pages are used during document classification
21
+
-**Default Behavior**: `"ALL"` - uses all pages for classification (existing behavior)
22
+
-**Limited Page Classification**: Set to numeric value (e.g., `"1"`, `"2"`, `"3"`) to classify only the first N pages
23
+
-**Important**: When using numeric limit, the classification result from the first N pages is applied to ALL pages in the document, effectively forcing the entire document to be assigned a single class with one section
24
+
-**Use Cases**: Performance optimization for large documents, cost reduction for documents with consistent classification patterns, simplified processing for homogeneous document types
25
+
9
26
-**CloudFormation Service Role for Delegated Deployment Access**
10
27
- Added example CloudFormation service role template that enables non-administrator users to deploy and maintain IDP stacks without requiring ongoing administrator permissions
11
28
- Administrators can provision the service role once with elevated privileges, then delegate deployment capabilities to developer/DevOps teams
@@ -18,6 +35,7 @@ SPDX-License-Identifier: MIT-0
18
35
- Fix occasional UI error 'Failed to get document details - please try again later' - resolves #58
19
36
- Fixed UI zipfile creation to exclude .aws-sam directories and .env files from deployment package
20
37
- Added security recommendation to set LogLevel parameter to WARN or ERROR (not INFO) for production deployments to prevent logging of sensitive information including PII data, document contents, and S3 presigned URLs
38
+
- Hardened several aspects of the new Discovery feature
Copy file name to clipboardExpand all lines: docs/knowledge-base.md
+53-2Lines changed: 53 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -5,12 +5,18 @@ SPDX-License-Identifier: MIT-0
5
5
6
6
The GenAIIDP solution includes an integrated Document Knowledge Base query feature that enables you to interactively ask questions about your processed document collection using natural language. This feature leverages the processed data to create a searchable knowledge base.
- Processed documents are automatically indexed in a vector database
12
17
- Documents are chunked into semantic segments for efficient retrieval
13
18
- Each chunk maintains reference to its source document
19
+
-**Ingestion Schedule**: Documents are ingested into the knowledge base every 30 minutes, so newly processed documents may not be immediately available for querying
14
20
15
21
2.**Interactive Query Interface**
16
22
- Access through the Web UI via the "Knowledge Base" section
@@ -33,6 +39,25 @@ The GenAIIDP solution includes an integrated Document Knowledge Base query featu
33
39
-**Markdown Formatting**: Responses support rich formatting for better readability
34
40
-**Real-time Processing**: Get answers in seconds, even across large document collections
35
41
42
+
## Architecture & Vector Storage Options
43
+
44
+
The Knowledge Base feature supports two vector storage backends to optimize for different performance and cost requirements:
Description: Enable/disable the Document Knowledge Base feature
48
73
74
+
KnowledgeBaseVectorStore:
75
+
Type: String
76
+
Default: "OPENSEARCH_SERVERLESS"
77
+
AllowedValues:
78
+
- "OPENSEARCH_SERVERLESS"
79
+
- "S3_VECTORS"
80
+
Description: Vector storage backend for the knowledge base
81
+
49
82
DocumentKnowledgeBaseModel:
50
83
Type: String
51
84
Default: "us.amazon.nova-pro-v1:0"
52
85
Description: Bedrock model to use for knowledge base queries (e.g., "us.anthropic.claude-3-7-sonnet-20250219-v1:0")
53
86
```
54
87
88
+
### Supported Embedding Models
89
+
90
+
Both vector store types support the same embedding models:
91
+
- `amazon.titan-embed-text-v2:0` (default)
92
+
- `cohere.embed-english-v3`(disabled by default)
93
+
- `cohere.embed-multilingual-v3`(disabled by default)
94
+
55
95
When the feature is enabled, the solution:
56
-
- Creates necessary OpenSearch resources for document indexing
96
+
- Creates the selected vector storage resources (OpenSearch or S3 Vectors)
57
97
- Configures API endpoints for querying the knowledge base
58
98
- Adds the query interface to the Web UI
59
99
@@ -111,3 +151,14 @@ The Knowledge Base feature maintains the security controls of the overall soluti
111
151
- Document visibility respects user permissions
112
152
- Questions and answers are processed securely within your AWS account
113
153
- No data is sent to external services beyond the configured Bedrock models
154
+
155
+
## Future Enhancements
156
+
157
+
### Potential Improvements & Community Contributions
158
+
- **CloudFormation Support**: When S3 Vectors gains native CloudFormation support
159
+
- **Migration Tools**: Utilities to migrate between vector store types
160
+
- **Hybrid Deployment**: Support for multiple Knowledge Bases with different vector stores
161
+
- **Document Chunking Options**: The system currently uses default chunking strategies, with additional chunking methods available for optimization based on document types and use cases
0 commit comments