You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/knowledge-base.md
+48-2Lines changed: 48 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -7,10 +7,11 @@ The GenAIIDP solution includes an integrated Document Knowledge Base query featu
7
7
8
8
## How It Works
9
9
10
-
1.**Document Indexing**
10
+
1.**Document Processing & Indexing**
11
11
- Processed documents are automatically indexed in a vector database
12
12
- Documents are chunked into semantic segments for efficient retrieval
13
13
- Each chunk maintains reference to its source document
14
+
-**Ingestion Schedule**: Documents are ingested into the knowledge base every 30 minutes, so newly processed documents may not be immediately available for querying
14
15
15
16
2.**Interactive Query Interface**
16
17
- Access through the Web UI via the "Knowledge Base" section
@@ -33,6 +34,25 @@ The GenAIIDP solution includes an integrated Document Knowledge Base query featu
33
34
-**Markdown Formatting**: Responses support rich formatting for better readability
34
35
-**Real-time Processing**: Get answers in seconds, even across large document collections
35
36
37
+
## Architecture & Vector Storage Options
38
+
39
+
The Knowledge Base feature supports two vector storage backends to optimize for different performance and cost requirements:
Description: Enable/disable the Document Knowledge Base feature
48
68
69
+
KnowledgeBaseVectorStore:
70
+
Type: String
71
+
Default: "OPENSEARCH_SERVERLESS"
72
+
AllowedValues:
73
+
- "OPENSEARCH_SERVERLESS"
74
+
- "S3_VECTORS"
75
+
Description: Vector storage backend for the knowledge base
76
+
49
77
DocumentKnowledgeBaseModel:
50
78
Type: String
51
79
Default: "us.amazon.nova-pro-v1:0"
52
80
Description: Bedrock model to use for knowledge base queries (e.g., "us.anthropic.claude-3-7-sonnet-20250219-v1:0")
53
81
```
54
82
83
+
### Supported Embedding Models
84
+
85
+
Both vector store types support the same embedding models:
86
+
- `amazon.titan-embed-text-v2:0` (default)
87
+
- `cohere.embed-english-v3`(disabled by default)
88
+
- `cohere.embed-multilingual-v3`(disabled by default)
89
+
55
90
When the feature is enabled, the solution:
56
-
- Creates necessary OpenSearch resources for document indexing
91
+
- Creates the selected vector storage resources (OpenSearch or S3 Vectors)
57
92
- Configures API endpoints for querying the knowledge base
58
93
- Adds the query interface to the Web UI
59
94
@@ -111,3 +146,14 @@ The Knowledge Base feature maintains the security controls of the overall soluti
111
146
- Document visibility respects user permissions
112
147
- Questions and answers are processed securely within your AWS account
113
148
- No data is sent to external services beyond the configured Bedrock models
149
+
150
+
## Future Enhancements
151
+
152
+
### Potential Improvements & Community Contributions
153
+
- **CloudFormation Support**: When S3 Vectors gains native CloudFormation support
154
+
- **Migration Tools**: Utilities to migrate between vector store types
155
+
- **Hybrid Deployment**: Support for multiple Knowledge Bases with different vector stores
156
+
- **Document Chunking Options**: The system currently uses default chunking strategies, with additional chunking methods available for optimization based on document types and use cases
0 commit comments