Conversation

@tmichaeldb (Contributor) commented:

This PR adds support for preprocessing knowledge base documents in two ways:

  1. TextChunkingConfig to split loaded documents into chunks before embedding
  2. ContextualConfig to support Contextual Retrieval (ref)
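Contextual Retrieval (per the linked reference) prepends a short, LLM-generated summary situating each chunk within its source document before the chunk is embedded, so the embedding carries document-level context the bare chunk lacks. A minimal sketch of that step, with a stubbed `generate_context` standing in for the actual LLM call (all names here are hypothetical, not the PR's API):

```python
from typing import List

def generate_context(document: str, chunk: str) -> str:
    # Stub: in practice an LLM is prompted with the full document and the
    # chunk, and asked for a brief sentence situating the chunk.
    return f"This chunk is from a document beginning: {document[:40]!r}."

def contextualize_chunks(document: str, chunks: List[str]) -> List[str]:
    # Prepend the generated context to each chunk before embedding.
    return [f"{generate_context(document, c)}\n\n{c}" for c in chunks]

doc = "MindsDB connects AI models to data sources."
out = contextualize_chunks(doc, ["connects AI models", "to data sources"])
```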

It also supports passing parameters to Knowledge Base creation to configure retrieval. Example:

params = {
    'search_kwargs': {
        'k': 10
    }
}
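A text-chunking config of this kind typically controls chunk size and overlap. A rough sketch of the splitting step such a config would drive (the field names are assumptions for illustration, not the PR's actual schema):

```python
from dataclasses import dataclass
from typing import List

@dataclass
class TextChunkingConfig:
    # Hypothetical fields; the PR's actual schema may differ.
    chunk_size: int = 500
    chunk_overlap: int = 50

def split_document(text: str, config: TextChunkingConfig) -> List[str]:
    # Slide a window of chunk_size characters, stepping forward by
    # chunk_size - chunk_overlap so adjacent chunks share context.
    step = config.chunk_size - config.chunk_overlap
    return [text[i:i + config.chunk_size] for i in range(0, len(text), step)]

chunks = split_document("a" * 1200, TextChunkingConfig(chunk_size=500, chunk_overlap=50))
```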

Comment on lines +28 to +31
class LLMConfig(BaseModel):
model_name: str = Field(default=DEFAULT_LLM_MODEL, description='LLM model to use for context generation')
provider: str = Field(default=DEFAULT_LLM_MODEL_PROVIDER, description='LLM model provider to use for context generation')
params: Dict[str, Any] = Field(default={}, description='Additional parameters to pass in when initializing the LLM')
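The `params` field lets callers forward arbitrary keyword arguments to the LLM's constructor without the config schema enumerating them. A hedged sketch of how such a config might be consumed, using a plain dataclass stand-in (the `create_llm` factory and default values are hypothetical):

```python
from dataclasses import dataclass, field
from typing import Any, Dict

@dataclass
class LLMConfig:
    model_name: str = "gpt-4o"   # assumed default, for illustration only
    provider: str = "openai"     # assumed default, for illustration only
    params: Dict[str, Any] = field(default_factory=dict)

def create_llm(config: LLMConfig) -> Dict[str, Any]:
    # Hypothetical factory: merge the identity fields with the extra
    # params before handing them to a provider's client constructor.
    return {"model": config.model_name, "provider": config.provider, **config.params}

llm_kwargs = create_llm(LLMConfig(params={"temperature": 0.0}))
```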

🙏 Nice

@tmichaeldb tmichaeldb merged commit 5c6ddfb into main Nov 22, 2024
5 checks passed
@mindsdb mindsdb locked and limited conversation to collaborators Nov 22, 2024
3 participants