MongoDB Vector IO Provider for Llama Stack

A MongoDB integration for Llama Stack that provides vector search, full-text search, hybrid search, and graph-enhanced retrieval capabilities using native MongoDB Atlas features.

Features

Vector Search: Semantic similarity search using embedding vectors
Full-Text Search: Keyword-based search with advanced text analysis
Hybrid Search: Combine semantic and keyword search with flexible weighting
Graph-Enhanced Retrieval (TBD): Discover related documents through graph traversal
RankFusion Pipeline: Native MongoDB 8.1+ feature for optimal result ranking
Self-Managed or Atlas: Works with both MongoDB Atlas and self-hosted deployments
Automatic Index Creation: Optimized index provisioning for Atlas environments
Advanced Filtering: Combine vector search with metadata filters

Requirements

Python 3.10+
MongoDB Atlas cluster (recommended) or MongoDB 8.0+ instance
Llama Stack 0.2.0+
pymongo 4.5.0+

Getting Started

You can integrate this provider with Llama Stack using either the external providers directory method (recommended for development) or by installing it as a Python module.

Option 1: External Providers Directory (Development Mode)

This approach is ideal for development as it doesn't require reinstallation after code changes.

1. Clone and Set Up the Repository

# Clone the repository
git clone https://github.com/mongodb-partners/mongodb-llama-stack.git
cd mongodb-llama-stack

# Create a virtual environment (optional but recommended)
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install dependencies and the package in development mode
pip install -r requirements.txt
pip install -e .

2. Configure Your MongoDB Connection

You'll need a MongoDB Atlas cluster or a MongoDB 8.0+ instance with the Search and Vector Search capabilities enabled.

# Set required environment variables
export MONGODB_CONNECTION_STR='mongodb+srv://<username>:<password>@<cluster-address>/'
export MONGODB_NAMESPACE='<database>.<collection>'
export EXTERNAL_PROVIDERS_DIR="$(pwd)/mongodb_llama_stack/providers.d"

For production deployments, consider using a secure method to store and retrieve these credentials.

3. Verify the Connection

# Run the connection test script
python -m mongodb_llama_stack.mongodb.connection_test

# Expected output:
# MongoDB connection successful!
# Server version: 8.x.x
# Available features: vectorSearch, search, rankFusion, etc.

4. Add Provider to Your Llama Stack Configuration

Create or update your run.yaml file with the MongoDB provider:

version: '2'
apis:
  - vector_io
providers:
  vector_io:
    - provider_id: mongodb
      provider_type: remote::mongodb
      config:
        connection_str: ${env.MONGODB_CONNECTION_STR:+}
        namespace: ${env.MONGODB_NAMESPACE:+}
        # Optional configuration:
        # search_mode: vector | full_text | hybrid | native_rank_fusion | hybrid_graph
        # index_name: default
        # text_index_name: text_index
        # text_search_fields: ["title", "content", "description"]
external_providers_dir: ${env.EXTERNAL_PROVIDERS_DIR:=~/.llama/providers.d}

5. Build and Test the Provider

You can use the included build and test script to ensure everything works correctly:

# Make the script executable
chmod +x scripts/build_and_test.sh

# Run build and test script
./scripts/build_and_test.sh

This script will:

Set up a virtual environment
Install all required dependencies
Test the MongoDB connection
Run unit tests for each search mode (vector, full-text, hybrid, graph-enhanced)
Run integration tests against a real MongoDB instance
Generate coverage reports

Available Tests

The testing suite includes:

Unit Tests (tests/test_mongodb_provider.py):
- Test vector search functionality with different configurations
- Test full-text search with various analyzers and fields
- Test hybrid search with different weight configurations
- Test graph-enhanced document discovery
- Test index management and automatic creation
Integration Tests (tests/integration_test.py):
- Test end-to-end document ingestion and retrieval
- Test search accuracy with real vector embeddings
- Test filtering with metadata
- Test performance under various load conditions
- Test server feature detection and fallbacks

6. Build and Run Llama Stack

# Build Llama Stack with your configuration
llama stack build

# Run the Llama Stack server
llama stack run

You can verify the MongoDB provider is working correctly by checking the logs during startup.

Search Modes and Configuration

This provider supports multiple search modes that can be configured according to your needs:

Vector Search - Pure semantic search using embeddings
Full-Text Search - Keyword-based search
Hybrid Search - Combined vector and text search
Native Rank Fusion - Advanced results ranking (MongoDB 8.1+)
Graph-Enhanced Hybrid Search (TBD) - Hybrid search with related document discovery

For detailed configuration of each mode, see the Search Modes Documentation.

Example Usage

For a complete working example, check out the demo script:

# Set required environment variables
export MONGODB_CONNECTION_STR='mongodb+srv://<username>:<password>@<cluster-address>/'
export MONGODB_NAMESPACE='demo.documents'

# Run the demo
python examples/demo.py

This will demonstrate:

Document ingestion with automatic embedding generation
Basic search with hybrid mode (vector + text)
Filtered search using metadata
Various configuration options

Complete Build and Test Process

For a comprehensive build and test of all MongoDB Atlas search integrations:

Set Up MongoDB Atlas
- Create a cluster with Vector Search and Atlas Search enabled
- Create a database user with read/write permissions
- Whitelist your IP address

Clone and Configure

git clone https://github.com/mongodb-partners/mongodb-llama-stack.git
cd mongodb-llama-stack
python -m venv venv
source venv/bin/activate
pip install -r requirements.txt
pip install -e .

Set Environment Variables

export MONGODB_CONNECTION_STR='mongodb+srv://<username>:<password>@<cluster>/'
export MONGODB_NAMESPACE='test.documents'
export EXTERNAL_PROVIDERS_DIR="$(pwd)/mongodb_llama_stack/providers.d"

Test Each Search Mode

# Vector search test
python -m mongodb_llama_stack.mongodb.config_validator --mode vector

# Full text search test
python -m mongodb_llama_stack.mongodb.config_validator --mode full_text

# Hybrid search test
python -m mongodb_llama_stack.mongodb.config_validator --mode hybrid

# Native rank fusion test (MongoDB 8.1+)
python -m mongodb_llama_stack.mongodb.config_validator --mode native_rank_fusion

# Graph-enhanced search test
python -m mongodb_llama_stack.mongodb.config_validator --mode hybrid_graph

Run Comprehensive Test Suite

# Run all unit tests
pytest -xvs tests/test_mongodb_provider.py

# Run integration tests
pytest -xvs tests/integration_test.py

# Run specific test types
pytest -m vector_search
pytest -m text_search
pytest -m hybrid_search

# Generate coverage report
pytest --cov=mongodb_llama_stack tests/

Build and Run with Llama Stack

# Build Llama Stack with MongoDB provider
llama stack build

# Run Llama Stack server
llama stack run

Important Notes

Example configuration: See mongodb_llama_stack/run.yaml for a complete working example
Provider discovery: The file mongodb_llama_stack/providers.d/remote/vector_io/mongodb.yaml defines the provider
Implementation location: mongodb_llama_stack/mongodb/ contains the core provider code

Option 2: Module Installation (Production Use)

This approach is recommended for production deployments or when integrating into existing projects.

1. Install the Package

# Install from PyPI
pip install mongodb-llama-stack

# Or install from your local build
pip install .

2. Reference the Module in Your Configuration

Update your build.yaml or run.yaml to include the MongoDB provider:

providers:
  vector_io:
    - provider_type: remote::mongodb
      module: mongodb_llama_stack
      config:
        connection_str: ${env.MONGODB_CONNECTION_STR:+}
        namespace: ${env.MONGODB_NAMESPACE:+}
        # Additional config options same as Option 1

3. Build and Run Llama Stack

llama stack build
llama stack run

Note: The provider is discovered as remote::mongodb in both integration methods.

Building and Testing

This section covers how to build, test, and validate the MongoDB provider functionality.

Building the Provider

# Clone the repository (if not done already)
git clone https://github.com/mongodb-partners/mongodb-llama-stack.git
cd mongodb-llama-stack

# Create and activate a virtual environment (recommended)
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install development dependencies
pip install -e ".[dev,test]"

# Run the build script
./scripts/build_and_test.sh --build-only

Running Tests

The repository includes unit tests and integration tests to verify functionality:

# Run all tests (requires MongoDB connection)
./scripts/build_and_test.sh

# Run only unit tests (no MongoDB connection needed)
./scripts/build_and_test.sh --unit

# Run integration tests (requires MongoDB connection)
./scripts/build_and_test.sh --integration

Testing Specific Features

Test individual search capabilities:

Vector Search

export MONGODB_CONNECTION_STR="mongodb+srv://<user>:<password>@<host>/"
export MONGODB_NAMESPACE="test_llama.vector_test"
python examples/demo.py

Text Search and Hybrid Search

Run the demo script with specific search modes:

# Test text search
export MONGODB_SEARCH_MODE="full_text"
python examples/demo.py

# Test hybrid search 
export MONGODB_SEARCH_MODE="hybrid"
python examples/demo.py

MongoDB Environment Setup

MongoDB Atlas (Recommended)

Atlas is the fully managed cloud database service that provides the best experience for this provider.

Create a MongoDB Atlas account
Create a new cluster (M0 free tier works for testing)
Create a database user with read/write privileges
Whitelist your IP address in the Network Access settings
Get your connection string from the Atlas Dashboard:
- Click "Connect" on your cluster
- Choose "Connect your application"
- Select the appropriate driver version
- Copy the provided connection string

Self-hosted MongoDB

If using a self-hosted MongoDB instance, ensure you're running MongoDB 8.0+ for full feature support:

# Check MongoDB version
mongosh --eval "db.version()"

For full functionality, we recommend:

MongoDB 8.0+ for basic vector search
MongoDB 8.1+ for native rank fusion and advanced hybrid search
Proper index configuration for your collections

Provider Configuration

Basic Configuration

The MongoDB provider can be configured programmatically or through environment variables. Here's how to set it up:

Programmatic Configuration

from mongodb_llama_stack.mongodb.config import MongoDBIOConfig

# Create configuration object with minimum required settings
config = MongoDBIOConfig(
    connection_str="mongodb+srv://username:[email protected]/",
    namespace="mydb.mycollection"
)

# Optional: Add adapter configuration
from mongodb_llama_stack.mongodb.mongodb import MongoDBIOAdapter
from llama_stack.apis.inference import InferenceAPI

adapter = MongoDBIOAdapter(config, inference_api)
await adapter.initialize()

Environment Variables

Store your configuration in environment variables or a .env file:

# Required settings
MONGODB_CONNECTION_STR=mongodb+srv://username:[email protected]/
MONGODB_NAMESPACE=mydb.mycollection

# Optional settings
MONGODB_SEARCH_MODE=hybrid
MONGODB_INDEX_NAME=vector_index
MONGODB_TEXT_INDEX_NAME=text_index
MONGODB_EMBEDDINGS_KEY=embeddings

Validation and Troubleshooting

You can validate your configuration using the included test script:

# Run configuration validation
python -m mongodb_llama_stack.tests.config_validator

# Check MongoDB connection
python -m mongodb_llama_stack.tests.connection_test

Search Modes

The MongoDB provider offers multiple search modes to fit different use cases. Choose the best mode based on your retrieval needs.

1. Vector Search

Best for: Semantic similarity search using embeddings

Vector search excels at finding conceptually similar content even when exact keywords don't match, making it ideal for semantic retrieval tasks.

from mongodb_llama_stack.mongodb.config import MongoDBIOConfig, SearchMode

config = MongoDBIOConfig(
    connection_str="${env.MONGODB_CONNECTION_STR}",
    namespace="${env.MONGODB_NAMESPACE}",
    search_mode=SearchMode.VECTOR,
    embeddings_key="embeddings",  # Field containing vector embeddings
    index_name="vector_index"
)

YAML Configuration:

provider_id: mongodb
provider_type: remote::mongodb
config:
  connection_str: ${env.MONGODB_CONNECTION_STR:+}
  namespace: ${env.MONGODB_NAMESPACE:+}
  search_mode: vector
  embeddings_key: embeddings
  index_name: vector_index

2. Full-Text Search

Best for: Keyword-based search with exact matches

Full-text search is optimal for finding documents containing specific words, phrases, or terms, with advanced text analysis capabilities.

config = MongoDBIOConfig(
    connection_str="${env.MONGODB_CONNECTION_STR}",
    namespace="${env.MONGODB_NAMESPACE}",
    search_mode=SearchMode.FULL_TEXT,
    text_index_name="text_index",
    text_search_fields=["title", "content", "description"]
)

YAML Configuration:

provider_id: mongodb
provider_type: remote::mongodb
config:
  connection_str: ${env.MONGODB_CONNECTION_STR:+}
  namespace: ${env.MONGODB_NAMESPACE:+}
  search_mode: full_text
  text_index_name: text_index
  text_search_fields: ["title", "content", "description"]

3. Hybrid Search

Best for: Balanced retrieval combining semantic understanding with keyword matching

Hybrid search combines vector similarity with text matching to get the best of both worlds, ideal for robust RAG applications.

config = MongoDBIOConfig(
    connection_str="${env.MONGODB_CONNECTION_STR}",
    namespace="${env.MONGODB_NAMESPACE}",
    search_mode=SearchMode.HYBRID,
    embeddings_key="embeddings",
    text_search_fields=["title", "content"],
    text_index_name="text_index",
    hybrid_alpha=0.7  # 70% weight to vector, 30% to text
)

YAML Configuration:

provider_id: mongodb
provider_type: remote::mongodb
config:
  connection_str: ${env.MONGODB_CONNECTION_STR:+}
  namespace: ${env.MONGODB_NAMESPACE:+}
  search_mode: hybrid
  embeddings_key: embeddings
  text_search_fields: ["title", "content"]
  text_index_name: text_index
  hybrid_alpha: 0.7

4. Native Rank Fusion (MongoDB 8.1+)

Best for: Advanced multi-pipeline search with fine-grained control

Uses MongoDB's native $rankFusion operator for optimal performance and precise control over search pipelines.

config = MongoDBIOConfig(
    connection_str="${env.MONGODB_CONNECTION_STR}",
    namespace="${env.MONGODB_NAMESPACE}",
    search_mode=SearchMode.NATIVE_RANK_FUSION,
    rank_fusion_pipelines=[
        {
            "name": "vector_pipeline",
            "type": "vectorSearch",
            "weight": 1.5,  # Higher weight for vector search
            "limit": 20,
            "config": {
                "numCandidates": 100,
                "index": "vector_index"
            }
        },
        {
            "name": "text_pipeline",
            "type": "search",
            "weight": 1.0,
            "limit": 20,
            "config": {
                "index": "text_index",
                "operator": "phrase"  # or "text", "compound"
            }
        }
    ],
    enable_score_details=True  # Get detailed scoring information
)

YAML Configuration:

provider_id: mongodb
provider_type: remote::mongodb
config:
  connection_str: ${env.MONGODB_CONNECTION_STR:+}
  namespace: ${env.MONGODB_NAMESPACE:+}
  search_mode: native_rank_fusion
  rank_fusion_pipelines:
    - name: vector_pipeline
      type: vectorSearch
      weight: 1.5
      limit: 20
      config:
        numCandidates: 100
        index: vector_index
    - name: text_pipeline
      type: search
      weight: 1.0
      limit: 20
      config:
        index: text_index
        operator: phrase
  enable_score_details: true

Complete Setup and Testing Workflow

This section provides a step-by-step guide to set up, build, test, and run the MongoDB provider with Llama Stack.

1. Installation and Setup

# Clone the repository
git clone https://github.com/mongodb-partners/mongodb-llama-stack.git
cd mongodb-llama-stack

# Create a virtual environment
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install the provider with development dependencies
pip install -e ".[dev,test]"

2. Configure MongoDB Connection

# Set required environment variables
export MONGODB_CONNECTION_STR='mongodb+srv://username:[email protected]/'
export MONGODB_NAMESPACE='mydb.mycollection'
export EXTERNAL_PROVIDERS_DIR="$(pwd)/mongodb_llama_stack/providers.d"

3. Validate Installation and Connection

# Verify configuration and connection
python -m mongodb_llama_stack.tests.connection_test

4. Run Tests

# Run basic tests
pytest tests/test_mongodb_provider.py -v

# Run integration tests (requires active MongoDB connection)
python tests/integration_test.py

5. Try the Demo

# Run the demo script to see different search modes in action
python examples/demo.py

6. Build and Run Llama Stack

# Set up Llama Stack with the provider
export EXTERNAL_PROVIDERS_DIR="$(pwd)/mongodb_llama_stack/providers.d"
llama stack build
llama stack run

7. Verify Provider Registration

After starting Llama Stack, you can verify that the provider is registered:

# Check provider registration
curl http://localhost:8321/registry/providers | jq

You should see remote::mongodb listed in the providers.

8. Use in Applications

Now you can use the provider in your applications that interact with Llama Stack:

# Configure llama-stack-client to use your server
llama-stack-client configure --endpoint http://localhost:8321 --api-key none

# Test vector search using the client
llama-stack-client vector-io insert-chunks \
  --vector-db-id my_vector_db \
  --provider-id mongodb \
  --content "Test document for MongoDB vector search"

Advanced Usage Examples

For detailed examples showcasing various usage scenarios of the MongoDB provider with Llama Stack, see the Examples Documentation.

For a quick-start example, check the demo script.

Contributing

For information on how to contribute to this project, please see the contributing guidelines.

License

This project is licensed under the Apache License 2.0. Portions of the code are derived from Meta’s Llama Stack project, licensed under the MIT License.

See the full LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
docs		docs
examples		examples
mongodb_llama_stack		mongodb_llama_stack
scripts		scripts
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pytest.ini		pytest.ini
requirements-test.txt		requirements-test.txt
requirements.txt		requirements.txt
setup.py		setup.py

License

mongodb-partners/mongodb-llama-stack

Folders and files

Latest commit

History

Repository files navigation

MongoDB Vector IO Provider for Llama Stack

Features

Requirements

Getting Started

Option 1: External Providers Directory (Development Mode)

1. Clone and Set Up the Repository

2. Configure Your MongoDB Connection

3. Verify the Connection

4. Add Provider to Your Llama Stack Configuration

5. Build and Test the Provider

Available Tests

6. Build and Run Llama Stack

Search Modes and Configuration

Example Usage

Complete Build and Test Process

Important Notes

Option 2: Module Installation (Production Use)

1. Install the Package

2. Reference the Module in Your Configuration

3. Build and Run Llama Stack

Building and Testing

Building the Provider

Running Tests

Testing Specific Features

Vector Search

Text Search and Hybrid Search

MongoDB Environment Setup

MongoDB Atlas (Recommended)

Self-hosted MongoDB

Provider Configuration

Basic Configuration

Programmatic Configuration

Environment Variables

Validation and Troubleshooting

Search Modes

1. Vector Search

2. Full-Text Search

3. Hybrid Search

4. Native Rank Fusion (MongoDB 8.1+)

Complete Setup and Testing Workflow

1. Installation and Setup

2. Configure MongoDB Connection

3. Validate Installation and Connection

4. Run Tests

5. Try the Demo

6. Build and Run Llama Stack

7. Verify Provider Registration

8. Use in Applications

Advanced Usage Examples

Contributing

License

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Languages

Packages