Add MongoDB Vector Search Tool #319

blink1073 · 2025-06-03T16:34:40Z

I am a maintainer of langchain-mongodb, which I wrapped to create a crewAI tool.

lorenzejay · 2025-06-04T21:11:02Z

@blink1073 nice work. Can we try an approach that uses the MongoDB APIs directly instead of wrapping another package just to call this? I don't want the dependency overhead of langchain here.
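For illustration, here is a minimal sketch of what "direct APIs" could mean here: building an Atlas `$vectorSearch` aggregation pipeline as plain dictionaries, which could then be passed to PyMongo's `Collection.aggregate`. The helper name `build_vector_search_pipeline`, the default index name, the field path, and the oversampling factor are all hypothetical and are not taken from this PR.

```python
from typing import Any


def build_vector_search_pipeline(
    query_vector: list[float],
    index_name: str = "vector_index",  # hypothetical index name
    path: str = "embedding",  # hypothetical field holding the stored vectors
    limit: int = 4,
    oversampling_factor: int = 10,  # assumed ratio of candidates to results
) -> list[dict[str, Any]]:
    """Build an aggregation pipeline that runs an Atlas vector search."""
    return [
        {
            "$vectorSearch": {
                "index": index_name,
                "path": path,
                "queryVector": query_vector,
                # Consider more candidates than are returned to improve recall.
                "numCandidates": limit * oversampling_factor,
                "limit": limit,
            }
        },
        # Attach the similarity score to each returned document.
        {"$set": {"score": {"$meta": "vectorSearchScore"}}},
    ]


pipeline = build_vector_search_pipeline([0.1, 0.2, 0.3], limit=3)
# With PyMongo, this could be executed as: collection.aggregate(pipeline)
```

Keeping the pipeline builder separate from the database call makes it possible to check the query shape without a live cluster.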

blink1073 · 2025-06-05T01:18:03Z

Not a problem, I'll just end up duplicating some code from langchain-mongodb.

blink1073 · 2025-06-09T13:40:11Z

@lorenzejay I've made the requested changes while preserving the API.

- Changed import of EnvVar from tests.utils to crewai.tools in multiple files.
- Updated README.md for MongoDB vector search tool with additional context.
- Modified subprocess command in vector_search.py for package installation.
- Cleaned up test_generate_tool_specs.py to improve mock patching syntax.
- Deleted unused tests/utils.py file.

lorenzejay · 2025-06-10T18:10:43Z

@blink1073 I pushed some fixes. Use crewai==0.126.0.

I'm using this:

if __name__ == "__main__":
    from crewai import Agent, Task, Crew
    from crewai_tools import MongoDBVectorSearchConfig, MongoDBVectorSearchTool

    # Customize the query parameters (here, the result limit).
    query_config = MongoDBVectorSearchConfig(limit=10)
    tool = MongoDBVectorSearchTool(
        database_name="sample_mflix",
        collection_name="embedded_movies",
        connection_string="<>",
        query_config=query_config,
        vector_index_name="_id_",
        generative_model="gpt-4o",
    )

    # Adding the tool to an agent
    rag_agent = Agent(
        name="rag_agent",
        role="You are a helpful assistant that can answer questions with the help of the MongoDBVectorSearchTool.",
        goal="You are a helpful assistant that can answer questions with the help of the MongoDBVectorSearchTool.",
        backstory="You are a helpful assistant that can answer questions with the help of the MongoDBVectorSearchTool.",
        llm="gpt-4o-mini",
        tools=[tool],
    )
    task = Task(
        name="rag_task",
        description="You are a helpful assistant that can answer questions with the help of the MongoDBVectorSearchTool. the query: {query}",
        expected_output="The answer to the question",
        agent=rag_agent,
    )
    crew = Crew(agents=[rag_agent], tasks=[task], verbose=True)
    res = crew.kickoff(inputs={"query": "tell me about the movie: From Hand to Mouth"})
    print("res", res)

I'm getting pretty poor results on the default db. Any help?

blink1073 · 2025-06-10T20:25:37Z

I'll take a look. Here's the integration test I wrote, which will run nightly using our creds: mongodb-labs/ai-ml-pipeline-testing#71.

blink1073 · 2025-06-11T16:23:14Z

Ah, I see the difference: what you're using it for is actually a follow-up capability, searching within the database itself. This initial PR is for vector search only, which is what my example does. It creates embeddings for each page of the PDF and then runs the query against those embeddings.
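To make the distinction concrete, here is a toy sketch of the "embed each page, then query the embeddings" flow described above. It is not the tool's implementation: `toy_embed` is a hypothetical stand-in for a real embedding model (it just counts words over a fixed vocabulary), and the ranking is done in memory with cosine similarity rather than by MongoDB.

```python
import math
from collections import Counter


def toy_embed(text: str, vocab: list[str]) -> list[float]:
    """Hypothetical stand-in for an embedding model: word counts over vocab."""
    counts = Counter(text.lower().split())
    return [float(counts[word]) for word in vocab]


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def search(query: str, pages: list[str], vocab: list[str], limit: int = 2):
    """Rank pages by similarity between their embeddings and the query's."""
    query_vec = toy_embed(query, vocab)
    scored = [(cosine(query_vec, toy_embed(page, vocab)), page) for page in pages]
    scored.sort(key=lambda item: item[0], reverse=True)
    return scored[:limit]


pages = [
    "atlas vector search uses an index over embeddings",
    "the crew runs agents and tasks",
    "an agent calls a tool to answer a task",
]
vocab = sorted({word for page in pages for word in page.split()})
results = search("vector index", pages, vocab, limit=1)
```

The ranking depends only on how close the query is to each page's embedded text, which is why a request such as "movies directed by X" is a field lookup rather than a similarity search.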

blink1073 · 2025-06-11T16:29:56Z

I updated the crewai dependency.

blink1073 · 2025-06-12T14:51:05Z

To clarify, as part of INTPYTHON-332 we would add a mongodb_search_tool directory which could be used to perform your query.

- Removed `auth0-python` package.
- Updated `crewai` version to 0.140.0 and adjusted its dependencies.
- Changed `json-repair` version to 0.25.2.
- Updated `litellm` version to 1.72.6.
- Modified dependency markers for several packages to improve compatibility with Python versions.

lucasgomide

Good work here!

I dropped a few comments; let me know what you think.

crewai_tools/tools/mongodb_vector_search_tool/vector_search.py

…ling and new dimensions field

- Added logging for error handling in the _run method and during client cleanup.
- Introduced a new 'dimensions' field in the MongoDBVectorSearchConfig for embedding vector size.
- Refactored the _run method to return JSON formatted results and handle exceptions gracefully.
- Cleaned up import statements and improved code readability.
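As a rough illustration of the "return JSON, log and handle exceptions" pattern this commit message describes, here is a minimal sketch. It is not the code from vector_search.py: `run_search` and its `search_fn` parameter are hypothetical, and returning an empty string on failure is an assumption.

```python
import json
import logging
from typing import Any, Callable

logger = logging.getLogger(__name__)


def run_search(search_fn: Callable[[str], list[dict[str, Any]]], query: str) -> str:
    """Run search_fn(query) and return the results as a JSON string.

    On any exception, log the error and return an empty string
    (an assumed fallback, not necessarily what the tool does).
    """
    try:
        results = search_fn(query)
        # default=str serializes values json cannot handle natively.
        return json.dumps(results, default=str)
    except Exception as exc:
        logger.error("Vector search failed: %s", exc)
        return ""


def failing_search(query: str) -> list[dict[str, Any]]:
    raise RuntimeError("connection lost")


ok = run_search(lambda q: [{"text": q, "score": 0.9}], "hello")
failed = run_search(failing_search, "hello")
```

Returning a string in both cases means the calling agent always receives something it can read, even when the database call fails.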

blink1073 · 2025-07-08T21:10:37Z

Just a heads up: I'm still working on updating our integration tests in mongodb-labs/ai-ml-pipeline-testing#71. I'll let y'all know when I get it working.

lorenzejay · 2025-07-08T23:12:33Z

@blink1073 can you try running this? I'm getting poor results:

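if __name__ == "__main__":
    from crewai import Agent, Task, Crew
    from crewai_tools import MongoDBVectorSearchTool

    # Point the tool at the sample collection and its embedding field.
    tool = MongoDBVectorSearchTool(
        database_name="sample_mflix",
        collection_name="embedded_movies",
        connection_string="<>",
        embedding_key="plot_embedding",
    )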

    agent = Agent(
        role="MongoDBVectorSearchTool",
        goal="You are a helpful assistant that can answer questions about the MongoDB database.",
        backstory="You are a helpful assistant that can answer questions about the MongoDB database.",
        tools=[tool],
        llm="gpt-4.1",
    )

    task = Task(
        description="get the movies with the director Alfred J. Goulding, use no filters",
        expected_output="The movies with the director Alfred J. Goulding",
        agent=agent,
    )

    crew = Crew(
        agents=[agent],
        tasks=[task],
        verbose=True,
    )
    result = crew.kickoff()
    print("result", result)

lorenzejay · 2025-07-08T23:12:55Z

I'm using the default collection you get when you create a new Mongo instance.

blink1073 · 2025-07-09T13:55:39Z

We had that same conversation last month. ;)

#319 (comment)

lorenzejay · 2025-07-09T14:00:20Z

Do you have an example of it working then? Something I can run to confirm? Let's bring this home today.

blink1073 · 2025-07-09T14:18:09Z

Yes, the integration test is now passing: https://github.com/mongodb-labs/ai-ml-pipeline-testing/pull/71/files#diff-5c01b996bf644e0a14a5aa2a00ec357d24dbe961c3157919a979bc762f1344c4

lorenzejay

LGTM.

blink1073 · 2025-07-09T15:50:32Z

Excellent, thank you both!

* INTPYTHON-580 Design and Implement MongoDBVectorSearchTool
* add implementation
* wip
* wip
* finish tests
* add todo
* refactor to wrap langchain-mongodb
* cleanup
* address review
* Fix usage of EnvVar class
* inline code
* lint
* lint
* fix usage of SearchIndexModel
* Refactor: Update EnvVar import path and remove unused tests.utils module
  - Changed import of EnvVar from tests.utils to crewai.tools in multiple files.
  - Updated README.md for MongoDB vector search tool with additional context.
  - Modified subprocess command in vector_search.py for package installation.
  - Cleaned up test_generate_tool_specs.py to improve mock patching syntax.
  - Deleted unused tests/utils.py file.
* update the crewai dep and the lockfile
* chore: update package versions and dependencies in uv.lock
  - Removed `auth0-python` package.
  - Updated `crewai` version to 0.140.0 and adjusted its dependencies.
  - Changed `json-repair` version to 0.25.2.
  - Updated `litellm` version to 1.72.6.
  - Modified dependency markers for several packages to improve compatibility with Python versions.
* refactor: improve MongoDB vector search tool with enhanced error handling and new dimensions field
  - Added logging for error handling in the _run method and during client cleanup.
  - Introduced a new 'dimensions' field in the MongoDBVectorSearchConfig for embedding vector size.
  - Refactored the _run method to return JSON formatted results and handle exceptions gracefully.
  - Cleaned up import statements and improved code readability.
* address review
* update tests
* debug
* fix test
* fix test
* fix test
* support azure openai

---------

Co-authored-by: lorenzejay <[email protected]>

blink1073 added 11 commits May 21, 2025 08:09

INTPYTHON-580 Design and Implement MongoDBVectorSearchTool

f73f539

add implementation

736275e

wip

d824c0c

wip

65c0a7b

finish tests

bcb8130

add todo

b408c5c

refactor to wrap langchain-mongodb

e69ca9c

cleanup

7f93b9a

address review

5c15fbe

Merge branch 'main' of github.com:crewAIInc/crewAI-tools into INTPYTH…

15adea6

…ON-580

Fix usage of EnvVar class

1b74a65

blink1073 mentioned this pull request Jun 3, 2025

Fix usage of EnvVar class #320

Closed

lorenzejay self-requested a review June 4, 2025 15:23

blink1073 added 2 commits June 9, 2025 08:33

inline code

8655a13

merge from master

7a522d2

blink1073 added 3 commits June 9, 2025 08:42

lint

69749e6

lint

c8168bb

fix usage of SearchIndexModel

6c3820a

lorenzejay self-assigned this Jun 9, 2025

update the crewai dep and the lockfile

10bf7a7

Merge branch 'main' of github.com:crewAIInc/crewAI-tools into INTPYTH…

7fce5f5

…ON-580

lucasgomide reviewed Jul 8, 2025

lorenzejay and others added 2 commits July 8, 2025 10:10

address review

5e3b4b1

blink1073 requested a review from lucasgomide July 8, 2025 18:29

blink1073 added 7 commits July 8, 2025 13:32

update tests

8e47c09

Merge branch 'main' of github.com:crewAIInc/crewAI-tools into INTPYTH…

0a11aef

…ON-580

debug

37d344b

fix test

1b73bb5

fix test

12d0d50

fix test

94bd04c

support azure openai

86f6eb5

lorenzejay approved these changes Jul 9, 2025

lucasgomide approved these changes Jul 9, 2025

lorenzejay merged commit d12ba28 into crewAIInc:main Jul 9, 2025
4 checks passed

blink1073 mentioned this pull request Jul 22, 2025

Fix MongoDBVectorSearchTool serialization and schema #389

Merged
