INTPYTHON-580 Design and Implement MongoDBVectorSearchTool #1

blink1073 · 2025-05-21T21:03:28Z

Integration tests: mongodb-labs/ai-ml-pipeline-testing#71

caseyclements

LGTM with comments.

caseyclements · 2025-06-03T14:19:33Z

crewai_tools/tools/mongodb_vector_search_tool/README.md

+    collection_name='example_collections',
+    connection_string="<your_mongodb_connection_string>",
+    query_config=query_config,
+    index_name="my_vector_index",


I'd call this vector_index_name. It is explicit and will avoid backwards compatibility issues is this gets adoption and we want access to other search types.

caseyclements · 2025-06-03T14:24:22Z

crewai_tools/tools/mongodb_vector_search_tool/README.md

+from crewai_tools import MongoDBVectorSearchConfig, MongoDBVectorSearchTool
+
+# Setup custom embedding model and customize the parameters.
+query_config = MongoDBVectorSearchConfig(limit=10, oversampling_factor=2)


It feels like you could combine the two examples. "MongoDBVectorSearchTool provides a number of configurable parameters. The kwarg query_config takes a MongoDBVectorSearchConfig. For example....

On the vector index, is this automatically created? Is it clear what will you vectorized? It's worth noting that embedding models can embed any text, from plain text to embedded json.

I like showing the simplest case so they can copy-paste and get rolling. No, the vector index has to be explicitly created using create_vector_search_index. I don't follow the embedded json part, the type annotation for add_texts is texts: Iterable[str].

Just thoughts. This looks great. Wrapping langchain_mongodb was a brilliant move.

caseyclements · 2025-06-03T14:28:55Z

crewai_tools/tools/mongodb_vector_search_tool/vector_search.py

+
+    query: str = Field(
+        ...,
+        description="The query to search retrieve relevant information from the MongoDB database. Pass only the query, not the question.",


What's the difference between query and question? Is that a CrewAI thing?

I was following prior examples.

…ON-580

This version contains a dedicated fix fro CrewAIAdapter when schema doesn't allow null

- Changed import of EnvVar from tests.utils to crewai.tools in multiple files. - Updated README.md for MongoDB vector search tool with additional context. - Modified subprocess command in vector_search.py for package installation. - Cleaned up test_generate_tool_specs.py to improve mock patching syntax. - Deleted unused tests/utils.py file.

…rewAIInc#325)

…6.0 (crewAIInc#327)

crewAIInc#331) * refactor: remove token validation from EnterpriseActionKitToolAdapter and CrewaiEnterpriseTools This commit simplifies the initialization of the EnterpriseActionKitToolAdapter and CrewaiEnterpriseTools by removing the explicit validation for the enterprise action token. The token can now be set to None without raising an error, allowing for more flexible usage. * added loggers for monitoring * fixed typo

* feat: add explictly package_dependencies in the Tools * feat: collect package_dependencies from Tool to add in tool.specs.json * feat: add default value in run_params Tool' specs * fix: support get boolean values This commit also refactor test to make easier define newest attributes into a Tool

crewAIInc#332) (crewAIInc#333) We’re currently using the JSON Schema standard for these fields

This change allows accessing tools by name (tools["tool_name"]) in addition to index (tools[0]), making it more intuitive and convenient to work with multiple tools without needing to remember their position in the list

* Add Oxylabs tools * Review updates * Add package_dependencies attribute

* feat: support to complex filter on ToolCollection * refactor: use proper tool collection methot to filter tool in CrewAiEnterpriseTools * feat: allow to filter available MCP tools

* refactor: remove token validation from EnterpriseActionKitToolAdapter and CrewaiEnterpriseTools This commit simplifies the initialization of the EnterpriseActionKitToolAdapter and CrewaiEnterpriseTools by removing the explicit validation for the enterprise action token. The token can now be set to None without raising an error, allowing for more flexible usage. * added loggers for monitoring * fixed typo * fix: enhance token handling in EnterpriseActionKitToolAdapter and CrewaiEnterpriseTools This commit improves the handling of the enterprise action token by allowing it to be fetched from environment variables if not provided. It adds checks to ensure the token is set before making API requests, enhancing robustness and flexibility. * removed redundancy * test: add new test for environment token fallback in CrewaiEnterpriseTools This update introduces a new test case to verify that the environment token is used when no token is provided during the initialization of CrewaiEnterpriseTools. Additionally, minor formatting adjustments were made to existing assertions for consistency. * test: update environment token test to clear environment variables This change modifies the test for CrewaiEnterpriseTools to ensure that the environment variables are cleared before setting the test token. This ensures a clean test environment and prevents potential interference from other tests. * drop redundancy

…crewAIInc#346) * feat: add support for parsing actions list from environment variables This commit introduces a new function, _parse_actions_list, to handle the parsing of a string representation of a list of tool names from environment variables. The CrewaiEnterpriseTools now utilizes this function to filter tools based on the parsed actions list, enhancing flexibility in tool selection. Additionally, a new test case is added to verify the correct usage of the environment actions list. * test: simplify environment actions list test setup This commit refactors the test for CrewaiEnterpriseTools to streamline the setup of environment variables. The environment token and actions list are now set in a single patch.dict call, improving readability and reducing redundancy in the test code.

…andling (crewAIInc#351) - Added TYPE_CHECKING imports for FirecrawlApp to enhance type safety. - Updated configuration keys in FirecrawlCrawlWebsiteTool and FirecrawlScrapeWebsiteTool to camelCase for consistency. - Introduced error handling in the _run methods of both tools to ensure FirecrawlApp is properly initialized before usage. - Adjusted parameters passed to crawl_url and scrape_url methods to use 'params' instead of unpacking the config dictionary directly.

Signed-off-by: Emmanuel Ferdman <[email protected]>

) * refactor: enhance schema handling in EnterpriseActionTool - Extracted schema property and required field extraction into separate methods for better readability and maintainability. - Introduced methods to analyze field types and create Pydantic field definitions based on nullability and requirement status. - Updated the _run method to handle required nullable fields, ensuring they are set to None if not provided in kwargs. * refactor: streamline nullable field handling in EnterpriseActionTool - Removed commented-out code related to handling required nullable fields for clarity. - Simplified the logic in the _run method to focus on processing parameters without unnecessary comments.

…and uv.lock (crewAIInc#356)

…c#357)

…ON-580

- Removed `auth0-python` package. - Updated `crewai` version to 0.140.0 and adjusted its dependencies. - Changed `json-repair` version to 0.25.2. - Updated `litellm` version to 1.72.6. - Modified dependency markers for several packages to improve compatibility with Python versions.

* - Added CouchbaseFTSVectorStore as a CrewAI tool. - Wrote a README to setup the tool. - Wrote test cases. - Added Couchbase as an optional dependency in the project. * Fixed naming in some places. Added docstrings. Added instructions on how to create a vector search index. * Fixed pyproject.toml * error handling and response format - Removed unnecessary ImportError for missing 'couchbase' package. - Changed response format from a concatenated string to a JSON array for search results. - Updated error handling to return error messages instead of raising exceptions in certain cases. - Adjusted tests to reflect changes in response format and error handling. * Update dependencies in pyproject.toml and uv.lock - Changed pydantic version from 2.6.1 to 2.10.6 in both pyproject.toml and uv.lock. - Updated crewai-tools version from 0.42.2 to 0.42.3 in uv.lock. - Adjusted pydantic-core version from 2.33.1 to 2.27.2 in uv.lock, reflecting the new pydantic version. * Removed restrictive pydantic version and updated uv.lock * synced lockfile * regenerated lockfile * updated lockfile * regenerated lockfile * Update tool specifications for * Fix test cases --------- Co-authored-by: AayushTyagi1 <[email protected]> Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

…ling and new dimensions field - Added logging for error handling in the _run method and during client cleanup. - Introduced a new 'dimensions' field in the MongoDBVectorSearchConfig for embedding vector size. - Refactored the _run method to return JSON formatted results and handle exceptions gracefully. - Cleaned up import statements and improved code readability.

…ON-580

blink1073 added 6 commits May 21, 2025 08:09

INTPYTHON-580 Design and Implement MongoDBVectorSearchTool

f73f539

add implementation

736275e

wip

d824c0c

wip

65c0a7b

finish tests

bcb8130

add todo

b408c5c

blink1073 closed this May 21, 2025

blink1073 reopened this May 21, 2025

blink1073 added 2 commits May 23, 2025 14:16

refactor to wrap langchain-mongodb

e69ca9c

cleanup

7f93b9a

blink1073 mentioned this pull request May 27, 2025

INTPYTHON-580 Add CrewAI Integration Tests mongodb-labs/ai-ml-pipeline-testing#71

Merged

caseyclements approved these changes Jun 3, 2025

View reviewed changes

blink1073 and others added 18 commits June 3, 2025 10:33

address review

5c15fbe

Merge branch 'main' of github.com:crewAIInc/crewAI-tools into INTPYTH…

15adea6

…ON-580

Fix usage of EnvVar class

1b74a65

build: upgrade mcpadapt version (crewAIInc#323)

8412995

This version contains a dedicated fix fro CrewAIAdapter when schema doesn't allow null

chore: update version to 0.47.0 in pyproject.toml and uv.lock

21cec5b

Merge pull request crewAIInc#324 from crewAIInc/lorenze/version-0.47.0

13e374d

inline code

8655a13

merge from master

7a522d2

lint

69749e6

lint

c8168bb

fix usage of SearchIndexModel

6c3820a

fix: ensure the entire file will be read when the start_line is None (c…

d8729cf

…rewAIInc#325)

update the crewai dep and the lockfile

10bf7a7

chore: update version to 0.47.1 and upgrade crewai dependency to 0.12…

d910c7f

…6.0 (crewAIInc#327)

refactor: renaming init_params and run_params to reflect their schema. (

9ad5991

crewAIInc#332) (crewAIInc#333) We’re currently using the JSON Schema standard for these fields

lucasgomide and others added 29 commits June 20, 2025 08:06

feat: mapping explicitly tool environment variables (crewAIInc#338)

a158605

fix: add support for case-insensitive Enterprise filter (crewAIInc#340)

ccbb3f4

Add Oxylabs Web Scraping tools (crewAIInc#312)

9a21d05

* Add Oxylabs tools * Review updates * Add package_dependencies attribute

feat: support api_key fallback to EXA_API_KEY env-var (crewAIInc#341)

1773033

Support to filter available MCP Tools (crewAIInc#345)

923c7b0

* feat: support to complex filter on ToolCollection * refactor: use proper tool collection methot to filter tool in CrewAiEnterpriseTools * feat: allow to filter available MCP tools

new version of tools (crewAIInc#347)

60d4e49

build: fix command to generate tools specs on CI (crewAIInc#350)

e1e3299

Mapping required env vars of more tools (crewAIInc#353)

c8e0d73

update tool.spec.json (crewAIInc#354)

a491d61

fix: update Pydantic schema access (crewAIInc#337)

94c173d

Signed-off-by: Emmanuel Ferdman <[email protected]>

chore: update crewai dependency version to 0.134.0 in pyproject.toml …

1c0ed0e

…and uv.lock (crewAIInc#356)

chore: bump version to 0.49.0 in pyproject.toml and uv.lock (crewAIIn…

a3a5bdc

…c#357)

Merge branch 'main' of github.com:crewAIInc/crewAI-tools into INTPYTH…

7fce5f5

…ON-580

address review

5e3b4b1

update tests

8e47c09

Merge branch 'main' of github.com:crewAIInc/crewAI-tools into INTPYTH…

0a11aef

…ON-580

debug

37d344b

fix test

1b73bb5

fix test

12d0d50

fix test

94bd04c

support azure openai

86f6eb5

blink1073 closed this Jul 9, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

INTPYTHON-580 Design and Implement MongoDBVectorSearchTool #1

INTPYTHON-580 Design and Implement MongoDBVectorSearchTool #1

Uh oh!

blink1073 commented May 21, 2025 •

edited

Loading

Uh oh!

caseyclements left a comment

Uh oh!

caseyclements Jun 3, 2025

Uh oh!

blink1073 Jun 3, 2025

Uh oh!

caseyclements Jun 3, 2025

Uh oh!

blink1073 Jun 3, 2025

Uh oh!

caseyclements Jun 3, 2025

Uh oh!

caseyclements Jun 3, 2025

Uh oh!

blink1073 Jun 3, 2025

Uh oh!

Uh oh!

INTPYTHON-580 Design and Implement MongoDBVectorSearchTool #1

INTPYTHON-580 Design and Implement MongoDBVectorSearchTool #1

Uh oh!

Conversation

blink1073 commented May 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

caseyclements left a comment

Choose a reason for hiding this comment

Uh oh!

caseyclements Jun 3, 2025

Choose a reason for hiding this comment

Uh oh!

blink1073 Jun 3, 2025

Choose a reason for hiding this comment

Uh oh!

caseyclements Jun 3, 2025

Choose a reason for hiding this comment

Uh oh!

blink1073 Jun 3, 2025

Choose a reason for hiding this comment

Uh oh!

caseyclements Jun 3, 2025

Choose a reason for hiding this comment

Uh oh!

caseyclements Jun 3, 2025

Choose a reason for hiding this comment

Uh oh!

blink1073 Jun 3, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

blink1073 commented May 21, 2025 •

edited

Loading