Support for Hybrid Search in Databricks retriever (query_type="hybrid") #23883
reslleygabriel
announced in
Ideas
Replies: 1 comment
-
It seems to be addressed in this PR, and we are currently waiting for the merge. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Checked
Feature request
Include a new argument for the similarity search methods (similarity_score_threshold retrieval) or a new type of search "hybrid_search"
Databricks documentation
Motivation
Hybrid keyword-similarity search combines vector-based embedding search with traditional keyword-based search techniques. This approach matches exact words in the query while also using a vector-based similarity search to capture the semantic relationships and context of the query.
By integrating these two techniques, hybrid keyword-similarity search retrieves documents that contain not only the exact keywords but also those that are conceptually similar, providing more comprehensive and relevant search results. This method is particularly useful in RAG applications where source data has unique keywords such as SKUs or identifiers that are not well suited to pure similarity search.
Proposal (If applicable)
Include query_type="hybrid" as a parameter in the as_retriever method (as search_type and search_kwargs)
Beta Was this translation helpful? Give feedback.
All reactions