Using as a "traditional" search engine? #370
Closed
BarrySmith
started this conversation in
Ideas
Replies: 2 comments 2 replies
-
|
@BarrySmith -- thanks for the proposal. Can you explain more what you would like to see / have? |
Beta Was this translation helpful? Give feedback.
1 reply
-
|
I see. So, Kreuzberg is focused on text-extraction and whats called "document intelligence". Its not a search engine in this regard. Its adjacent but not what you are looking for. You can integrate Kreuzberg as part of a pipeline for full-text search. There are libraries for this in TS/JS, Python, Rust and many others. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
This is an amazing technical contribution. But it appears to be focused "completely?" on gathering data for use with embeddings/RAG? Am I accurate in that assessment?
After having using both RAG and "classical" website search engines (for example what comes with Sphinx websites) I am convinced that "classical" non-embedding based text search engineers are often much better than RAG both for feeding directly to humans and for feeding into LLM prompts. So my question is have you considered adding classical search technology to the data you gather? What I mean be this is providing multiple language APIs for "searching" the processed data, allowing developers the freedom to then display results they want or to find the results onto further pipelines. (For example, though Sphinx websites are great at search, it is all hardwired directly into the webpage environment and there is no API for calling the search outside of that environment (for example, pure JavaScript or Python). I, for one, would start using such a search API immediately if you did. Or I am missing something you already provide and just don't emphasize in your front facing documentation?
Thanks
Beta Was this translation helpful? Give feedback.
All reactions