Semantra looks promising: https://github.com/freedmand/semantra
My GPU can't run this because it doesn't have enough memory for the LLM. I can't run it in CPU-only mode either, because of a bug: #156. Even when that's fixed, I imagine it will be pretty slow.
Edit: It still doesn't work with that fixed; I just get `TimeoutError` while trying to download the models, or `DefaultCPUAllocator: not enough memory` while trying to run them.
Are there any similar tools that I might be able to run?
I think I would be fine with something that uses the GPT API for the actual thinking part, and just does the document ingestion and embeddings locally.
Or even just something that doesn't think at all, and just does an embeddings-based search across many local documents. If I understand correctly, embeddings can enable searching for similar concepts, or they can be used for Q&A-type queries over the documents. Whichever it does, I would want it to return exact quote snippets from the original document for each search result anyway, both so I can read them in their original context and so I can be sure it's not hallucinating or misunderstanding.
Oh, it should be omnilingual, too.
Basically I want to be able to search by concept instead of by keyword.
I think I can roll my own with LangChain or SentenceTransformers, but I haven't learned them yet, and with so many tools in this space, something nicer probably already exists?
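For reference, here is a minimal sketch of the embeddings-only search described above: no LLM, just ranking passages by similarity to the query and returning the exact original text with its source and character offset. It is an illustration under stated assumptions, not how Semantra or any other tool works. The names `split_passages`, `search`, and `toy_embed` are hypothetical. `toy_embed` is a hashed bag-of-words stand-in so the code runs without downloading a model; it matches words, not concepts, and would have to be replaced by a real multilingual embedding model to get concept search.

```python
import math
import re
import zlib
from typing import Callable, List, Sequence, Tuple

Passage = Tuple[str, int, str]  # (source name, character offset, exact text)
Embedder = Callable[[List[str]], List[Sequence[float]]]


def split_passages(text: str, source: str) -> List[Passage]:
    """Split text into blank-line-separated paragraphs, keeping exact quotes."""
    return [
        (source, m.start(), m.group())
        for m in re.finditer(r"[^\n]+(?:\n[^\n]+)*", text)
    ]


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def search(
    query: str, passages: List[Passage], embed: Embedder, top_k: int = 3
) -> List[Tuple[float, str, int, str]]:
    """Return the top_k passages as (score, source, offset, exact quote)."""
    vectors = embed([p[2] for p in passages])
    query_vec = embed([query])[0]
    scored = [(cosine(query_vec, v), p) for v, p in zip(vectors, passages)]
    scored.sort(key=lambda item: item[0], reverse=True)
    return [(score, src, off, quote) for score, (src, off, quote) in scored[:top_k]]


def toy_embed(texts: List[str], dims: int = 64) -> List[List[float]]:
    """ASSUMPTION / placeholder: hashed bag of words, NOT a semantic embedding."""
    vectors = []
    for text in texts:
        vec = [0.0] * dims
        for word in re.findall(r"\w+", text.lower()):
            vec[zlib.crc32(word.encode("utf-8")) % dims] += 1.0
        vectors.append(vec)
    return vectors


# With sentence-transformers installed, a real embedder could be plugged in
# instead of toy_embed, for example (model choice is an assumption):
#   from sentence_transformers import SentenceTransformer
#   model = SentenceTransformer("paraphrase-multilingual-MiniLM-L12-v2")
#   embed = lambda texts: model.encode(texts).tolist()
```

Calling `search("some query", split_passages(text, "notes.txt"), toy_embed)` returns scored tuples whose last element is the untouched paragraph text, so each hit can be checked against the original file at the reported offset. Because the embedder is just a function argument, the embeddings could come from a local model or from a hosted API without changing the search code.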