Replies: 2 comments
-
|
PR proposed here: #329 |
Beta Was this translation helpful? Give feedback.
0 replies
-
|
This has been implemented in the PR mentioned |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Current architecture
Currently we have two distinct approaches to RAG based on the type of data.
For unstructured data (usually
text,htmletc.), we generate embeddings and store in a Vector DB (Milvus). When querying we convert the query to embeddings, and do a similarity search in the vector space to get the most relavant chunks.For structured data (usually
JSON,YAMLetc.) we store them in a Labeled property graph (Neo4J), we then run analyse the data using LLM to generate the relations. When querying, an agent has access to the database to do queries and traverse the graph.Problems
Both approaches has worked decently in isolation and for the data they support and the use-cases targeted by AI platform engineer.
Proposed architecture
In this discussion, we propose to create a Unified system to do RAG thats specialised for Platform engineering.
The system will have the following for ingestion/indexing:
The system will have the following for querying:
The following diagram shows how this can be done:
Beta Was this translation helpful? Give feedback.
All reactions