Replies: 2 comments 1 reply
-
When no using websearch to get the docs on the fly. Otherwise if you want to build a RAG, what is the blocker? MCP is only a wrapper over the tools you have. Check the SDK And examples: https://github.com/modelcontextprotocol/python-sdk https://github.com/modelcontextprotocol/python-sdk/tree/main/examples/servers/simple-tool I thinkn the examples are already simple. |
Beta Was this translation helpful? Give feedback.
-
I think it's an important to distinguish between (classic) RAG and tool-based (sometimes called agentic) RAG. The former doesn't use tools, but is based on the application post-processing user messages and injecting context (Augmenting them as the per the A) with data retrieved (as per the R) from somewhere (could be a vector indexed db or some other source). This happens at the application-level, not the tool level. MCP sort of support this via Resource Templates, but it's very awkward. I have an open proposal PR to add a capability specifically for this kind of application layer injection. Tool RAG is a very different kind of beast, as it relies on the model deciding when to retrieve additional context and is usually not based on embeddings and similarity search. It also adds latency and higher token expenditure compared to classic RAG. It is still the superior choice for some use cases. I believe some of the agentic SE applications use agentic search / RAG as a core feature. But classic RAG doesn't really have a good fit in the current MCP capability landscape. You can retrofit it pretty easily by adding your own methods to the server - but that doesn't provide hosts like Cline with any way to use them. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Pre-submission Checklist
Question Category
Your Question
I am using Cline (https://github.com/cline/cline) with Claude 3.5 Sonnet to support my projects realized with fast-changing libraries like LangGraph.
Unfortunately, Claude is not trained on its current API/docs. I would like to configure Cline in a way that when asked to create/correct my codebase, it would consult actual docs.
Do you have an idea how to achieve this?
I was thinking about a custom MDP server with a RAG system based on these docs. I can see two options:
For the first case, a very simple reference RAG can be found e.g. here:
https://github.com/langchain-ai/intro-to-langsmith/tree/main/notebooks/module_0
I would appreciate Your help both in the high-level approach, and implementation.
Beta Was this translation helpful? Give feedback.
All reactions