
Commit 9a2dbfd

Update Blog “part-8-agentic-ai-and-qdrant-building-semantic-memory-with-mcp-protocol”
1 parent 7d8a788 commit 9a2dbfd

File tree: 1 file changed (+31 -92 lines)

content/blog/part-8-agentic-ai-and-qdrant-building-semantic-memory-with-mcp-protocol.md

Lines changed: 31 additions & 92 deletions

date: 2025-07-21T10:50:25.839Z
author: Dinesh R Singh
authorimage: /img/dinesh-192-192.jpg
disable: false
tags:
- MCP
- Agentic AI
- Generative AI
---
As Agentic AI systems evolve from reactive language models to structured thinkers, a new challenge emerges — how do we give these agents memory? Not just logs or files, but real, searchable memory that understands context. Enter Qdrant and the Model Context Protocol (MCP) — a modular pairing that brings semantic search and knowledge storage to agent workflows.
[Inspired by my Medium post](https://dineshr1493.medium.com/all-you-need-to-know-about-the-evolution-of-generative-ai-to-agentic-ai-part-8-agentic-ai-mcp-281567e26838), this article explores how MCP standardizes interactions between intelligent agents and vector databases like Qdrant. By enabling seamless storage and retrieval of embeddings, agents can now “remember” useful information and leverage it in future reasoning.

Let’s walk through the full architecture and code implementation of this cutting-edge pattern.

## Why this matters: Agentic AI + MCP

In Agentic AI, a language model doesn’t just generate — it thinks, acts, and reflects using external tools. That’s where MCP comes in.

Qdrant itself is a high-performance vector database — capable of powering semantic search. On its own, though, it has no interface an LLM agent can call directly.

This is solved by wrapping Qdrant inside an MCP server, giving agents a semantic API they can call like a function.
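
Because MCP speaks JSON-RPC under the hood, “call like a function” has a concrete wire shape. As an illustrative sketch (the message shape follows the MCP spec; it is not a capture from a live session), a qdrant-find call travels roughly like this:

```
{
  "jsonrpc": "2.0",
  "id": 1,
  "method": "tools/call",
  "params": {
    "name": "qdrant-find",
    "arguments": { "query": "order 1234 delay" }
  }
}
```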

### Architecture Overview

```
[LLM Agent]
  |
  |-- [MCP Client]
  |
[MCP Protocol]
  |
  |-- [Qdrant MCP Server]
  |     |-- Tool: qdrant-store
  |     |-- Tool: qdrant-find
  |
[Qdrant Vector DB]
```

### Use Case: Support Ticket Memory for AI Assistants

Imagine an AI assistant answering support queries.
* It has semantic memory from prior support logs stored in Qdrant.
* It uses qdrant-find to semantically retrieve similar issues and then formulates a contextual response.

## Step-by-Step Implementation

### Step 1: Launch Qdrant MCP Server

```
export COLLECTION_NAME="support-tickets"
export QDRANT_LOCAL_PATH="./qdrant_local_db"
export EMBEDDING_MODEL="sentence-transformers/all-MiniLM-L6-v2"
```

```
uvx mcp-server-qdrant --transport sse
```

* COLLECTION_NAME: Name of the Qdrant collection
* QDRANT_LOCAL_PATH: Local vector DB storage path
* EMBEDDING_MODEL: Embedding model for vectorization
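
The server can also point at a remote Qdrant instance instead of the embedded local store. A hypothetical sketch of the env block you would pass in Step 2, assuming the server’s documented QDRANT_URL option (which replaces QDRANT_LOCAL_PATH):

```
# Hypothetical: remote Qdrant instead of the embedded local store.
# QDRANT_URL replaces QDRANT_LOCAL_PATH; add QDRANT_API_KEY if the instance needs auth.
env = {
    "QDRANT_URL": "http://localhost:6333",
    "COLLECTION_NAME": "support-tickets",
    "EMBEDDING_MODEL": "sentence-transformers/all-MiniLM-L6-v2",
}
```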

### Step 2: Connect the MCP Client

```
from mcp import ClientSession, StdioServerParameters
from mcp.client.stdio import stdio_client

async def main():
    server_params = StdioServerParameters(
        command="uvx",
        args=["mcp-server-qdrant"],
        env={
            "QDRANT_LOCAL_PATH": "./qdrant_local_db",
            "COLLECTION_NAME": "support-tickets",
            "EMBEDDING_MODEL": "sentence-transformers/all-MiniLM-L6-v2"
        }
    )

    async with stdio_client(server_params) as (read, write):
        async with ClientSession(read, write) as session:
            await session.initialize()
            tools = await session.list_tools()
            print(tools)
```

```
Expected Output: Lists tools like qdrant-store, qdrant-find
```

### Step 3: Ingest a New Memory

```
ticket_info = "Order #1234 was delayed due to heavy rainfall in transit zone."

result = await session.call_tool("qdrant-store", arguments={
    "information": ticket_info,
    "metadata": {"order_id": 1234}
})
```
This stores an embedded version of the text in Qdrant.
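
In a real deployment you would seed the collection with many past tickets the same way. A minimal sketch reusing the same tool inside the session from Step 2 (the ticket texts here are made up for illustration):

```
# Seed semantic memory with several past tickets (illustrative data).
past_tickets = [
    ("Order #1001 arrived damaged; a replacement was shipped.", 1001),
    ("Order #1087 was delayed by a customs inspection.", 1087),
]
for text, order_id in past_tickets:
    await session.call_tool("qdrant-store", arguments={
        "information": text,
        "metadata": {"order_id": order_id},
    })
```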

### Step 4: Perform a Semantic Search

```
query = "Why was order 1234 delayed?"

search_response = await session.call_tool("qdrant-find", arguments={
    "query": query
})
```

Example Output:

```
[
  {
    "content": "Order #1234 was delayed due to heavy rainfall in transit zone.",
    "metadata": {"order_id": 1234}
  }
]
```

### Step 5: Use with LLM

```
import openai

context = "\n".join([r["content"] for r in search_response])

prompt = f"""
You are a helpful assistant. Use this context to answer:
\"\"\"
{context}
\"\"\"
Question: Why was order #1234 delayed?
"""

response = openai.ChatCompletion.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": prompt}]
)

print(response["choices"][0]["message"]["content"])
```

```
Final Answer:
"Order #1234 was delayed due to heavy rainfall in the transit zone."
```

## Pro Tip: Chain MCP servers

You can deploy multiple MCP servers for different tools and plug them into agent workflows; see the sketch below.

Then orchestrate it all using Agentic AI Teams to perform high-level, multi-tool reasoning.
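
As a concrete illustration, here is a minimal, hypothetical sketch of one client driving two MCP servers at once. It assumes the reference mcp-server-fetch package and its fetch tool alongside the Qdrant server from this article; adapt the names to whatever servers you actually run:

```
from mcp import ClientSession, StdioServerParameters
from mcp.client.stdio import stdio_client

memory = StdioServerParameters(
    command="uvx", args=["mcp-server-qdrant"],
    env={"QDRANT_LOCAL_PATH": "./qdrant_local_db",
         "COLLECTION_NAME": "support-tickets"}
)
web = StdioServerParameters(command="uvx", args=["mcp-server-fetch"])

async def run():
    # One stdio session per server; the agent routes each tool call
    # to whichever session owns that tool.
    async with stdio_client(memory) as (mr, mw), stdio_client(web) as (wr, ww):
        async with ClientSession(mr, mw) as mem, ClientSession(wr, ww) as fetch:
            await mem.initialize()
            await fetch.initialize()
            # e.g. recall stored context, then pull a live page for fresh facts
            hits = await mem.call_tool("qdrant-find",
                                       arguments={"query": "order 1234 delay"})
            page = await fetch.call_tool("fetch",
                                         arguments={"url": "https://status.example.com"})
```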

## Final Thought

By pairing Qdrant with MCP, Agentic AI gains powerful, semantic memory — a critical enabler of contextual understanding and long-term knowledge retention. This pattern abstracts the complexity of vector DBs behind a unified protocol, empowering agents to think, recall, and act without manual data plumbing.
