Skip to content

Commit 32d50b8

Browse files
committed
first bit of workshop 2 - Add GraphRAG workshop notes and images from AI Engineer Summit
1 parent 8d938b5 commit 32d50b8

File tree

6 files changed

+121
-17
lines changed

6 files changed

+121
-17
lines changed

assets/image_1740246298715_0.png

1.86 MB
Loading

assets/image_1740246700998_0.png

540 KB
Loading

assets/image_1740246723611_0.png

600 KB
Loading

assets/image_1740246865344_0.png

1.71 MB
Loading

journals/2025_02_22.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,8 @@
22
- [[AI/ES/25/ws/1/Building Agents with Model Context Protocol]]
33
- [[lastmileAI]]
44
- [[MCP/mcp-agent]]
5-
-
5+
- [[AI/ES/25/ws/2/GraphRAG - Knowledge Graphs for Agents]]
6+
- [[graphrag.com]] https://graphrag.com/
67
- #Filed
78
- [[CLI/Tool/ffmpeg]]
89
- [[CLI/Tool/yt-dlp]]
Lines changed: 119 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -1,16 +1,119 @@
1-
# [Smarter AI with GraphRAG – Connecting Structured & Unstructured Data for Better Retrieval](https://www.ai.engineer/summit/2025/schedule/smarter-ai-with-graphrag-connecting-structured-and-unstructured-data-for-better-retrieval)
2-
- **Date**: February 22, 2025
3-
- **Time**: 12:30 PM
4-
- Generative AI is only as good as the data it retrieves—but traditional RAG (Retrieval-Augmented Generation) is limited by flat, disconnected search. AI models struggle to connect structured data (databases, APIs) with unstructured data (docs, PDFs, chat logs), leading to incomplete answers and hallucinations.
5-
- Enter GraphRAG—a next-generation retrieval approach powered by Neo4j. This workshop will show you how graphs unlock deeper context, better reasoning, and enterprise-ready AI applications. No graph experience needed!
6-
- You’ll learn how to:
7-
- Integrate structured and unstructured data into a unified retrieval system.
8-
Use GraphRAG to improve accuracy, explainability, and trust in AI-generated answers.
9-
Leverage graph-native retrieval, embeddings, and hybrid search for better responses.
10-
Apply GraphRAG to real-world use cases, from enterprise AI copilots to knowledge-driven automation.
11-
Whether you’re building AI-powered search, chatbots, or intelligent assistants—GraphRAG makes AI retrieval smarter. Join us to learn how Neo4j can take your Generative AI applications to the next level.
12-
- [[Person/Alison Cossette]]
13-
- Alison Cossette is a dynamic Data Science Strategist, Educator, and Podcast Host. As a Developer Advocate at Neo4j specializing in Graph Data Science, she brings a wealth of expertise to the field. With her strong technical background and exceptional communication skills, Alison bridges the gap between complex data science concepts and practical applications. Alison’s passion for responsible AI shines through in her work. She actively promotes ethical and transparent AI practices and believes in the transformative potential of responsible AI for industries and society. Through her engagements with industry professionals, policymakers, and the public, she advocates for the responsible development and deployment of AI technologies. She is currently a Volunteer Member of the US Department of Commerce - National Institute of Standards and Technology’s Generative AI Public Working Group Alison’s academic journey includes Masters of Science in Data Science studies, specializing in Artificial Intelligence, at Northwestern University and research with Stanford University Human-Computer Interaction Crowd Research Collective. Alison combines academic knowledge with real-world experience. She leverages this expertise to educate and empower individuals and organizations in the field of data science. Overall, Alison Cossette’s multifaceted background, commitment to responsible AI, and expertise in data science make her a respected figure in the field. Through her role as a Developer Advocate at Neo4j and her podcast, she continues to drive innovation, education, and responsible practices in the exciting realm of data science and AI.
14-
- Alison Cossette, Data Science Strategist, Advocate, Educator
15-
LinkedIn
16-
Twitter
1+
tags:: [[Neo4j]]
2+
created-by:: [[Person/Alison Cossette]]
3+
4+
- # [Smarter AI with GraphRAG – Connecting Structured & Unstructured Data for Better Retrieval](https://www.ai.engineer/summit/2025/schedule/smarter-ai-with-graphrag-connecting-structured-and-unstructured-data-for-better-retrieval)
5+
- Summary
6+
collapsed:: true
7+
- **Date**: February 22, 2025
8+
- **Time**: 12:30 PM
9+
- Generative AI is only as good as the data it retrieves—but traditional RAG (Retrieval-Augmented Generation) is limited by flat, disconnected search. AI models struggle to connect structured data (databases, APIs) with unstructured data (docs, PDFs, chat logs), leading to incomplete answers and hallucinations.
10+
- Enter GraphRAG—a next-generation retrieval approach powered by Neo4j. This workshop will show you how graphs unlock deeper context, better reasoning, and enterprise-ready AI applications. No graph experience needed!
11+
- You’ll learn how to:
12+
- Integrate structured and unstructured data into a unified retrieval system.
13+
Use GraphRAG to improve accuracy, explainability, and trust in AI-generated answers.
14+
Leverage graph-native retrieval, embeddings, and hybrid search for better responses.
15+
Apply GraphRAG to real-world use cases, from enterprise AI copilots to knowledge-driven automation.
16+
Whether you’re building AI-powered search, chatbots, or intelligent assistants—GraphRAG makes AI retrieval smarter. Join us to learn how Neo4j can take your Generative AI applications to the next level.
17+
- [[Person/Alison Cossette]]
18+
- Alison Cossette is a dynamic Data Science Strategist, Educator, and Podcast Host. As a Developer Advocate at Neo4j specializing in Graph Data Science, she brings a wealth of expertise to the field. With her strong technical background and exceptional communication skills, Alison bridges the gap between complex data science concepts and practical applications. Alison’s passion for responsible AI shines through in her work. She actively promotes ethical and transparent AI practices and believes in the transformative potential of responsible AI for industries and society. Through her engagements with industry professionals, policymakers, and the public, she advocates for the responsible development and deployment of AI technologies. She is currently a Volunteer Member of the US Department of Commerce - National Institute of Standards and Technology’s Generative AI Public Working Group Alison’s academic journey includes Masters of Science in Data Science studies, specializing in Artificial Intelligence, at Northwestern University and research with Stanford University Human-Computer Interaction Crowd Research Collective. Alison combines academic knowledge with real-world experience. She leverages this expertise to educate and empower individuals and organizations in the field of data science. Overall, Alison Cossette’s multifaceted background, commitment to responsible AI, and expertise in data science make her a respected figure in the field. Through her role as a Developer Advocate at Neo4j and her podcast, she continues to drive innovation, education, and responsible practices in the exciting realm of data science and AI.
19+
- Alison Cossette, Data Science Strategist, Advocate, Educator
20+
LinkedIn
21+
Twitter
22+
- form: https://docs.google.com/forms/d/e/1FAIpQLScKDk4bwnHLBKhTZXJ4XP-YBkGLgUh3B1C7-mc38jC7r0BAUQ/viewform
23+
- ## Intro
24+
- problems with #RAG
25+
- how do I handle
26+
- relative information
27+
- temporal information
28+
- you can ground chunks in relevant context
29+
- try to find ways that you can connect the types of storage you have
30+
- vector dbs
31+
- unstructured data, structured data
32+
- the info you need to get back is rarely just the chunk
33+
- we want chains of thought
34+
- we want to give insight into how we cnn connect the pieces of data into one thing
35+
- we are considering #MCP to feed *into* a knowledge graph
36+
- ## Graph db theory - Neo4j Graph Components
37+
- ![image.png](../assets/image_1740246298715_0.png)
38+
- This slide explains **Neo4j Graph Components**, breaking them down into:
39+
- **Nodes**: Represent entities in the graph (e.g., a person or a car).
40+
- **Relationships**: Represent associations or interactions between nodes (e.g., "KNOWS," "LIVES WITH," "OWNS").
41+
- **Properties**: Attributes of nodes or relationships, including metadata like names, birthdates, or embeddings.
42+
- ### Example from the slide:
43+
- **Person ("Andre")** knows **Person ("Mica")**.
44+
- **Andre lives with Mica** and has a Twitter handle `@dan`.
45+
- **Andre drives a Volvo V70**, which has properties like brand, model, and description embedding.
46+
- This structure highlights how **Neo4j stores interconnected data efficiently**—a key advantage of graph databases over relational models.
47+
- terms
48+
- nodes
49+
- the nouns
50+
- relationships
51+
- interactions
52+
- properties
53+
- of nodes or relationships
54+
- *the vector is a FEATURE of the relationship*
55+
- in the image above, notice the `DescEmbedding`
56+
- ### [[graphrag.com]] is their recommendation for learning
57+
- ### Terms
58+
- #### Domain Graph
59+
- it could be that domain structured knowledge is extracted from unstructured
60+
- #### Lexical Graph
61+
- unstructured data
62+
- ![image.png](../assets/image_1740246700998_0.png)
63+
- This slide illustrates how a **Knowledge Graph** can be structured, separating it into two main components:
64+
- ### **1. Domain Graph**
65+
- Represents **topics** in a structured way.
66+
- Shows how **topics are extracted** from documents.
67+
- ### **2. Lexical Graph**
68+
- Represents **documents** and their granular components.
69+
- Documents are broken into **chunks** (e.g., paragraphs, sections).
70+
- A relationship (`HAS_CHUNK`) connects a document to its respective chunks.
71+
- ### **Key Takeaways**
72+
- **Knowledge Graphs** integrate domain-specific relationships (topics) with textual content (documents and chunks).
73+
- This structure enables **contextual linking** between extracted knowledge and text sources, useful for **Retrieval-Augmented Generation (RAG)**.
74+
- ### knowledge graph -> memory graph
75+
- ![image.png](../assets/image_1740246723611_0.png)
76+
- ### knowledge graph with domain, lexical and memory graph
77+
- ![image.png](../assets/image_1740246865344_0.png)
78+
- cgpt-notes
79+
- This slide presents a **comprehensive Knowledge Graph structure**, integrating **Domain Graph, Lexical Graph, and Memory Graph**.
80+
- ### **1. Domain Graph (Structured Knowledge)**
81+
- Represents **entities and relationships**.
82+
- Example:
83+
- **Entity Type A** → relates to → **Entity Type B**.
84+
- **Entity Type B** → PRODUCES → **Document**.
85+
- **Entity Type C** → HAS_ENTITY → **Document**.
86+
- ### **2. Lexical Graph (Unstructured Knowledge)**
87+
- Links **documents** to their **chunks** (smaller text segments).
88+
- Example:
89+
- **Document** → HAS_CHUNK → **Chunk**.
90+
- ### **3. Memory Graph (Application-Level Interactions)**
91+
- Represents user interactions within a system.
92+
- Example:
93+
- **User** → OPENS → **Session**.
94+
- **Session** → CONTAIN → **Prompt**.
95+
- **Prompt** → NEXT → **Response**.
96+
- **Prompt** → RETRIEVES → **Chunk**.
97+
- **Response** → INCLUDES → **Chunk**.
98+
- ### **Key Takeaways**
99+
- This model **bridges structured and unstructured data**, making it ideal for **LLM-powered Retrieval-Augmented Generation (RAG)**.
100+
- **Memory Graph** ensures **context retention** by linking user prompts to relevant knowledge chunks.
101+
- **Lexical and Domain Graphs** provide structured and unstructured content retrieval for AI applications.
102+
103+
This setup is well-suited for **AI-driven search, chatbots, or recommendation systems**.
104+
- #### example
105+
- which companies are susceptible to a lithium shortage
106+
- which *asset managers* are vulnerable to a lithium shortage
107+
- who are the managers that own those companies
108+
- getting the answer you need will cross the traversal from the lexical into the domain
109+
- *this is how humans answer questions*
110+
- what we haven't had an ability to do so far, and what we are challenged by is how do we provide that meta moment where we get the EXACT retrievals that we need
111+
- #### example
112+
- data science in rag apps
113+
- what are the subjects that people are communicating on
114+
- what are the areas of knowledge that they are interacting on
115+
- they came in asking about this and they left asking about that
116+
- having this understanding ... as a builder itmmight not be important to you, but to product owner, highly relevant
117+
-
118+
-
119+
-

0 commit comments

Comments
 (0)