|
1 | | -# [Smarter AI with GraphRAG – Connecting Structured & Unstructured Data for Better Retrieval](https://www.ai.engineer/summit/2025/schedule/smarter-ai-with-graphrag-connecting-structured-and-unstructured-data-for-better-retrieval) |
2 | | -- **Date**: February 22, 2025 |
3 | | -- **Time**: 12:30 PM |
4 | | -- Generative AI is only as good as the data it retrieves—but traditional RAG (Retrieval-Augmented Generation) is limited by flat, disconnected search. AI models struggle to connect structured data (databases, APIs) with unstructured data (docs, PDFs, chat logs), leading to incomplete answers and hallucinations. |
5 | | -- Enter GraphRAG—a next-generation retrieval approach powered by Neo4j. This workshop will show you how graphs unlock deeper context, better reasoning, and enterprise-ready AI applications. No graph experience needed! |
6 | | -- You’ll learn how to: |
7 | | -- Integrate structured and unstructured data into a unified retrieval system. |
8 | | - Use GraphRAG to improve accuracy, explainability, and trust in AI-generated answers. |
9 | | - Leverage graph-native retrieval, embeddings, and hybrid search for better responses. |
10 | | - Apply GraphRAG to real-world use cases, from enterprise AI copilots to knowledge-driven automation. |
11 | | - Whether you’re building AI-powered search, chatbots, or intelligent assistants—GraphRAG makes AI retrieval smarter. Join us to learn how Neo4j can take your Generative AI applications to the next level. |
12 | | -- [[Person/Alison Cossette]] |
13 | | - - Alison Cossette is a dynamic Data Science Strategist, Educator, and Podcast Host. As a Developer Advocate at Neo4j specializing in Graph Data Science, she brings a wealth of expertise to the field. With her strong technical background and exceptional communication skills, Alison bridges the gap between complex data science concepts and practical applications. Alison’s passion for responsible AI shines through in her work. She actively promotes ethical and transparent AI practices and believes in the transformative potential of responsible AI for industries and society. Through her engagements with industry professionals, policymakers, and the public, she advocates for the responsible development and deployment of AI technologies. She is currently a Volunteer Member of the US Department of Commerce - National Institute of Standards and Technology’s Generative AI Public Working Group Alison’s academic journey includes Masters of Science in Data Science studies, specializing in Artificial Intelligence, at Northwestern University and research with Stanford University Human-Computer Interaction Crowd Research Collective. Alison combines academic knowledge with real-world experience. She leverages this expertise to educate and empower individuals and organizations in the field of data science. Overall, Alison Cossette’s multifaceted background, commitment to responsible AI, and expertise in data science make her a respected figure in the field. Through her role as a Developer Advocate at Neo4j and her podcast, she continues to drive innovation, education, and responsible practices in the exciting realm of data science and AI. |
14 | | -- Alison Cossette, Data Science Strategist, Advocate, Educator |
15 | | - LinkedIn |
16 | | - Twitter |
| 1 | +tags:: [[Neo4j]] |
| 2 | +created-by:: [[Person/Alison Cossette]] |
| 3 | + |
| 4 | +- # [Smarter AI with GraphRAG – Connecting Structured & Unstructured Data for Better Retrieval](https://www.ai.engineer/summit/2025/schedule/smarter-ai-with-graphrag-connecting-structured-and-unstructured-data-for-better-retrieval) |
| 5 | + - Summary |
| 6 | + collapsed:: true |
| 7 | + - **Date**: February 22, 2025 |
| 8 | + - **Time**: 12:30 PM |
| 9 | + - Generative AI is only as good as the data it retrieves—but traditional RAG (Retrieval-Augmented Generation) is limited by flat, disconnected search. AI models struggle to connect structured data (databases, APIs) with unstructured data (docs, PDFs, chat logs), leading to incomplete answers and hallucinations. |
| 10 | + - Enter GraphRAG—a next-generation retrieval approach powered by Neo4j. This workshop will show you how graphs unlock deeper context, better reasoning, and enterprise-ready AI applications. No graph experience needed! |
| 11 | + - You’ll learn how to: |
| 12 | + - Integrate structured and unstructured data into a unified retrieval system. |
| 13 | + Use GraphRAG to improve accuracy, explainability, and trust in AI-generated answers. |
| 14 | + Leverage graph-native retrieval, embeddings, and hybrid search for better responses. |
| 15 | + Apply GraphRAG to real-world use cases, from enterprise AI copilots to knowledge-driven automation. |
| 16 | + Whether you’re building AI-powered search, chatbots, or intelligent assistants—GraphRAG makes AI retrieval smarter. Join us to learn how Neo4j can take your Generative AI applications to the next level. |
| 17 | + - [[Person/Alison Cossette]] |
| 18 | + - Alison Cossette is a dynamic Data Science Strategist, Educator, and Podcast Host. As a Developer Advocate at Neo4j specializing in Graph Data Science, she brings a wealth of expertise to the field. With her strong technical background and exceptional communication skills, Alison bridges the gap between complex data science concepts and practical applications. Alison’s passion for responsible AI shines through in her work. She actively promotes ethical and transparent AI practices and believes in the transformative potential of responsible AI for industries and society. Through her engagements with industry professionals, policymakers, and the public, she advocates for the responsible development and deployment of AI technologies. She is currently a Volunteer Member of the US Department of Commerce - National Institute of Standards and Technology’s Generative AI Public Working Group Alison’s academic journey includes Masters of Science in Data Science studies, specializing in Artificial Intelligence, at Northwestern University and research with Stanford University Human-Computer Interaction Crowd Research Collective. Alison combines academic knowledge with real-world experience. She leverages this expertise to educate and empower individuals and organizations in the field of data science. Overall, Alison Cossette’s multifaceted background, commitment to responsible AI, and expertise in data science make her a respected figure in the field. Through her role as a Developer Advocate at Neo4j and her podcast, she continues to drive innovation, education, and responsible practices in the exciting realm of data science and AI. |
| 19 | + - Alison Cossette, Data Science Strategist, Advocate, Educator |
| 20 | + LinkedIn |
| 21 | + Twitter |
| 22 | + - form: https://docs.google.com/forms/d/e/1FAIpQLScKDk4bwnHLBKhTZXJ4XP-YBkGLgUh3B1C7-mc38jC7r0BAUQ/viewform |
| 23 | + - ## Intro |
| 24 | + - problems with #RAG |
| 25 | + - how do I handle |
| 26 | + - relative information |
| 27 | + - temporal information |
| 28 | + - you can ground chunks in relevant context |
| 29 | + - try to find ways that you can connect the types of storage you have |
| 30 | + - vector dbs |
| 31 | + - unstructured data, structured data |
| 32 | + - the info you need to get back is rarely just the chunk |
| 33 | + - we want chains of thought |
| 34 | + - we want to give insight into how we cnn connect the pieces of data into one thing |
| 35 | + - we are considering #MCP to feed *into* a knowledge graph |
| 36 | + - ## Graph db theory - Neo4j Graph Components |
| 37 | + -  |
| 38 | + - This slide explains **Neo4j Graph Components**, breaking them down into: |
| 39 | + - **Nodes**: Represent entities in the graph (e.g., a person or a car). |
| 40 | + - **Relationships**: Represent associations or interactions between nodes (e.g., "KNOWS," "LIVES WITH," "OWNS"). |
| 41 | + - **Properties**: Attributes of nodes or relationships, including metadata like names, birthdates, or embeddings. |
| 42 | + - ### Example from the slide: |
| 43 | + - **Person ("Andre")** knows **Person ("Mica")**. |
| 44 | + - **Andre lives with Mica** and has a Twitter handle `@dan`. |
| 45 | + - **Andre drives a Volvo V70**, which has properties like brand, model, and description embedding. |
| 46 | + - This structure highlights how **Neo4j stores interconnected data efficiently**—a key advantage of graph databases over relational models. |
| 47 | + - terms |
| 48 | + - nodes |
| 49 | + - the nouns |
| 50 | + - relationships |
| 51 | + - interactions |
| 52 | + - properties |
| 53 | + - of nodes or relationships |
| 54 | + - *the vector is a FEATURE of the relationship* |
| 55 | + - in the image above, notice the `DescEmbedding` |
| 56 | + - ### [[graphrag.com]] is their recommendation for learning |
| 57 | + - ### Terms |
| 58 | + - #### Domain Graph |
| 59 | + - it could be that domain structured knowledge is extracted from unstructured |
| 60 | + - #### Lexical Graph |
| 61 | + - unstructured data |
| 62 | + -  |
| 63 | + - This slide illustrates how a **Knowledge Graph** can be structured, separating it into two main components: |
| 64 | + - ### **1. Domain Graph** |
| 65 | + - Represents **topics** in a structured way. |
| 66 | + - Shows how **topics are extracted** from documents. |
| 67 | + - ### **2. Lexical Graph** |
| 68 | + - Represents **documents** and their granular components. |
| 69 | + - Documents are broken into **chunks** (e.g., paragraphs, sections). |
| 70 | + - A relationship (`HAS_CHUNK`) connects a document to its respective chunks. |
| 71 | + - ### **Key Takeaways** |
| 72 | + - **Knowledge Graphs** integrate domain-specific relationships (topics) with textual content (documents and chunks). |
| 73 | + - This structure enables **contextual linking** between extracted knowledge and text sources, useful for **Retrieval-Augmented Generation (RAG)**. |
| 74 | + - ### knowledge graph -> memory graph |
| 75 | + -  |
| 76 | + - ### knowledge graph with domain, lexical and memory graph |
| 77 | + -  |
| 78 | + - cgpt-notes |
| 79 | + - This slide presents a **comprehensive Knowledge Graph structure**, integrating **Domain Graph, Lexical Graph, and Memory Graph**. |
| 80 | + - ### **1. Domain Graph (Structured Knowledge)** |
| 81 | + - Represents **entities and relationships**. |
| 82 | + - Example: |
| 83 | + - **Entity Type A** → relates to → **Entity Type B**. |
| 84 | + - **Entity Type B** → PRODUCES → **Document**. |
| 85 | + - **Entity Type C** → HAS_ENTITY → **Document**. |
| 86 | + - ### **2. Lexical Graph (Unstructured Knowledge)** |
| 87 | + - Links **documents** to their **chunks** (smaller text segments). |
| 88 | + - Example: |
| 89 | + - **Document** → HAS_CHUNK → **Chunk**. |
| 90 | + - ### **3. Memory Graph (Application-Level Interactions)** |
| 91 | + - Represents user interactions within a system. |
| 92 | + - Example: |
| 93 | + - **User** → OPENS → **Session**. |
| 94 | + - **Session** → CONTAIN → **Prompt**. |
| 95 | + - **Prompt** → NEXT → **Response**. |
| 96 | + - **Prompt** → RETRIEVES → **Chunk**. |
| 97 | + - **Response** → INCLUDES → **Chunk**. |
| 98 | + - ### **Key Takeaways** |
| 99 | + - This model **bridges structured and unstructured data**, making it ideal for **LLM-powered Retrieval-Augmented Generation (RAG)**. |
| 100 | + - **Memory Graph** ensures **context retention** by linking user prompts to relevant knowledge chunks. |
| 101 | + - **Lexical and Domain Graphs** provide structured and unstructured content retrieval for AI applications. |
| 102 | + |
| 103 | + This setup is well-suited for **AI-driven search, chatbots, or recommendation systems**. |
| 104 | + - #### example |
| 105 | + - which companies are susceptible to a lithium shortage |
| 106 | + - which *asset managers* are vulnerable to a lithium shortage |
| 107 | + - who are the managers that own those companies |
| 108 | + - getting the answer you need will cross the traversal from the lexical into the domain |
| 109 | + - *this is how humans answer questions* |
| 110 | + - what we haven't had an ability to do so far, and what we are challenged by is how do we provide that meta moment where we get the EXACT retrievals that we need |
| 111 | + - #### example |
| 112 | + - data science in rag apps |
| 113 | + - what are the subjects that people are communicating on |
| 114 | + - what are the areas of knowledge that they are interacting on |
| 115 | + - they came in asking about this and they left asking about that |
| 116 | + - having this understanding ... as a builder itmmight not be important to you, but to product owner, highly relevant |
| 117 | + - |
| 118 | + - |
| 119 | + - |
0 commit comments