Context Retrieval Persona #13

jonahjung22 · 2025-07-17T17:12:22Z

Problem

Professionals working in Jupyter notebooks often need relevant examples, documentation, and best practices while coding, but existing solutions require searching documentation manually or switching between multiple tools. Users need a context-aware assistant that can analyze the current notebook to understand its technical context, run semantic search over comprehensive data science resources, and generate structured reports with relevant code examples and next steps.

Solution

This persona combines a multi-agent team with retrieval-augmented generation (RAG) to provide context-aware assistance. It uses a three-agent architecture: NotebookAnalyzer extracts the libraries used and the analysis stage from a notebook, KnowledgeSearcher performs semantic search over the Python Data Science Handbook using RAG, and MarkdownGenerator creates a comprehensive, actionable report.
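To make the first stage concrete, here is a minimal sketch of how library extraction from a notebook could work. It is not the code in this PR: the function name `extract_libraries` is hypothetical, and it assumes the notebook is available as standard nbformat JSON. It parses each code cell with `ast` and collects the top-level package names from import statements.

```python
import ast
import json


def extract_libraries(notebook_json: str) -> list[str]:
    """Return the sorted top-level packages imported in a notebook's code cells."""
    notebook = json.loads(notebook_json)
    libraries = set()
    for cell in notebook.get("cells", []):
        if cell.get("cell_type") != "code":
            continue
        source = cell.get("source", "")
        # nbformat stores source either as one string or as a list of lines.
        if isinstance(source, list):
            source = "".join(source)
        try:
            tree = ast.parse(source)
        except SyntaxError:
            # Cells containing IPython magics are not valid Python; skip them.
            continue
        for node in ast.walk(tree):
            if isinstance(node, ast.Import):
                libraries.update(alias.name.split(".")[0] for alias in node.names)
            elif isinstance(node, ast.ImportFrom) and node.module and node.level == 0:
                libraries.add(node.module.split(".")[0])
    return sorted(libraries)
```

A cell that mixes magics with imports is skipped entirely in this sketch, so a real analyzer would probably strip magic lines before parsing.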

Changes

Code: Implemented the RAG and file-reading tools as well as the main persona.
Tests: Included a test_rag_integration.py file that helps users set up the RAG system. You can experiment with the persona using the test_context_retrieval.ipynb notebook.
Docs: See README.md for detailed instructions and information.

Testing Instructions

Test Notebook: Open test_context_retrieval.ipynb and follow the test cases.
RAG Integration: Run the test_rag_integration() function to verify the RAG system, which auto-clones the handbook and builds the vector store.
Jupyter-AI Chat: Test with "@ContextRetriever help me with pandas operations" or "@ContextRetriever notebook: /path/to/test_context_retrieval.ipynb".
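The second chat form passes a notebook path after a `notebook:` prefix. As a rough illustration of how such a message could be parsed, the sketch below pulls the path out with a regular expression. The helper name `extract_notebook_path` and the pattern are assumptions for illustration, not the persona's actual parsing logic, and paths containing spaces are not handled.

```python
import re
from typing import Optional

# Hypothetical pattern: "notebook:" followed by a whitespace-free path ending in .ipynb.
NOTEBOOK_PATTERN = re.compile(r"notebook:\s*(\S+\.ipynb)\b", re.IGNORECASE)


def extract_notebook_path(message: str) -> Optional[str]:
    """Return the notebook path mentioned in a chat message, or None if absent."""
    match = NOTEBOOK_PATTERN.search(message)
    return match.group(1) if match else None
```

A message without the `notebook:` prefix returns `None`, which a caller could treat as a plain question with no notebook context.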

Future Work

I may explore integrating Pocketflow into this persona to provide a simpler and more efficient core graph abstraction.

srdas

Tested the PR and the code works well.

With a notebook in context, the persona collects relevant context from the Python Data Science Handbook. The retrieved context appears relevant; however, we still need to check whether the persona misses anything relevant.
A markdown file is created explaining what was added to the RAG db from the Python Data Science Handbook repo.
The RAG db is created as well.

Items to check:

If additional notebooks are used for context retrieval, does it overwrite or add to the existing vector store?
The markdown file is currently overwritten, but we may want to retain it. It would be better to create a new markdown file for each notebook that is processed, with the title of the notebook included.
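One way to address the second item is to derive the report file name from the notebook name, so that each processed notebook gets its own markdown file. The sketch below is only an illustration: the helper `report_path_for` and the `_context_report.md` suffix are hypothetical, not names from this PR.

```python
from pathlib import Path


def report_path_for(notebook_path: str, output_dir: str = ".") -> Path:
    """Build a per-notebook report path so earlier reports are not overwritten."""
    # Path.stem drops the directory and the .ipynb extension.
    stem = Path(notebook_path).stem
    return Path(output_dir) / f"{stem}_context_report.md"
```

Two notebooks with the same file name in different directories would still collide under this scheme, so a real implementation might add a timestamp or a short hash of the full path.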

Will review the code and leave comments as well.

srdas

Needs some revisions.

jupyter_ai_personas/context_retrieval_persona/README.md

srdas · 2025-08-08T17:39:33Z

jupyter_ai_personas/context_retrieval_persona/README.md

+Modify parameters in `rag_core.py`:
+```python
+rag = PythonDSHandbookRAG(
+    embedding_model="sentence-transformers/all-MiniLM-L6-v2",


Can this be updated to take the chosen embedding model from Jupyter AI? The embedding model would then need to be called using the functions in Jupyter AI.

srdas · 2025-08-08T18:23:11Z

jupyter_ai_personas/context_retrieval_persona/file_reader_tool.py

+            if not os.path.exists(notebook_path):
+                return f"Error: Notebook file not found at {notebook_path}"
+
+            if not notebook_path.endswith('.ipynb'):
+                return f"Error: File must be a .ipynb notebook file, got {notebook_path}"
+


This should be sent to the chat panel, not just printed in the logs.

When I tried this with a .py file instead of a .ipynb notebook, it still ran the context retrieval. Not sure why.

When I gave it a non-existent file, it still ran the RAG, pulling up various pandas notebooks from the PDSH.
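A possible explanation, offered only as a guess, is that the tool returns the error as a plain string and the calling agent carries on with retrieval anyway. The sketch below shows one way to stop early: validate the path first and return the error message to the user without invoking retrieval. The names `notebook_path_error` and `handle_request`, and the `run_retrieval` callback, are hypothetical and stand in for whatever the persona actually uses.

```python
import os
from typing import Callable, Optional


def notebook_path_error(notebook_path: str) -> Optional[str]:
    """Return an error message for an unusable path, or None if the path is fine."""
    if not notebook_path.endswith(".ipynb"):
        return f"Error: File must be a .ipynb notebook file, got {notebook_path}"
    if not os.path.exists(notebook_path):
        return f"Error: Notebook file not found at {notebook_path}"
    return None


def handle_request(notebook_path: str, run_retrieval: Callable[[str], str]) -> str:
    """Run retrieval only for a valid notebook; otherwise reply with the error."""
    error = notebook_path_error(notebook_path)
    if error is not None:
        # The returned string is the chat reply, so the user sees the problem
        # and the RAG step is never reached.
        return error
    return run_retrieval(notebook_path)
```

The design point is that validation happens in ordinary control flow before any agent is invoked, rather than relying on an agent to notice an error string in a tool result.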

jupyter_ai_personas/context_retrieval_persona/persona.py

srdas · 2025-08-08T19:08:55Z

jupyter_ai_personas/context_retrieval_persona/rag_core.py

+        repo_url: str = "https://github.com/jakevdp/PythonDataScienceHandbook.git",
+        local_repo_path: str = None,
+        vector_store_path: str = None,
+        embedding_model: str = "sentence-transformers/all-MiniLM-L6-v2",


Do we have to use this model? Can we take the chosen embedding model from Jupyter AI's config file?
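As a sketch of what this could look like, the helper below reads an embedding model name from a JSON config file and falls back to the current default when the file or key is missing. The config location and the `embedding_model` key are placeholders, not Jupyter AI's actual config schema, and `resolve_embedding_model` is a hypothetical name.

```python
import json
from pathlib import Path

DEFAULT_EMBEDDING_MODEL = "sentence-transformers/all-MiniLM-L6-v2"


def resolve_embedding_model(
    config_path: str,
    key: str = "embedding_model",  # placeholder key, not a confirmed Jupyter AI field
    default: str = DEFAULT_EMBEDDING_MODEL,
) -> str:
    """Return the configured embedding model name, or the default if unavailable."""
    try:
        config = json.loads(Path(config_path).read_text(encoding="utf-8"))
    except (OSError, json.JSONDecodeError):
        # A missing or malformed config file should not break the persona.
        return default
    value = config.get(key) if isinstance(config, dict) else None
    return value if isinstance(value, str) and value else default
```

The result could then be passed as the `embedding_model` argument shown in the diff above, keeping the hard-coded model only as a fallback.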

jupyter_ai_personas/context_retrieval_persona/rag_core.py

jupyter_ai_personas/context_retrieval_persona/rag_integration_tool.py

jonahjung22 added 22 commits July 10, 2025 08:39

New persona integrating jupyter ai tools

57ed47a

Context Retrieval Persona

5caff8f

test context file

461f281

updated toml and wrapper tool

74ed255

Increased RAG chunks, specific md file naming, logging

c2d52ed

Removed unnecessary files

c89afb3

updated the names of the files; updated README with persona new capabilities

cc2b99d

modified toml

2a24e18

building out the context persona using pocketflow

20262e4

new method for rag based approach using pocketflow

772d548

Merge branch 'main' into data_science_persona

55157b7

added test notebook

95b68f8

added greetings

d910e61

Separating 1 persona for each PR

82834fc

cleaned up some code

e5e188b

updated README

ad6a220

added test files

a882287

removing some lines

1ebe1f2

updated persona code and removed unnecessary components

7488430

remove unnecessary comments

af883f9

updated dependencies

cb68c1f

deleted unnecessary folder

46d26cd

srdas reviewed Aug 8, 2025

srdas requested changes Aug 8, 2025

jonahjung22 added 2 commits August 11, 2025 14:54

Changes to the whole RAG structure implemented

dd81447

removed unnecessary file

b7f6eac
