Agentic RAG Assistant

An AI-powered document Q&A system that lets you upload PDF documents and ask questions about them. Built with LlamaIndex, Groq, and Gradio.

Features

Multi-Document Support: Upload up to 5 PDF documents
Agentic RAG: Uses FunctionAgent with dynamic tool selection
Per-Document Tools: Each document gets vector search + summary tools
Persistent Storage: Indices are cached to disk for fast startup
Free & Fast: Uses Groq (Llama 4 Scout 17B) + local HuggingFace embeddings

Architecture

User Query
    │
    ▼
┌─────────────────────┐
│   Tool Retriever    │  ← Selects top-k relevant tools
│   (ObjectIndex)     │
└─────────────────────┘
    │
    ▼
┌─────────────────────┐
│   FunctionAgent     │  ← Executes selected tools
│   (Groq LLM)        │
└─────────────────────┘
    │
    ▼
┌─────────────────────┐
│  Document Tools     │
│  - vector_doc1      │  ← Semantic search
│  - summary_doc1     │  ← Summarization
│  - vector_doc2      │
│  - ...              │
└─────────────────────┘
    │
    ▼
   Response

Tools

Vector Search Tool: Retrieves specific information from documents using semantic similarity
Summary Tool: Generates summaries of entire documents using tree summarization

Models Used

LLM: Groq Llama 4 Scout 17B (free, fast inference, higher rate limits)
Embeddings: HuggingFace BGE-small-en-v1.5 (local, no API needed)
Framework: LlamaIndex with FunctionAgent

Installation

Clone the repository:

git clone https://github.com/cloudchristina/Agentic-RAG.git
cd Agentic-RAG

Install dependencies:

pip install -r requirements.txt

Set up environment variables:

cp .env.example .env
# Edit .env and add your GROQ_API_KEY

Get a free Groq API key:
- Visit https://console.groq.com/keys
- Create an account and generate an API key

Usage

Run the application:

python app.py

Open http://127.0.0.1:7860 in your browser
Upload PDF documents (max 5) and start asking questions!

Example Queries

"What is the main contribution of this paper?"
"Summarize the methodology section"
"Compare the approaches in document A and document B"

Project Structure

agentic_rag/
├── app.py              # Gradio UI application
├── agent.py            # FunctionAgent setup
├── indexer.py          # Indexing and tool creation
├── helper.py           # Environment utilities
├── requirements.txt    # Python dependencies
├── .env.example        # Environment template
├── data/               # Uploaded PDF documents
└── storage/            # Persisted document indices

How It Works

Document Processing: PDFs are loaded and split into chunks (1024 tokens)
Index Creation: Each document gets a VectorStoreIndex and SummaryIndex
Tool Generation: Two tools per document (vector search + summary)
Tool Index: ObjectIndex enables dynamic tool selection based on query
Agent Execution: FunctionAgent uses relevant tools to answer queries

Test

Download sample PDFs to test the application:

mkdir -p test/
wget -O test/metagpt.pdf "https://openreview.net/pdf?id=VtmBAGCN7o"
wget -O test/longlora.pdf "https://openreview.net/pdf?id=6PmJoRfdaK"
wget -O test/loftq.pdf "https://openreview.net/pdf?id=LzPWWPAdY4"
wget -O test/swebench.pdf "https://openreview.net/pdf?id=VTF8yNQM66"
wget -O test/selfrag.pdf "https://openreview.net/pdf?id=hSyW5go0v8"
# wget -O test/zipformer.pdf "https://openreview.net/pdf?id=9WD9KwssyT"
# wget -O test/values.pdf "https://openreview.net/pdf?id=yV6fD7LYkF"
# wget -O test/finetune_fair_diffusion.pdf "https://openreview.net/pdf?id=hnrB5YHoYu"
# wget -O test/knowledge_card.pdf "https://openreview.net/pdf?id=WbWtOYIzIK"
# wget -O test/metra.pdf "https://openreview.net/pdf?id=c5pwL0Soay"
# wget -O test/vr_mcl.pdf "https://openreview.net/pdf?id=TpD2aG1h0D"

Then run python app.py and try queries like:

"What is MetaGPT?"
"Summarize zipformer"
"Tell me about the evaluation dataset used in MetaGPT and compare it against longlora"

Requirements

Python 3.10+
Groq API key (free tier available)
~500MB disk space for embeddings model

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Agentic RAG Assistant

Features

Architecture

Tools

Models Used

Installation

Usage

Example Queries

Project Structure

How It Works

Test

Requirements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
agent.py		agent.py
app.py		app.py
helper.py		helper.py
indexer.py		indexer.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Agentic RAG Assistant

Features

Architecture

Tools

Models Used

Installation

Usage

Example Queries

Project Structure

How It Works

Test

Requirements

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages