This project lets you upload PDF documents and ask questions about their content using AI. It's like having a conversation with your documents 😄
Built as a self-learning project, it uses Retrieval-Augmented Generation (RAG) to find relevant information in your PDFs and generate accurate answers. The simple web interface lets you upload files and start chatting right away.
Under the hood, it includes features such as self-reflection (the AI double-checks its own answers) and several retrieval strategies for finding the most relevant information in your documents.
- Automatic Chunking: Upload PDFs that are automatically split into manageable chunks with overlap
- Smart Embeddings: Utilizes OpenAI's embeddings for semantic understanding
- Source Tracking: Preserves document metadata for accurate source attribution
- Stuff - Fast and simple: concatenates all relevant chunks into a single context
- Map Reduce - Processes chunks separately then combines results for comprehensive answers
- Refine - Iteratively improves answers by refining with each relevant chunk
- Contextual Compression: Optional LLM-based compression to focus on the most relevant content
- Self-Reflection:
- Automatically evaluates answer quality
- Regenerates responses that don't meet quality thresholds
- Provides transparent reasoning for evaluations
- Conversational Memory: Maintains context across questions for natural follow-ups
- Langfuse & LangSmith Integration: Detailed monitoring and analytics of LLM usage
- Web Search: Augments answers with real-time information when needed
- Performance Tracking: Monitors response quality and retrieval effectiveness
- Usage Analytics: Tracks user interactions and system performance
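The automatic chunking step above can be sketched in plain Python. This is only an illustration of splitting with overlap; the real project presumably uses a LangChain text splitter, and the function name and default sizes here are assumptions:

```python
def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into fixed-size chunks, where each chunk overlaps the
    previous one by `overlap` characters so context is not lost at
    chunk boundaries."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    chunks = []
    step = chunk_size - overlap  # advance less than a full chunk to create overlap
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break
    return chunks
```

Each chunk would then be embedded and stored in the vector database together with its source metadata.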
- Backend: Python 3.9+
- LLM: OpenAI GPT-3.5-turbo
- Document Processing: PyMuPDF, LangChain
- Vector Database: Chroma DB
- Analytics: Langfuse, LangSmith
- Search: Tavily Search API
- Web Framework: Streamlit
- Containerization: Docker
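The self-reflection feature listed above can be sketched as a simple evaluate-and-retry wrapper. The `generate` and `evaluate` callables stand in for real LLM calls, and the 0-to-1 scoring scale and default threshold are illustrative assumptions:

```python
from typing import Callable

def answer_with_reflection(
    question: str,
    generate: Callable[[str], str],
    evaluate: Callable[[str, str], float],
    threshold: float = 0.7,
    max_attempts: int = 3,
) -> tuple[str, float]:
    """Generate an answer, score it, and regenerate until the score meets
    the quality threshold or attempts run out. Returns the best answer
    seen together with its score."""
    best_answer, best_score = "", -1.0
    for _ in range(max_attempts):
        answer = generate(question)
        score = evaluate(question, answer)
        if score > best_score:
            best_answer, best_score = answer, score
        if score >= threshold:  # good enough: stop regenerating
            break
    return best_answer, best_score
```

Keeping the best-scoring answer means that even when no attempt clears the threshold, the user still gets the strongest response seen.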
.
├── app/ # Main application directory
│ ├── chroma_db/ # Chroma vector database storage directory (will be created on startup)
│ ├── pdfs/ # PDF document storage directory (uploaded PDFs will be stored here)
│ ├── .env # Environment variables (must be copied from .env.example and filled in)
│ ├── .env.example # Example environment variables
│ ├── analytics.py # Analytics and metrics dashboard
│ ├── app.py # Main Streamlit application
│ ├── config.py # Configuration settings
│ ├── eval_results.csv # Evaluation results storage file (will be created after receiving responses)
│ ├── evaluation.py # Response evaluation logic
│ ├── langfuse_utils.py # Langfuse tracing functions
│ ├── llm_eval_metrics.csv # LLM evaluation metrics storage file (will be created after receiving responses)
│ ├── rag_pipeline.py # Core RAG implementation
│ ├── rag_tools.py # RAG-specific utility functions
│ ├── requirements.txt # Python dependencies
│ ├── self_reflection.py # Self-reflection implementation
│ └── web_tools.py # Web search tools integration
├── docker-compose.yml # Docker Compose configuration
├── Dockerfile # Docker configuration for the application
├── README.md # Project documentation
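rag_pipeline.py holds the retrieval core. The underlying idea of semantic retrieval can be sketched without a vector database by ranking stored chunks by cosine similarity; the toy bag-of-words `embed` function below is only a stand-in for real OpenAI embeddings:

```python
import math
from collections import Counter

def embed(text: str) -> dict[str, float]:
    """Toy embedding: a bag-of-words vector. A real pipeline would call
    an embedding model instead."""
    return dict(Counter(text.lower().split()))

def cosine(a: dict[str, float], b: dict[str, float]) -> float:
    """Cosine similarity between two sparse vectors."""
    dot = sum(a[k] * b.get(k, 0.0) for k in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def retrieve(query: str, chunks: list[str], k: int = 2) -> list[str]:
    """Return the k chunks most similar to the query."""
    q = embed(query)
    return sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)[:k]
```

Chroma performs essentially this ranking, but over dense embedding vectors and with an index that avoids comparing the query against every stored chunk.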
- Docker and Docker Compose
- OpenAI API key (for LLM usage)
- Langfuse credentials (for analytics)
- LangChain API key (for LangSmith analytics)
- Tavily API key (for web search)
- Clone the repository:

  ```bash
  git clone https://github.com/igorsuhinin/rag-pdf-qa.git
  cd rag-pdf-qa
  ```

- Create an `.env` file in the `app` directory with your API keys:

  ```bash
  cd app
  cp .env.example .env
  ```

  Then edit the `.env` file and fill in your API keys:

  ```
  OPENAI_API_KEY=sk-proj-...
  LANGCHAIN_API_KEY=lsv2_pt_...
  LANGFUSE_SECRET_KEY=sk-lf-...
  LANGFUSE_PUBLIC_KEY=pk-lf-...
  TAVILY_API_KEY=tvly-dev-...
  ```
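config.py presumably reads these values at startup; a minimal fail-fast reader might look like the sketch below (the helper name `load_required` is illustrative, not taken from the project):

```python
import os

def load_required(name: str) -> str:
    """Return a required environment variable, raising a clear error at
    startup if it is missing or empty rather than failing mid-request."""
    value = os.environ.get(name, "").strip()
    if not value:
        raise RuntimeError(f"Missing required environment variable: {name}")
    return value
```

Failing fast here makes a forgotten key show up as one obvious startup error instead of a confusing failure on the first question.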
The application is configured to run with Docker Compose, which will set up all necessary services.
- Build and start the containers:

  ```bash
  docker-compose up --build
  ```

- The application will be available at: http://localhost:8501

- To stop the application, press `Ctrl+C` in the terminal or run:

  ```bash
  docker-compose down
  ```
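The repository ships its own docker-compose.yml; a minimal configuration for a single Streamlit service could look like the sketch below. The port matches the URL above, but the container paths and volume mounts are assumptions, not the project's actual file:

```yaml
services:
  app:
    build: .
    ports:
      - "8501:8501"        # Streamlit's default port, matching the URL above
    env_file:
      - ./app/.env         # API keys created in the installation step
    volumes:
      - ./app/pdfs:/app/pdfs            # persist uploaded PDFs (assumed path)
      - ./app/chroma_db:/app/chroma_db  # persist the vector store (assumed path)
```

Mounting the `pdfs` and `chroma_db` directories keeps uploads and embeddings across container restarts.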
If you prefer to run the application locally:
- Ensure you have Python 3.9+ installed
- Create and activate a virtual environment:

  ```bash
  python -m venv venv
  # On Windows: .\venv\Scripts\activate
  # On macOS/Linux: source venv/bin/activate
  ```

- Install dependencies:

  ```bash
  pip install -r app/requirements.txt
  ```

- Run the application:

  ```bash
  cd app
  streamlit run app.py
  ```

- Access the application at: http://localhost:8501
This project is licensed under the MIT License.