Advanced RAG with Rerankers

This repository demonstrates rerankers as an advanced Retrieval-Augmented Generation (RAG) technique for improving retrieval performance. It shows how reranking can raise the quality of retrieved documents before they are passed to the generation model.

Installation

This project uses uv for fast and reliable dependency management.

Prerequisites

Python 3.10+
uv package manager

Install uv

pip install uv

Install Dependencies

uv sync

This will create a virtual environment and install all required dependencies automatically.

Setup

1. Start Milvus Vector Database

Start the Milvus vector database before running the application:

cd milvus
docker-compose up -d

This will start Milvus using Docker Compose with the configuration provided in milvus/docker-compose.yml.
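
Once the containers are up, it can help to confirm that the database accepts connections before running the pipeline. The following is a minimal sketch using only the Python standard library. It assumes Milvus is exposed on localhost at port 19530 (the Milvus default), which may differ if milvus/docker-compose.yml maps another port. The helper name is_port_open is hypothetical and not part of this repository.

```python
import socket


def is_port_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unresolvable hosts.
        return False


if __name__ == "__main__":
    # Assumption: Milvus listens on its default gRPC port, 19530.
    host, port = "localhost", 19530
    status = "reachable" if is_port_open(host, port) else "not reachable"
    print(f"Milvus at {host}:{port} is {status}")
```

A successful TCP connection only shows that the port is open. Milvus may still need a few more seconds after startup before it serves requests.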

2. Set up Environment Variables

Copy the example environment file and configure your API key:

cp .env.example .env

Then edit the .env file and replace your_openai_api_key_here with your actual OpenAI API key. You can obtain an API key from the OpenAI Platform.
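
To confirm that the key is picked up, the .env file can be read with a few lines of standard-library Python. This is a minimal sketch, not the project's own loading code: the variable name OPENAI_API_KEY is an assumption (check .env.example for the actual name), the function load_env is hypothetical, and the project may load the file differently, for example through a library such as python-dotenv.

```python
import os
from pathlib import Path


def load_env(path: str = ".env") -> dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blanks and # comments."""
    values: dict[str, str] = {}
    for raw in Path(path).read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip("'\"")
    return values


if __name__ == "__main__":
    env_file = Path(".env")
    env = load_env(str(env_file)) if env_file.exists() else {}
    # Assumption: the key is stored under OPENAI_API_KEY.
    key = env.get("OPENAI_API_KEY") or os.environ.get("OPENAI_API_KEY", "")
    if not key or key == "your_openai_api_key_here":
        print("OpenAI API key is missing or still set to the placeholder.")
    else:
        print("OpenAI API key found.")
```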

Usage

To run the complete pipeline, execute:

uv run src/main.py

The pipeline runs the following steps, including the reranking evaluation:

1. Data Download: Downloads EUR-Lex legal documents
2. Vector Database Setup: Creates embeddings using sentence transformers and stores them in Milvus
3. Evaluation Dataset Generation: Generates question-answer pairs from selected documents using OpenAI's LLM
4. Cross-Encoder Reranking: Initializes a BAAI/bge-reranker model for improving retrieval results
5. Retrieval Comparison: Tests retrieval performance with and without reranking across different top-k values (3, 5, 10, 15)
6. RAG Evaluation: Performs end-to-end RAG evaluation comparing baseline retrieval vs. reranked retrieval for answer generation

The results are saved as JSON files in the data/ directory for analysis.
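
The retrieval comparison among the steps above can be pictured with a small, self-contained sketch. It is not the repository's code: the word-overlap scorer stands in for a cross-encoder, hit rate at k is an assumed metric (this README does not name the one used), and all function names and sample data are hypothetical. In practice the scorer could be replaced by a real cross-encoder, such as a BAAI/bge-reranker model loaded through sentence-transformers.

```python
import json
from typing import Callable, Sequence


def rerank(query: str, docs: Sequence[str],
           score_fn: Callable[[str, str], float], top_k: int) -> list[str]:
    """Re-order first-stage candidates by score_fn and keep the best top_k."""
    ranked = sorted(docs, key=lambda d: score_fn(query, d), reverse=True)
    return ranked[:top_k]


def overlap_score(query: str, doc: str) -> float:
    """Toy stand-in for a cross-encoder: fraction of query words found in doc."""
    q_words = set(query.lower().split())
    d_words = set(doc.lower().split())
    return len(q_words & d_words) / len(q_words) if q_words else 0.0


def hit_rate_at_k(rankings: Sequence[Sequence[str]],
                  relevant: Sequence[str], k: int) -> float:
    """Share of queries whose relevant document appears in the top k results."""
    hits = sum(1 for ranked, rel in zip(rankings, relevant) if rel in ranked[:k])
    return hits / len(relevant) if relevant else 0.0


if __name__ == "__main__":
    # Hypothetical first-stage candidates, in the order a vector search returned them.
    candidates = [
        "general provisions on market surveillance",
        "rules on the protection of personal data",
        "customs duties on imported goods",
    ]
    queries = ["protection of personal data", "customs duties on goods"]
    relevant = [candidates[1], candidates[2]]

    results = {}
    for k in (1, 2, 3):  # the project itself reports k in (3, 5, 10, 15)
        baseline = [candidates for _ in queries]
        reranked = [rerank(q, candidates, overlap_score, k) for q in queries]
        results[f"top_{k}"] = {
            "baseline": hit_rate_at_k(baseline, relevant, k),
            "reranked": hit_rate_at_k(reranked, relevant, k),
        }
    print(json.dumps(results, indent=2))
```

With this toy data, reranking places the relevant document first for both queries, so the reranked hit rate is already 1.0 at k=1, while the baseline order only reaches 1.0 at k=3. The gap at small k is the effect the comparison is designed to expose.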

