# SimpleRAG

Advanced document Q&A with semantic search, reranking, and hybrid retrieval.

## Features
- Hybrid dense/sparse retrieval
- Semantic chunking
- Query expansion
- Reranking
- Document compression
- GPU/CPU support
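To illustrate the hybrid dense/sparse retrieval idea, the sketch below fuses normalized embedding-similarity scores with BM25-style lexical scores. This is a minimal, dependency-free illustration, not SimpleRAG's actual code: the function names are hypothetical, and the dense scores (which would come from an embedding model such as `bge-small-en-v1.5`) are stubbed with constants.

```python
import math

def bm25_scores(query_terms, docs, k1=1.5, b=0.75):
    """Sparse lexical scores: a minimal BM25 over whitespace-tokenized docs."""
    tokenized = [d.lower().split() for d in docs]
    avg_len = sum(len(t) for t in tokenized) / len(tokenized)
    n = len(docs)
    scores = [0.0] * n
    for term in query_terms:
        df = sum(1 for toks in tokenized if term in toks)
        if df == 0:
            continue
        idf = math.log(1 + (n - df + 0.5) / (df + 0.5))
        for i, toks in enumerate(tokenized):
            tf = toks.count(term)
            denom = tf + k1 * (1 - b + b * len(toks) / avg_len)
            scores[i] += idf * tf * (k1 + 1) / denom
    return scores

def hybrid_scores(dense, sparse, alpha=0.5):
    """Fuse min-max-normalized dense and sparse scores: alpha*dense + (1-alpha)*sparse."""
    def norm(xs):
        lo, hi = min(xs), max(xs)
        return [(x - lo) / (hi - lo) if hi > lo else 0.0 for x in xs]
    return [alpha * d + (1 - alpha) * s for d, s in zip(norm(dense), norm(sparse))]

docs = [
    "Hybrid retrieval mixes dense and sparse signals.",
    "Semantic chunking splits documents at topic boundaries.",
    "Reranking reorders candidates with a cross-encoder.",
]
dense = [0.82, 0.10, 0.35]  # stub: would come from an embedding model
sparse = bm25_scores("hybrid retrieval".split(), docs)
fused = hybrid_scores(dense, sparse, alpha=0.6)
best = max(range(len(docs)), key=fused.__getitem__)
print(docs[best])
```

Tuning `alpha` trades off semantic matching (dense) against exact-term matching (sparse); a cross-encoder reranker can then refine the fused ranking.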
## Installation

- **Clone the repository**

  ```bash
  git clone https://github.com/feyzollahi/SimpleRAG.git
  cd SimpleRAG
  ```

- **Install dependencies**

  ```bash
  poetry install
  ```
- **Configure environment variables**

  Create a `.env` file in the project root:

  ```env
  HF_TOKEN=your_huggingface_token
  MODEL_NAME=google/gemma-2b-it
  EMBEDDING_MODEL_NAME=BAAI/bge-small-en-v1.5
  RERANKER_MODEL_NAME=BAAI/bge-reranker-base
  DOC_DIR=docs
  CACHE_DIR=.cache
  ```

  `HF_TOKEN` is required for some Hugging Face models; get it from Hugging Face. Adjust the other variables as needed.
- **Add your documents**

  Place `.txt`, `.md`, or `.pdf` files in the `docs` directory.

- **Run the app**

  ```bash
  streamlit run app.py
  ```
## Usage

- Enter your question in the app UI.
- Adjust retrieval and generation settings in the sidebar.
- View sources and context for each answer.
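The reranking stage listed in the features reorders retrieved candidates before they reach the generator. The sketch below shows the shape of that step with a toy term-overlap scorer standing in for a real cross-encoder such as `bge-reranker-base`; the function names are illustrative, not SimpleRAG's API.

```python
def rerank(query, candidates, score_fn, top_k=3):
    """Reorder retrieved candidates by a relevance score; keep the top_k."""
    return sorted(candidates, key=lambda c: score_fn(query, c), reverse=True)[:top_k]

def overlap_score(query, doc):
    # Toy stand-in for a cross-encoder: fraction of query terms found in the doc.
    q, d = set(query.lower().split()), set(doc.lower().split())
    return len(q & d) / (len(q) or 1)

candidates = [
    "reranking reorders candidates",
    "semantic chunking",
    "query expansion adds terms",
]
top = rerank("reorders candidates by relevance", candidates, overlap_score, top_k=2)
print(top[0])
```

In the real pipeline the scorer would be a learned model that reads the query and candidate jointly, which is slower but far more accurate than either retrieval score alone.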
## Notes

- For best results, use high-quality documents.
- A GPU is recommended for faster inference.
- All configuration is managed via `.env`.
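One plausible way the `.env` settings above reach the app is via environment variables with defaults matching the example file; this is a sketch using `os.getenv`, and SimpleRAG's actual configuration code may differ.

```python
import os

# Defaults mirror the example .env; real values override via the environment.
MODEL_NAME = os.getenv("MODEL_NAME", "google/gemma-2b-it")
EMBEDDING_MODEL_NAME = os.getenv("EMBEDDING_MODEL_NAME", "BAAI/bge-small-en-v1.5")
RERANKER_MODEL_NAME = os.getenv("RERANKER_MODEL_NAME", "BAAI/bge-reranker-base")
DOC_DIR = os.getenv("DOC_DIR", "docs")
CACHE_DIR = os.getenv("CACHE_DIR", ".cache")
HF_TOKEN = os.getenv("HF_TOKEN")  # required for some Hugging Face models

if HF_TOKEN is None:
    print("Warning: HF_TOKEN not set; gated models may fail to download.")
print(MODEL_NAME, DOC_DIR)
```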
## License

MIT