This project implements a Retrieval-Augmented Generation (RAG) question-answering system over a single document. It ingests a PDF, splits it into overlapping chunks, embeds the chunks, stores the embeddings in a FAISS vector index, and answers user queries with an open-source LLM.
- Document Ingestion & Chunking
  - Loader: PyPDFLoader from langchain_community
  - Splitter: RecursiveCharacterTextSplitter
  - chunk_size: 400 characters
  - chunk_overlap: 50 characters
  - Separators: ["\n\n", "\n", " ", ""] to preserve paragraph context
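A minimal sketch of the ingestion step (exact import paths vary across LangChain versions; the file name book.pdf is taken from the sample query below):

```python
from langchain_community.document_loaders import PyPDFLoader
from langchain.text_splitter import RecursiveCharacterTextSplitter

# Load the PDF; PyPDFLoader returns one Document per page with page metadata.
loader = PyPDFLoader("book.pdf")
pages = loader.load()

# Split into 400-character chunks with 50-character overlap,
# falling back from paragraphs to lines to words to characters.
splitter = RecursiveCharacterTextSplitter(
    chunk_size=400,
    chunk_overlap=50,
    separators=["\n\n", "\n", " ", ""],
)
chunks = splitter.split_documents(pages)
```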
- Embedding & Vector Store
  - Model: sentence-transformers/all-MiniLM-L6-v2
  - Embedding wrapper: HuggingFaceEmbeddings (LangChain)
  - Storage: FAISS
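A sketch of the indexing step, continuing from the chunks above (the faiss_index directory name is illustrative, and import paths again depend on the LangChain version):

```python
from langchain_community.embeddings import HuggingFaceEmbeddings
from langchain_community.vectorstores import FAISS

# Embed every chunk with MiniLM and build an in-memory FAISS index.
embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-MiniLM-L6-v2")
vector_store = FAISS.from_documents(chunks, embeddings)

# Optionally persist the index to disk for later runs.
vector_store.save_local("faiss_index")
```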
- Query Interface
  - Command-line interface using input()
  - Accepts a user query and retrieves the top-k most relevant chunks
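A sketch of the retrieval loop, using the vector_store built above; k=3 is an assumed value for top-k, not necessarily what main.py uses:

```python
while True:
    query = input("Enter a question (or 'exit' to quit): ").strip()
    if query.lower() == "exit":
        break

    # Retrieve the top-k chunks most similar to the query.
    top_chunks = vector_store.similarity_search(query, k=3)
    context = "\n\n".join(doc.page_content for doc in top_chunks)
```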
- LLM Integration
  - Model: vblagoje/bart_lfqa (Hugging Face)
  - Pipeline: text2text-generation
  - The prompt instructs the model to answer from the retrieved context or to state that no answer was found
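A sketch of the generation setup, continuing from the retrieved context and query above; the prompt wording here is illustrative, not the exact prompt used in main.py:

```python
from transformers import pipeline

# CPU-only inference (device=-1), matching the limitations listed below.
qa_pipeline = pipeline("text2text-generation", model="vblagoje/bart_lfqa", device=-1)

prompt = (
    "Answer the question using only the context below. "
    "If the context does not contain the answer, say that no answer was found.\n\n"
    f"Context: {context}\n\nQuestion: {query}"
)
```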
- Output
  - Printed answer
  - Source document and page numbers
  - Time taken to generate the answer
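A sketch of how the answer, sources, and timing could be reported, continuing from the pipeline and prompt above (max_length=256 is an assumed setting):

```python
import time

start = time.time()
answer = qa_pipeline(prompt, max_length=256)[0]["generated_text"]
elapsed = time.time() - start

# PyPDFLoader stores the source file and page number in each chunk's metadata.
source = top_chunks[0].metadata.get("source")
page_numbers = sorted({doc.metadata.get("page") for doc in top_chunks})

print(f"Answer: {answer}")
print(f"Source: {source}, pages: {page_numbers}")
print(f"Time taken: {elapsed:.2f}s")
```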
- Libraries & Tools
  - langchain: document loading and chunking
  - transformers: LLM pipeline
  - sentence-transformers: embedding generation
  - FAISS: vector similarity search
  - ChatGPT: used for architectural decisions and class structure
Query: What are the diagnostic criteria for Obsessive-Compulsive Disorder (OCD)?
Output:
Answer: [LLM-generated answer here]
Source: book.pdf


- Features Implemented
  - PDF ingestion
  - Chunking with overlap
  - FAISS vector store
  - HuggingFace LLM-based QA
  - CLI interface
  - Prompt engineering
- Limitations
  - No GUI (Gradio/Streamlit)
  - No caching for repeated queries
  - No reranking logic
  - CPU-only inference
  - No dynamic document uploads
- Assumptions
  - The input is a clean PDF with extractable text
  - All models can be loaded on CPU
  - Responses rely only on the provided document context
- How to Run
  - pip install -r req.txt
  - python main.py
- Areas Where ChatGPT Assisted
  - Code structure and class design
  - Prompt design
  - Model selection
  - README formatting
This MVP provides an end-to-end RAG QA pipeline that is modular, extensible, and runs fully locally.