A Streamlit-based chatbot powered by Retrieval-Augmented Generation (RAG) and OpenAI. Upload your PDFs and chat with them! This app leverages LangChain, FAISS, and OpenAI’s GPT models to extract and query document content with metadata-aware answers.
- 🔍 Upload multiple PDFs and query across all of them
- 📄 Metadata-rich answers with filename and page references
- 🧠 Uses LangChain + FAISS for semantic search
- 🤖 Streamlit Chat UI for natural conversation
- 💾 OpenAI API support with streaming responses
```text
.
├── .gitignore
├── LICENSE
├── README.md            # ← You're reading it
├── app.py               # Main Streamlit app
├── brain.py             # PDF parsing and vector index logic
├── compare medium.gif   # Optional UI illustration
├── requirements.txt     # Python dependencies
└── thumbnail.webp       # Preview image
```
```bash
git clone https://github.com/shamazhooda/chatbot-rag-langchain.git
cd chatbot-rag-langchain
pip install -r requirements.txt
```

Create a `.streamlit/secrets.toml` file with:

```toml
OPENAI_API_KEY = "your-openai-key"
```

Or export it via environment variable:

```bash
export OPENAI_API_KEY="your-openai-key"
```

Then start the app:

```bash
streamlit run app.py
```

- Upload PDFs via the UI
- Each PDF is parsed using `PyPDF2` and chunked via LangChain's `RecursiveCharacterTextSplitter`
- Chunks are embedded using OpenAI embeddings
- Embedded chunks are stored in a FAISS vector store for semantic similarity search
- Queries are matched to the top PDF chunks, which are passed to ChatGPT as context
- Answers include file name and page number metadata for citation (a sketch of this pipeline follows the list)
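As a rough, end-to-end sketch of that pipeline (illustrative only: `build_index`, the chunk sizes, and the `pypdf` import are assumptions, not the repo's exact API):

```python
# Illustrative sketch (not the repo's exact API): parse -> chunk -> embed -> index
from io import BytesIO

from pypdf import PdfReader  # successor to PyPDF2; the repo may import PyPDF2 instead
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain_core.documents import Document
from langchain_community.vectorstores import FAISS
from langchain_openai import OpenAIEmbeddings

def build_index(pdf_bytes: bytes, filename: str, api_key: str) -> FAISS:
    reader = PdfReader(BytesIO(pdf_bytes))
    splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100)
    docs = []
    for page_num, page in enumerate(reader.pages, start=1):
        text = page.extract_text() or ""
        for i, chunk in enumerate(splitter.split_text(text)):
            docs.append(Document(
                page_content=chunk,
                # filename/page metadata is what enables the cited answers described above
                metadata={"filename": filename, "page": page_num, "chunk": i},
            ))
    embeddings = OpenAIEmbeddings(openai_api_key=api_key)
    return FAISS.from_documents(docs, embeddings)
```

Attaching `filename` and `page` to each chunk is what lets the model cite its sources later.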
- Streamlit – UI framework
- LangChain – PDF chunking and retrieval
- FAISS – Vector search backend
- OpenAI GPT – LLM-based answer generation
- PyPDF2 – PDF parsing
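Given that stack, `requirements.txt` plausibly lists something like the following (an unpinned guess; consult the actual file in the repo):

```text
streamlit
langchain
langchain-community
langchain-openai
faiss-cpu
openai
PyPDF2
```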
"What are the main points from the introduction?"
Answer: The introduction highlights... (example.pdf, page 1)
- UI: `app.py` (Streamlit)
- RAG pipeline: `brain.py` (parse → chunk → embed → index)
- Model: OpenAI Chat Completions API (streaming)
- Retriever: FAISS similarity search
- PDFs: in-memory during the session (not persisted by default)
- Chunks/metadata: in-memory `Document` objects
- Vector store: FAISS, created in-memory and cached via `st.cache_resource`; lives for the server process lifetime and is cleared on restart or cache clear
- Chat history: `st.session_state` (per browser session)
- To keep vectors across restarts, persist FAISS:
```python
# brain.py
from langchain_openai import OpenAIEmbeddings
from langchain_community.vectorstores import FAISS

def save_index_local(vectordb: FAISS, path: str):
    # Write the FAISS index and its docstore/metadata to disk
    vectordb.save_local(path)

def load_index_local(path: str, openai_api_key: str) -> FAISS:
    # Rebuild the embeddings object so queries embed the same way as at index time
    embeddings = OpenAIEmbeddings(openai_api_key=openai_api_key)
    return FAISS.load_local(path, embeddings, allow_dangerous_deserialization=True)
```

```python
# app.py (around create_vectordb)
import os

DB_DIR = "data/faiss_index"
if os.path.isdir(DB_DIR):
    vectordb = load_index_local(DB_DIR, OPENAI_API_KEY)
else:
    vectordb = get_index_for_pdf([f.getvalue() for f in files], filenames, OPENAI_API_KEY)
    save_index_local(vectordb, DB_DIR)
```
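Note that `FAISS.load_local` requires `allow_dangerous_deserialization=True` because it unpickles the stored docstore; only enable it for index files you created yourself.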
- `app.py`
  - Loads `OPENAI_API_KEY` from `st.secrets` or the environment.
  - Uploads PDFs with `st.file_uploader`.
  - Builds or retrieves the FAISS index using `@st.cache_resource` in `create_vectordb(...)`.
  - On a question:
    - Retrieves top-k chunks with `vectordb.similarity_search(...)`.
    - Injects those chunks into a system prompt.
    - Streams model output using OpenAI's v1 Chat Completions API (see the sketch after this list).
- `brain.py`
  - `parse_pdf(...)`: uses `pypdf` to extract text from each page and normalizes whitespace/hyphenation.
  - `text_to_docs(...)`: splits text into chunks using `RecursiveCharacterTextSplitter` and attaches `filename`, `page`, and `chunk` metadata.
  - `docs_to_index(...)`: creates a FAISS index via `FAISS.from_documents(...)` with `OpenAIEmbeddings`.
  - `get_index_for_pdf(...)`: orchestrates PDF parse → chunk → embed → FAISS index.
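A hedged sketch of the retrieval-plus-streaming step mentioned above (assumes the OpenAI v1 Python SDK; the `answer` helper and the model name are illustrative, not necessarily what `app.py` uses):

```python
import os
from openai import OpenAI

client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])

def answer(question: str, vectordb, k: int = 3):
    # Retrieve the k most similar chunks and fold them into a system prompt
    chunks = vectordb.similarity_search(question, k=k)
    context = "\n\n".join(
        f"[{d.metadata['filename']}, page {d.metadata['page']}] {d.page_content}"
        for d in chunks
    )
    stream = client.chat.completions.create(
        model="gpt-4o-mini",  # illustrative; use whichever model the app configures
        messages=[
            {"role": "system", "content": f"Answer using only this context:\n{context}"},
            {"role": "user", "content": question},
        ],
        stream=True,
    )
    for event in stream:
        delta = event.choices[0].delta.content
        if delta:
            yield delta  # tokens arrive incrementally for the UI to render
```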
- Upload PDFs → `parse_pdf` extracts text per page.
- Text → `text_to_docs` creates chunked `Document` objects with metadata.
- Docs → `docs_to_index` embeds with `OpenAIEmbeddings` and builds a FAISS index.
- On a user question → `similarity_search(k=3)` returns the most relevant chunks.
- The app forms a system prompt with those chunks and streams a response from the model.
- The UI displays tokens as they arrive (sketched below).
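The token-by-token display can be wired up with Streamlit's chat primitives, roughly like this (illustrative; reuses the hypothetical `answer` generator from the sketch above):

```python
import streamlit as st

if "history" not in st.session_state:
    st.session_state.history = []  # chat history lives per browser session

# Replay earlier turns
for role, text in st.session_state.history:
    st.chat_message(role).write(text)

if question := st.chat_input("Ask about your PDFs"):
    st.chat_message("user").write(question)
    with st.chat_message("assistant"):
        # st.write_stream consumes the generator and returns the full reply
        reply = st.write_stream(answer(question, vectordb))
    st.session_state.history += [("user", question), ("assistant", reply)]
```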
- Your OpenAI key is required; store it in `.streamlit/secrets.toml` or the environment.
- Uploaded files stay in memory and are not written to disk unless you add persistence.
- The FAISS index is in-memory unless you add the optional save/load shown above.
<code_block_to_apply_changes_from>- Set
OPENAI_API_KEYin.streamlit/secrets.tomlor as an env var. pip install -r requirements.txtstreamlit run app.py