A modular RAG (Retrieval-Augmented Generation) chatbot that supports both Ollama (local) and Azure OpenAI (cloud) as LLM providers.
```bash
# Create conda environment
conda create -n rag_chatbot python=3.11 -y
conda activate rag_chatbot

# Install dependencies
pip install -r requirements.txt
```

```bash
# Start Ollama with Docker
docker run -d --name ollama -p 11434:11434 -v ollama:/root/.ollama ollama/ollama
# Pull a model
docker exec ollama ollama pull qwen2.5:1.5b
```

Create a `.env` file:
```bash
cp .env.example .env
```

Edit `.env` with your Azure credentials:

```env
AZURE_OPENAI_API_KEY=your-api-key
AZURE_OPENAI_ENDPOINT=https://your-resource.openai.azure.com/
AZURE_LLM_DEPLOYMENT_NAME=gpt-4
```

```bash
# Run with Ollama (default)
python run_app.py
# Run with Azure OpenAI
python run_app.py --provider azure
# Run with specific options
python run_app.py --provider ollama --model llama3.2:1b --port 8501
```

Open http://localhost:8501 in your browser.
```bash
python run_app.py --help
```

| Argument | Short | Default | Description |
|---|---|---|---|
| `--provider` | `-p` | `ollama` | LLM provider: `ollama` or `azure` |
| `--model` | `-m` | `qwen2.5:1.5b` | Ollama model name |
| `--ollama-url` | | `http://localhost:11434` | Ollama base URL |
| `--api-key` | | from `.env` | Azure OpenAI API key |
| `--endpoint` | | from `.env` | Azure OpenAI endpoint |
| `--deployment` | `-d` | `gpt-4` | Azure deployment name |
| `--temperature` | `-t` | `0.7` | LLM temperature |
| `--port` | | `8501` | Streamlit port |
| `--check` | | | Validate config only |
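For orientation, here is a hedged sketch of how a launcher like this could map the flags above onto the environment variables documented further below before starting Streamlit. It is an illustration only, not the repository's actual `run_app.py`:

```python
# Hypothetical sketch of the launcher's argument handling; the real run_app.py may differ.
import argparse
import os
import subprocess

parser = argparse.ArgumentParser(description="RAG chatbot launcher")
parser.add_argument("-p", "--provider", default="ollama", choices=["ollama", "azure"])
parser.add_argument("-m", "--model", default="qwen2.5:1.5b")
parser.add_argument("--ollama-url", default="http://localhost:11434")
parser.add_argument("--api-key", default=None)
parser.add_argument("--endpoint", default=None)
parser.add_argument("-d", "--deployment", default="gpt-4")
parser.add_argument("-t", "--temperature", type=float, default=0.7)
parser.add_argument("--port", type=int, default=8501)
parser.add_argument("--check", action="store_true", help="Validate config only")
args = parser.parse_args()

# Hand the chosen settings to the app through environment variables,
# so CLI flags override whatever is in .env.
overrides = {
    "LLM_PROVIDER": args.provider,
    "OLLAMA_MODEL": args.model,
    "OLLAMA_BASE_URL": args.ollama_url,
    "AZURE_OPENAI_API_KEY": args.api_key,
    "AZURE_OPENAI_ENDPOINT": args.endpoint,
    "AZURE_LLM_DEPLOYMENT_NAME": args.deployment,
    "LLM_TEMPERATURE": str(args.temperature),
}
os.environ.update({k: v for k, v in overrides.items() if v is not None})

if not args.check:
    subprocess.run(["streamlit", "run", "app.py", "--server.port", str(args.port)])
```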
Examples:

```bash
# Check configuration without starting
python run_app.py --check
# Run with Ollama and specific model
python run_app.py --provider ollama --model llama3.2:3b
# Run with Azure OpenAI
python run_app.py --provider azure
# Run with Azure and custom deployment
python run_app.py --provider azure --deployment gpt-4-turbo
# Run on different port
python run_app.py --port 8502
```

Project layout:

```
.
├── app.py                 # Streamlit UI application
├── chatbot.py             # Main chatbot orchestrator
├── config.py              # Configuration settings
├── document_processor.py  # PDF processing module
├── llm_handler.py         # LLM integration (Ollama + Azure)
├── vector_store.py        # ChromaDB vector store management
├── utils.py               # Utility functions
├── run_app.py             # CLI launcher with argparse
├── run.sh                 # Shell script launcher
├── .env.example           # Environment template
├── pdfFiles/              # Directory for uploaded PDFs
└── vectorDB/              # Directory for vector database
```
The following environment variables are read from `.env`:

| Variable | Description | Required |
|---|---|---|
| `LLM_PROVIDER` | `ollama` or `azure` | No (default: `ollama`) |
| `OLLAMA_MODEL` | Ollama model name | No |
| `OLLAMA_BASE_URL` | Ollama API URL | No |
| `AZURE_OPENAI_API_KEY` | Azure API key | For Azure |
| `AZURE_OPENAI_ENDPOINT` | Azure endpoint URL | For Azure |
| `AZURE_LLM_DEPLOYMENT_NAME` | Azure deployment name | For Azure |
| `AZURE_OPENAI_API_VERSION` | Azure API version | No |
| `LLM_TEMPERATURE` | Generation temperature | No |
Models you can pull for Ollama:

```bash
# Small models (fast, low memory)
docker exec ollama ollama pull qwen2.5:1.5b
docker exec ollama ollama pull llama3.2:1b
docker exec ollama ollama pull phi3:mini
# Medium models (balanced)
docker exec ollama ollama pull llama3.2:3b
docker exec ollama ollama pull mistral
# List available models
docker exec ollama ollama list
```

`config.py`: Central configuration with environment variable support for both Ollama and Azure.
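A minimal sketch of what such a configuration module can look like, assuming `python-dotenv` is used to load `.env` (the variable names follow the table above; the actual `config.py` may be organized differently):

```python
# Hypothetical configuration module; the real config.py may differ.
import os
from dotenv import load_dotenv

load_dotenv()  # read values from .env into the process environment

LLM_PROVIDER = os.getenv("LLM_PROVIDER", "ollama")
LLM_TEMPERATURE = float(os.getenv("LLM_TEMPERATURE", "0.7"))

# Ollama settings
OLLAMA_MODEL = os.getenv("OLLAMA_MODEL", "qwen2.5:1.5b")
OLLAMA_BASE_URL = os.getenv("OLLAMA_BASE_URL", "http://localhost:11434")

# Azure OpenAI settings (required only when LLM_PROVIDER == "azure")
AZURE_OPENAI_API_KEY = os.getenv("AZURE_OPENAI_API_KEY")
AZURE_OPENAI_ENDPOINT = os.getenv("AZURE_OPENAI_ENDPOINT")
AZURE_LLM_DEPLOYMENT_NAME = os.getenv("AZURE_LLM_DEPLOYMENT_NAME", "gpt-4")
AZURE_OPENAI_API_VERSION = os.getenv("AZURE_OPENAI_API_VERSION", "2024-02-01")
```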
`llm_handler.py`: Unified LLM handler (see the sketch after this list) supporting:
- Ollama (local models)
- Azure OpenAI (cloud models)
- Provider switching at runtime
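As an illustration only, a provider switch of this kind might look roughly like the following, assuming the `ollama` and `openai` Python packages (the real `llm_handler.py` may use LangChain or other wrappers instead, and all credentials/model names below are placeholders):

```python
# Illustrative provider switch; not the actual llm_handler.py implementation.
import ollama
from openai import AzureOpenAI


def generate(prompt: str, provider: str = "ollama", temperature: float = 0.7) -> str:
    messages = [{"role": "user", "content": prompt}]

    if provider == "ollama":
        # Talk to the local Ollama server.
        client = ollama.Client(host="http://localhost:11434")
        response = client.chat(model="qwen2.5:1.5b", messages=messages,
                               options={"temperature": temperature})
        return response["message"]["content"]

    if provider == "azure":
        # Talk to an Azure OpenAI deployment; credentials normally come from .env.
        client = AzureOpenAI(
            api_key="your-api-key",
            api_version="2024-02-01",
            azure_endpoint="https://your-resource.openai.azure.com/",
        )
        response = client.chat.completions.create(
            model="gpt-4",  # deployment name
            messages=messages,
            temperature=temperature,
        )
        return response.choices[0].message.content

    raise ValueError(f"Unknown provider: {provider}")
```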
`document_processor.py`: PDF processing (see the sketch after this list) with:
- Text extraction
- Document chunking
- Multiple file support
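Conceptually, this step extracts text per page and splits it into overlapping chunks. A minimal sketch, assuming `pypdf` and a simple sliding-window splitter (the repository may use a different PDF or splitting library, and the file path is illustrative):

```python
# Simplified illustration of PDF extraction and chunking; not the repository's exact code.
from pypdf import PdfReader


def extract_text(pdf_path: str) -> str:
    """Concatenate the text of every page in the PDF."""
    reader = PdfReader(pdf_path)
    return "\n".join(page.extract_text() or "" for page in reader.pages)


def chunk_text(text: str, chunk_size: int = 1000, overlap: int = 200) -> list[str]:
    """Split text into overlapping chunks so context survives chunk boundaries."""
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        start += chunk_size - overlap
    return chunks


chunks = chunk_text(extract_text("pdfFiles/example.pdf"))
```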
`vector_store.py`: ChromaDB vector database (see the sketch after this list):
- Document embedding (via Ollama)
- Similarity search
- Persistent storage
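A minimal sketch of the idea, assuming `chromadb` and Ollama-generated embeddings with an embedding model such as `nomic-embed-text` (the model and collection names here are placeholders, not necessarily the repository's defaults):

```python
# Illustrative ChromaDB usage with Ollama embeddings; names are placeholders.
import chromadb
import ollama

client = chromadb.PersistentClient(path="vectorDB")  # persistent on-disk store
collection = client.get_or_create_collection("pdf_chunks")


def embed(text: str) -> list[float]:
    # Ask the local Ollama server for an embedding vector.
    return ollama.embeddings(model="nomic-embed-text", prompt=text)["embedding"]


def add_chunks(chunks: list[str]) -> None:
    collection.add(
        ids=[f"chunk-{i}" for i in range(len(chunks))],
        documents=chunks,
        embeddings=[embed(c) for c in chunks],
    )


def search(query: str, k: int = 3) -> list[str]:
    result = collection.query(query_embeddings=[embed(query)], n_results=k)
    return result["documents"][0]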
`chatbot.py`: Main orchestrator combining all components.
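Put together, the orchestration is essentially retrieve-then-generate. A hedged sketch (the `search` and `generate` functions refer to the illustrative snippets above, not to the repository's actual interfaces):

```python
# Illustrative RAG loop; the real chatbot.py may structure prompts and history differently.
def answer(question: str, provider: str = "ollama") -> str:
    # 1. Retrieve the most relevant chunks from the vector store.
    context_chunks = search(question, k=3)
    context = "\n---\n".join(context_chunks)

    # 2. Build a prompt that grounds the model in the retrieved context.
    prompt = (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )

    # 3. Generate the answer with the selected LLM provider.
    return generate(prompt, provider=provider)
```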
`app.py`: Streamlit UI (see the sketch after this list) with:
- File upload
- Chat interface
- Provider info display
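These pieces map onto standard Streamlit widgets. A stripped-down sketch; the actual `app.py` has more state handling and provider information than shown here:

```python
# Minimal Streamlit layout in the spirit of app.py; heavily simplified.
import streamlit as st

st.title("RAG Chatbot")

with st.sidebar:
    uploaded = st.file_uploader("Upload PDFs", type="pdf", accept_multiple_files=True)
    if st.button("Process PDFs") and uploaded:
        # Extract, chunk, and index each uploaded file (see earlier sketches).
        st.success(f"Processed {len(uploaded)} file(s)")

if question := st.chat_input("Ask about your documents"):
    with st.chat_message("user"):
        st.write(question)
    with st.chat_message("assistant"):
        st.write("(answer from the RAG pipeline goes here)")
```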
To use the chatbot:

1. Start the application with your preferred provider
2. Upload PDF files through the sidebar
3. Click "Process PDFs" to analyze documents
4. Start asking questions about your documents
If Ollama is not running, start (or create) the container:

```bash
docker start ollama
# or
docker run -d --name ollama -p 11434:11434 -v ollama:/root/.ollama ollama/ollama
```

If a model is missing, pull it:

```bash
docker exec ollama ollama pull qwen2.5:1.5b
```

For Azure OpenAI errors, check the following (a quick connectivity test is sketched after the list):

- Verify the API key in `.env`
- Check the endpoint URL format
- Confirm the deployment name matches the Azure portal
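If those all look right, you can isolate the problem by calling the deployment directly, assuming the `openai` package is installed (values are read from the environment; defaults below are placeholders):

```python
# Stand-alone Azure OpenAI connectivity check; defaults below are placeholders.
import os
from openai import AzureOpenAI

client = AzureOpenAI(
    api_key=os.environ["AZURE_OPENAI_API_KEY"],
    api_version=os.getenv("AZURE_OPENAI_API_VERSION", "2024-02-01"),
    azure_endpoint=os.environ["AZURE_OPENAI_ENDPOINT"],
)

response = client.chat.completions.create(
    model=os.getenv("AZURE_LLM_DEPLOYMENT_NAME", "gpt-4"),  # deployment name
    messages=[{"role": "user", "content": "ping"}],
)
print(response.choices[0].message.content)
```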
If the default port is already in use, run on another one:

```bash
python run_app.py --port 8502
```

MIT License