Commit f98f077

docs: added setup and usage info

1 parent eb5a7fe commit f98f077

File tree

4 files changed: +148 −13 lines


README.md

Lines changed: 130 additions & 0 deletions
@@ -0,0 +1,130 @@
# RAGnarok

> RAG-powered research and documentation assistant for professionals

---

### Tech Stack

![NextJs](https://img.shields.io/badge/Nextjs-black?style=for-the-badge&logo=nextdotjs&logoColor=white)
![Python](https://img.shields.io/badge/Python-blue?style=for-the-badge&logo=python&logoColor=white)
![FastAPI](https://img.shields.io/badge/FastAPI-009688.svg?style=for-the-badge&logo=FastAPI&logoColor=white)
![AstraDB](https://img.shields.io/badge/astradb-3b0764?style=for-the-badge&logo=expo&logoColor=b85930)
![Docker](https://img.shields.io/badge/docker-%230db7ed.svg?style=for-the-badge&logo=docker&logoColor=white)
![Gemini](https://img.shields.io/badge/gemini-8E75B2?style=for-the-badge&logo=google%20gemini&logoColor=white)
![Bun](https://img.shields.io/badge/Bun-000000.svg?style=for-the-badge&logo=Bun&logoColor=white)
![GitHub Actions](https://img.shields.io/badge/github%20actions-%232671E5.svg?style=for-the-badge&logo=githubactions&logoColor=white)
![Ollama](https://img.shields.io/badge/Ollama-FFF.svg?style=for-the-badge&logo=Ollama&logoColor=black)
![Caddy](https://img.shields.io/badge/Caddy-1F88C0.svg?style=for-the-badge&logo=Caddy&logoColor=white)
![UV](https://img.shields.io/badge/uv-DE5FE9.svg?style=for-the-badge&logo=uv&logoColor=white)
![DigitalOcean](https://img.shields.io/badge/DigitalOcean-0080FF.svg?style=for-the-badge&logo=DigitalOcean&logoColor=white)
## Overview

RAGnarok is a full-stack, retrieval-augmented generation (RAG) system that lets you:

1. **Ingest** arbitrary documents from diverse professions into a vector store
2. **Serve** a REST API that retrieves relevant passages and generates answers via an LLM
3. **Interact** through a modern NextJS-based chat UI
4. **Deploy** the entire stack with Docker Compose, in the cloud or self-hosted locally

The project consists of four main components:

- **Ingestion Pipeline**

  Reads documents (PDFs, text, Markdown), splits and vectorizes them, and stores the embeddings in a vector database (e.g. AstraDB).

- **Backend API**

  A FastAPI service exposing endpoints for querying the vector store, invoking SoTA LLMs such as DeepSeek, and streaming chat responses.

- **Frontend UI**

  A NextJS + TypeScript chat interface that calls the Backend API for conversational RAG.

- **Deployment Manifests**

  Dockerfiles and Docker Compose manifests to deploy the entire project locally or in the cloud.

![System Architecture](assets/arch.jpeg)
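The ingest-then-retrieve flow above can be sketched in plain Python. This is a toy illustration, not the project's actual pipeline: the character-window splitter, the bag-of-words "embedding", and the chunk size and overlap are all stand-ins for the real splitter and neural embedding model (e.g. Nomic via Ollama).

```python
from collections import Counter
from math import sqrt

def chunk_text(text: str, size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into overlapping character windows (stand-in for a real splitter)."""
    step = size - overlap
    return [text[i:i + size] for i in range(0, max(len(text) - overlap, 1), step)]

def embed(text: str) -> Counter:
    """Toy bag-of-words 'embedding'; the real pipeline uses a neural embedding model."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    na = sqrt(sum(v * v for v in a.values()))
    nb = sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query: str, chunks: list[str], k: int = 2) -> list[str]:
    """Rank stored chunks by similarity to the query and return the top k."""
    q = embed(query)
    return sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)[:k]

if __name__ == "__main__":
    docs = [
        "fatty liver disease affects the liver",
        "tax law for accountants",
        "liver function tests explained",
    ]
    print(retrieve("liver disease", docs, k=2))  # most similar chunks first
```

In the real system the retrieved chunks would then be packed into the LLM prompt as context before generating an answer.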
## Prerequisites

- **Docker & Docker Compose**
- **Node.js** (v20+) and **Bun**
- **Python** (v3.11+)
- **LLM API key** (Groq, Gemini, etc.)
- **[AstraDB](https://www.datastax.com/lp/vector-database) vector database credentials**

## Project Structure

```
.
├───.github
│   └───workflows
├───assets
├───backend
│   └───app
│       ├───models
│       ├───routes
│       ├───services
│       └───utils
├───deployment
├───frontend
│   ├───public
│   └───src
│       ├───app
│       │   └───test
│       ├───components
│       │   └───ui
│       ├───context
│       └───lib
└───ingestion-pipeline

20 directories, 65 files
```
## Setup

1. **Clone the repo**

   ```bash
   git clone https://github.com/3xCaffeine/rag.git
   cd rag
   ```

2. **Configure environment variables**

   Copy the example `.env.example` into each service folder and fill in the API keys, URLs, and AstraDB database credentials.
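   For illustration, a backend `.env` might look like the following. The variable names here are placeholders, not the project's actual keys; copy the real names from each service's `.env.example`:

   ```bash
   # Hypothetical variable names -- use the ones from .env.example
   GROQ_API_KEY=gsk_...
   GEMINI_API_KEY=...
   ASTRA_DB_APPLICATION_TOKEN=AstraCS:...
   ASTRA_DB_API_ENDPOINT=https://<db-id>-<region>.apps.astra.datastax.com
   OLLAMA_BASE_URL=http://ollama:11434
   ```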
3. **Build & run with Docker Compose**

   ```bash
   docker compose up -d
   ```

   > [!NOTE]
   > If GPU support is unavailable on the machine, use a hosted text embedding model (OpenAI Ada, Jina AI, Cohere, etc.) and remove the Ollama section from the Compose config.

4. (Optional) For remote deployment with custom domains, configure the `deployment/Caddyfile`; otherwise remove the Caddy section from the Compose configuration.
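   If you do keep Caddy, a minimal `deployment/Caddyfile` for this step could look like the sketch below. The domains and upstream service names/ports are assumptions; adjust them to your DNS records and Compose service names:

   ```
   chat.example.com {
       reverse_proxy frontend:3000
   }

   api.example.com {
       reverse_proxy api:8000
   }
   ```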
### Expected Result

```bash
$ docker compose ps
NAME            IMAGE                  COMMAND                  SERVICE   CREATED         STATUS                   PORTS
backend-api-1   ragbackend:latest      "fastapi run app/mai…"   api       3 minutes ago   Up 3 minutes (healthy)   0.0.0.0:8000->8000/tcp
ollama          ollama/ollama:latest   "/bin/sh ./run_model…"   ollama    9 seconds ago   Up 9 seconds             0.0.0.0:11434->11434/tcp
```

## Usage

Visit the RAGnarok chat at http://localhost:3000.
#### Demo

## Authors

This project was built for the AISoC Chronos Hackathon 2025 by the 3xCaffeine team.

- Sourasish Basu ([@SourasishBasu](https://github.com/SourasishBasu))
- Swapnil Dutta ([@rycerzes](https://github.com/rycerzes))
- Vaibhav Singh ([@monkeplication](https://github.com/monkeplication))

## Version

| Version | Date           | Comments        |
| ------- | -------------- | --------------- |
| 1.0     | May 14th, 2025 | Revised release |

deployment/docker-compose.yml

Lines changed: 14 additions & 9 deletions

```diff
@@ -1,5 +1,17 @@
 ---
 services:
+  api:
+    image: ghcr.io/3xCaffeine/rag-backend:latest
+    container_name: api
+    restart: unless-stopped
+    env_file:
+      - .env
+    expose:
+      - "8000"
+    ports:
+      - "8000:8000"
+
+  # Add below section to use Caddy as a reverse proxy with custom domains
   caddy:
     image: caddy:latest
     restart: unless-stopped
@@ -11,15 +23,8 @@ services:
       - ./Caddyfile:/etc/caddy/Caddyfile
       - caddy_data:/data
 
-  api:
-    image: ghcr.io/3xCaffeine/rag-backend:latest
-    container_name: api
-    restart: unless-stopped
-    env_file:
-      - .env
-    expose:
-      - "8000"
-
+
+  # Add the service below to run Nomic text embedding model via Ollama if machine has GPU support
   ollama:
     container_name: ollama
     tty: true
```
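The GPU comment in the diff above refers to Compose's device reservations. A typical way to grant the Ollama container an NVIDIA GPU is the snippet below; this is an assumed sketch (the project's full `ollama` service definition is not shown in this diff), using standard Compose syntax:

```yaml
  ollama:
    image: ollama/ollama:latest
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              count: all
              capabilities: [gpu]
```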
Lines changed: 4 additions & 4 deletions

```diff
@@ -42,7 +42,7 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 3,
+   "execution_count": null,
    "metadata": {},
    "outputs": [
     {
@@ -62,7 +62,7 @@
    ],
    "source": [
     "# load documents\n",
-    "documents = SimpleDirectoryReader(\"./data/medical/\").load_data(num_workers=10)\n",
+    "documents = SimpleDirectoryReader(\"./data/paul_graham/\").load_data(num_workers=10)\n",
     "print(f\"Total documents: {len(documents)}\")\n",
     "print(f\"First document, id: {documents[0].doc_id}\")\n",
     "print(f\"First document, hash: {documents[0].hash}\")\n",
@@ -103,7 +103,7 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 7,
+   "execution_count": null,
    "metadata": {},
    "outputs": [
     {
@@ -128,7 +128,7 @@
    "llm = Groq(model=\"deepseek-r1-distill-llama-70b\", api_key=groq_api_token)\n",
    "\n",
    "query_engine = index.as_query_engine(llm=llm)\n",
-   "response = query_engine.query(\"A BRIEF REVIEW OF THE FATTY LIVER DISEASE\")\n",
+   "response = query_engine.query(\"A BRIEF REVIEW OF THE ESSAY\")\n",
    "\n",
    "print(response.response)"
   ]
```

0 commit comments