# Insight Engine

This is the backend for **Insight Engine**, an intelligent application designed to transform a collection of documents into a conversational knowledge base. It allows a user to upload documents (e.g., PDFs) and then ask questions in natural language to receive AI-powered answers based on the information contained within those documents.

The system is built as a multi-container application orchestrated with Docker Compose, ensuring a clean separation of concerns and a production-ready setup.

## System Architecture

The application operates using a **Retrieval-Augmented Generation (RAG)** architecture.

- **API Server (`app`)**: An Express.js server that handles incoming requests. It manages file uploads, adds processing jobs to the queue, and exposes the chat endpoint to the user.
- **Worker (`worker`)**: A background service using BullMQ that processes long-running tasks. It listens for jobs from the queue, extracts text from documents, generates vector embeddings using the Gemini API, and stores them in ChromaDB.
- **Redis (`redis`)**: Acts as a high-speed message broker for the BullMQ task queue, facilitating communication between the API Server and the Worker.
- **ChromaDB (`chroma`)**: The vector database that stores the document embeddings. It enables efficient semantic search to find information relevant to a user's query.
- **Shared Volume (`uploads`)**: A Docker volume mounted to both the API Server and the Worker, allowing the server to save an uploaded file and the worker to access it for processing.
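
A minimal `docker-compose.yml` for this layout might look like the following sketch. The service names match the list above; the image tags, entry-point commands, and mount paths are illustrative assumptions, not the project's exact files:

```yaml
# Sketch only — entry points and paths are assumptions.
services:
  app:
    build: .
    command: node src/server.js        # hypothetical API entry point
    ports:
      - "4000:4000"
    env_file: .env
    volumes:
      - uploads:/app/uploads           # shared with the worker
    depends_on: [redis, chroma]

  worker:
    build: .
    command: node src/worker.js        # hypothetical worker entry point
    env_file: .env
    volumes:
      - uploads:/app/uploads
    depends_on: [redis, chroma]

  redis:
    image: redis:7-alpine              # message broker for BullMQ

  chroma:
    image: chromadb/chroma             # vector store
    ports:
      - "8000:8000"

volumes:
  uploads:                             # shared storage for uploaded PDFs
```

The named volume is what lets the `app` container save a file that the `worker` container later reads and deletes.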

### How it Works (Step-by-Step)

1. **Knowledge Ingestion**:

   - A user uploads PDF files to the `POST /api/v1/pdf/uploads` endpoint.
   - The API Server clears the old ChromaDB collection to start a fresh session.
   - It adds a job for each file to the Redis queue.
   - The Worker picks up each job, processes the PDF, creates chunks, generates embeddings with the Gemini API, and stores them in ChromaDB.
   - The Worker then deletes the temporary file from the shared volume.

2. **Knowledge Retrieval**:
   - A user sends a question to the `POST /api/v1/pdf/chat` endpoint.
   - The API Server queries ChromaDB to retrieve the most relevant text chunks based on the question's meaning.
   - It _augments_ a prompt by combining this retrieved context with the original question.
   - This detailed prompt is sent to the Gemini generative model.
   - The model _generates_ a final answer based on the provided context, which is then sent back to the user.
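
The chunking step in the ingestion flow above can be sketched as a fixed-size splitter with overlap. The function name, chunk size, and overlap are illustrative assumptions, not the project's exact implementation:

```javascript
// Hypothetical sketch of the worker's chunking step: split extracted PDF
// text into overlapping windows before embedding. The overlap keeps
// sentences that straddle a boundary retrievable from either chunk.
function chunkText(text, chunkSize = 1000, overlap = 200) {
  const chunks = [];
  let start = 0;
  while (start < text.length) {
    chunks.push(text.slice(start, start + chunkSize));
    if (start + chunkSize >= text.length) break; // last window reached
    start += chunkSize - overlap;                // step forward, keep overlap
  }
  return chunks;
}
```

Each chunk is then embedded individually, so retrieval can return just the passages relevant to a question rather than whole documents.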

## Project Setup & How to Run

1. **Clone the Repository**

   ```bash
   git clone <your-repo-url>
   cd insight-engine
   ```

2. **Create Environment File**
   Create a `.env` file in the root directory and add your configuration.

   ```env
   # Server Port
   PORT=4000

   # Redis Configuration for Docker
   REDIS_HOST=redis
   REDIS_PORT=6379

   # ChromaDB Configuration for Docker
   CHROMA_HOST=chroma
   CHROMA_PORT=8000

   # Your Gemini API Key
   GEMINI_API_KEY=AIzaSy...
   ```

3. **Run with Docker Compose**
   Make sure you have Docker Desktop running. Then, from the project root, run:

   ```bash
   docker-compose up --build
   ```

   This command will build the Docker images and start all the necessary services (`app`, `worker`, `redis`, `chroma`).

4. **Interact with the API**
   - **Upload Files:** Send a `POST` request with `form-data` to `http://localhost:4000/api/v1/pdf/uploads`. The key for your files should be `files`.
   - **Chat:** Send a `POST` request with a JSON body to `http://localhost:4000/api/v1/pdf/chat`. The body should look like: `{ "question": "your question here" }`.
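
Behind the chat endpoint, the _augment_ step boils down to a string template that wraps the retrieved chunks around the user's question. The function name and prompt wording below are illustrative assumptions:

```javascript
// Hypothetical sketch of prompt augmentation for the /chat endpoint:
// combine the chunks retrieved from ChromaDB with the user's question
// into a single prompt for the generative model.
function buildPrompt(question, contextChunks) {
  const context = contextChunks.join("\n---\n"); // separate chunks clearly
  return (
    "Answer the question using ONLY the context below.\n\n" +
    `Context:\n${context}\n\n` +
    `Question: ${question}\n` +
    "Answer:"
  );
}
```

Constraining the model to the supplied context is what keeps answers grounded in the uploaded documents rather than the model's general knowledge.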

## Development

For local development with hot-reloading, create a `docker-compose.override.yml` file to mount your source code into the containers and use `nodemon` (as configured in your `package.json`). This allows changes to your code to be reflected instantly without rebuilding the image.
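
Such an override might look like the following sketch; the entry-point commands and mount paths are assumptions about the project layout:

```yaml
# docker-compose.override.yml — development only; paths are assumptions.
services:
  app:
    command: npx nodemon src/server.js   # hypothetical entry point
    volumes:
      - ./src:/app/src                   # mount source for hot-reload
  worker:
    command: npx nodemon src/worker.js   # hypothetical entry point
    volumes:
      - ./src:/app/src
```

Docker Compose merges this file with `docker-compose.yml` automatically, so `docker-compose up` picks it up without extra flags.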

## Architecture Diagram

```mermaid
graph TD
    subgraph "User / Client"
        User(["<br><b>User</b><br>via Frontend / Postman"])
    end

    subgraph "Backend Application (Docker)"
        subgraph "API Server (app-1)"
            API[("Express.js API<br>localhost:4000")]
        end

        subgraph "Background Worker (worker-1)"
            Worker[("BullMQ Worker")]
        end

        subgraph "Databases & Services"
            Redis[("Redis<br>Queue")]
            Chroma[("ChromaDB<br>Vector Store")]
        end

        subgraph "Shared Storage"
            Volume[("<br>uploads<br>Shared Volume")]
        end
    end

    subgraph "External Services"
        Gemini[("Google Gemini AI<br>Embeddings & Generation")]
    end

    %% Ingestion Flow (Uploading a document)
    User -- "1. Uploads PDF (POST /uploads)" --> API
    API -- "2. Saves file to" --> Volume
    API -- "3. Enqueues Job" --> Redis
    Worker -- "4. Dequeues Job" --> Redis
    Worker -- "5. Reads file from" --> Volume
    Worker -- "6. Generates Embeddings" --> Gemini
    Worker -- "7. Stores Embeddings" --> Chroma
    Worker -- "8. Deletes file from" --> Volume

    %% Retrieval Flow (Asking a question)
    User -- "9. Asks Question (POST /chat)" --> API
    API -- "10. Retrieves Context" --> Chroma
    API -- "11. Augments Prompt & Generates Answer" --> Gemini
    API -- "12. Sends Answer" --> User

    classDef default fill:#1E293B,stroke:#334155,stroke-width:2px,color:#fff;
    classDef user fill:#2563EB,stroke:#1D4ED8,stroke-width:2px,color:#fff;
    classDef external fill:#166534,stroke:#14532D,stroke-width:2px,color:#fff;

    class User user;
    class Gemini external;
```