70 changes: 45 additions & 25 deletions README.md
@@ -15,13 +15,13 @@ uv sync --no-dev --frozen
uv run gen-protos
```

-## Usage
+## Configuration

-This section explains how to run the SYNC Server LLM using different methods.
+Before running the server, you need to:

-1. Configure the server by editing `configs/config.toml`
+1. Configure the server settings in `configs/config.toml`

-2. Set up the required environment variables by adding them to a `.env` file
+2. Create a `.env` file with the following environment variables:

| Variable | Description |
| ------------------- | ----------------------------- |
@@ -30,38 +30,58 @@ This section explains how to run the SYNC Server LLM using different methods.
| `QDRANT_PORT` | The Qdrant host REST API port |
| `QDRANT_COLLECTION` | The Qdrant collection name |
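
A minimal `.env` sketch based on the variables above. The values are placeholders for a local Qdrant instance: `6333` is Qdrant's default REST port, and the collection name is a hypothetical example.

```shell
QDRANT_HOST=localhost
QDRANT_PORT=6333
QDRANT_COLLECTION=documents
```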

-3. Start the server:
+## Running the Server

-- To run the server locally:
+You can run SYNC Server LLM using one of the following methods:

-```shell
-uv run scripts/serve.py --config configs/config.toml
-```
+### Method 1: Running Locally

-- To run the server using Docker:
+```shell
+uv run scripts/serve.py --config configs/config.toml
+```

### Method 2: Using Docker

1. Build the Docker image:

```shell
docker build -t sync/backend-llm .
```

2. Run the container:

```shell
docker run -p 50051:50051 \
  --env-file .env \
  -v $(pwd)/path/to/configs:/app/configs/config.toml \
  -v $(pwd)/path/to/hf_cache:/tmp/llama_index \
  sync/backend-llm
```

> Notes:
> - For Windows users, add `--gpus=all` to use GPU capabilities (requires Docker with GPU support)
> - We strongly recommend mounting the `hf_cache` directory to avoid re-downloading Hugging Face models on container restart
> - Make sure to [set up and run the Qdrant server](https://qdrant.tech/documentation/guides/installation/#docker-and-docker-compose) before starting

### Method 3: Using Docker Compose

-Build the Docker image:
+A `docker-compose.yaml` file is included in the repository to simplify deployment with both the server and Qdrant database.

-```shell
-docker build -t sync/backend-llm .
-```
+1. Build the services:

-Run the container:
+```shell
+docker-compose build
+```

-```shell
-docker run -p 50051:50051 \
-  --env-file .env \
-  -v $(pwd)/path/to/configs:/app/configs/config.toml \
-  -v $(pwd)/path/to/hf_cache:/tmp/llama_index \
-  sync/backend-llm
-```
+2. Start the services:

-> 1. If you are using Windows, you can add `--gpus=all` to the `docker run` command. Ensure that your Docker installation supports GPU usage.
-> 2. It is strongly recommended to mount the `hf_cache` directory to a persistent volume to avoid re-downloading the Hugging Face models every time the container is started.
+```shell
+docker-compose up -d
+```

## Client Example

-You can refer to `scripts/client.py` for an example implementation of a client:
+To test the server, you can use the provided client example:

```shell
uv run scripts/client.py
```
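
Before invoking the client, it can help to confirm that the server port is actually accepting connections. A stdlib-only sketch; the host and port are assumptions matching the `50051` used in the examples above:

```python
import socket


def server_reachable(host: str = "localhost", port: int = 50051, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unresolvable hosts
        return False


if __name__ == "__main__":
    # 50051 matches the port published in the docker run / docker-compose examples
    print(server_reachable())
```

Note this only checks TCP reachability, not that the gRPC service itself is healthy.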
27 changes: 27 additions & 0 deletions docker-compose.yaml
@@ -0,0 +1,27 @@
# Docker Compose configuration for SYNC Server LLM
# This sets up both the backend-llm service and its required Qdrant vector database

services:
  # Main backend service for SYNC Server LLM
  backend-llm:
    build: sync-server-llm # Path to the directory with Dockerfile of sync-server-llm


**medium**

It might be helpful to specify the context for the build, e.g., `.` if the Dockerfile is in the same directory. This makes the compose file more explicit.

```yaml
    build:
      context: ./sync-server-llm # Path to the directory with Dockerfile of sync-server-llm
      dockerfile: Dockerfile
```

    restart: always
    ports:
      # Maps the container port to host (must match server.port in config.toml)
      - 50051:50051
    env_file:
      - .env
    environment:
      QDRANT_HOST: qdrant
    volumes:
      # Mount configuration and cache for persistence
      - ./configs/config.toml:/app/configs/config.toml
      - ./.hf_cache:/tmp/llama_index
Comment on lines +20 to +21


**medium**

Consider using environment variables for the volume paths to make the compose file more configurable.

```yaml
    volumes:
      # Mount configuration and cache for persistence
      - ${CONFIG_PATH}:/app/configs/config.toml
      - ${HF_CACHE_PATH}:/tmp/llama_index
```


  # Qdrant vector database service
  qdrant:
    image: qdrant/qdrant:latest

Copilot AI Mar 10, 2025


Using the 'latest' tag for the Qdrant image can lead to unpredictable behavior with future releases. It is recommended to pin to a specific version.

Suggested change:

```diff
-    image: qdrant/qdrant:latest
+    image: qdrant/qdrant:v1.0.0
```

    restart: always
    volumes:
      # Mount storage for persistence
      - ./qdrant_storage:/qdrant/storage


**medium**

Consider using an environment variable for the storage path to make the compose file more configurable.

```yaml
    volumes:
      # Mount storage for persistence
      - ${QDRANT_STORAGE}:/qdrant/storage
```