# Milvus Storage Module for vCon Server

This storage module enables storing vCons in a Milvus vector database for semantic search capabilities.

## Description

The Milvus storage module:
- Extracts text content from vCons (transcripts, summaries, metadata, etc.)
- Generates embeddings using OpenAI's embedding models
- Stores the embeddings in Milvus along with related metadata
- Enables semantic search across vCon content

## Requirements

- Milvus 2.0+ running and accessible
- OpenAI API key for generating embeddings
- Additional Python packages:
  ```
  pymilvus>=2.3.0
  openai>=1.0.0
  ```
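Both packages can be installed with pip, using the minimum versions listed above:

```shell
pip install "pymilvus>=2.3.0" "openai>=1.0.0"
```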
| 22 | + |
## Configuration

Add the following to your config.yml file:

```yaml
storages:
  milvus:
    module: storage.milvus
    options:
      # Connection settings
      host: "localhost"                    # Milvus server host
      port: "19530"                        # Milvus server port
      collection_name: "vcons"             # Name of the collection in Milvus

      # Embedding settings
      embedding_model: "text-embedding-3-small"  # OpenAI embedding model
      embedding_dim: 1536                  # Dimensions for the chosen model
      api_key: "your-openai-api-key"       # Your OpenAI API key
      organization: "your-org-id"          # Optional: your OpenAI organization ID

      # Operation settings
      create_collection_if_missing: true   # Create the collection if it doesn't exist
      skip_if_exists: true                 # Skip storing vCons that already exist

      # Vector index settings (optional; defaults shown)
      index_type: "IVF_FLAT"               # Vector index type
      metric_type: "L2"                    # Distance metric type
      nlist: 128                           # Number of clusters for IVF indexes
```

### Vector Index Types

The module supports several vector index types, each with its own parameters:

#### IVF_FLAT (Default)
Good balance of search accuracy and speed. Uses more storage but gives exact results within each cluster.

```yaml
index_type: "IVF_FLAT"
metric_type: "L2"  # Or "IP" for inner product, or "COSINE"
nlist: 128         # Number of clusters; higher values speed up search but can reduce recall
```

#### IVF_SQ8
Similar to IVF_FLAT but applies 8-bit scalar quantization to reduce memory usage. Good for large datasets.

```yaml
index_type: "IVF_SQ8"
metric_type: "L2"
nlist: 128
```

#### IVF_PQ
Product quantization for maximum memory savings. Sacrifices some accuracy for a much smaller index.

```yaml
index_type: "IVF_PQ"
metric_type: "L2"
nlist: 128
pq_m: 8      # Number of sub-quantizers
pq_nbits: 8  # Bit depth per quantizer
```

#### HNSW
Hierarchical Navigable Small World graph index. Very fast searches, especially on smaller datasets.

```yaml
index_type: "HNSW"
metric_type: "L2"
m: 16                 # Number of edges per node
ef_construction: 200  # Size of the dynamic candidate list during construction
```

#### FLAT
The simplest index: a brute-force comparison against every vector. Most accurate, but slowest on large datasets.

```yaml
index_type: "FLAT"
metric_type: "L2"
```

### Distance Metrics

- `L2`: Euclidean distance (default). Works well for most embeddings, including OpenAI embeddings.
- `IP`: Inner product. Use when vectors are normalized and you want to measure closeness.
- `COSINE`: Cosine similarity. Measures the angle between vectors regardless of magnitude.

Note that OpenAI embeddings are normalized to unit length, so `L2`, `IP`, and `COSINE` all produce the same nearest-neighbor ranking for them.

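For unit-length vectors the metrics are interchangeable because ‖a − b‖² = 2 − 2⟨a, b⟩ and cosine similarity equals the inner product. A quick NumPy sketch of this equivalence (illustrative only, not part of the module):

```python
import numpy as np

def normalize(v):
    # Scale rows (or a single vector) to unit length
    return v / np.linalg.norm(v, axis=-1, keepdims=True)

rng = np.random.default_rng(0)
query = normalize(rng.standard_normal(8))
vectors = normalize(rng.standard_normal((5, 8)))

l2 = np.linalg.norm(vectors - query, axis=1)  # smaller = closer
ip = vectors @ query                          # larger = closer
cos = ip                                      # cosine == inner product for unit vectors

# All three metrics order the candidates identically
assert np.array_equal(np.argsort(l2), np.argsort(-ip))
assert np.allclose(l2**2, 2 - 2 * ip)
```

This only holds for normalized vectors; if you store raw, un-normalized embeddings from another source, choose the metric deliberately.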
## Searching vCons in Milvus

While not part of the storage module itself, you can search vCons in Milvus using the pymilvus client:

```python
from pymilvus import connections, Collection
from openai import OpenAI

# Connect to Milvus
connections.connect(host="localhost", port="19530")

# Load the collection into memory for searching
collection = Collection("vcons")
collection.load()

# Get an embedding for the search query (same model the storage module uses)
client = OpenAI(api_key="your-api-key")
response = client.embeddings.create(input="your search query", model="text-embedding-3-small")
query_embedding = response.data[0].embedding

# Search parameters
search_params = {
    "metric_type": "L2",
    "params": {"nprobe": 10}  # Number of clusters to probe for IVF indexes
}

# Search vCons
results = collection.search(
    data=[query_embedding],
    anns_field="embedding",
    param=search_params,
    limit=5,
    output_fields=["vcon_uuid", "party_id", "text", "metadata_title"]
)

# Process results
for hits in results:
    for hit in hits:
        print(f"vCon UUID: {hit.entity.get('vcon_uuid')}")
        print(f"Score: {hit.score}")
        print(f"Title: {hit.entity.get('metadata_title')}")
```

## Notes

- Vector dimensions must match the embedding model used (1536 for text-embedding-3-small)
- For larger vCon datasets, consider tuning the Milvus index parameters for better performance
- Collections created by this module include an L2 vector index for similarity search
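If you later change the index settings, the index can be rebuilt outside the module with the pymilvus client. A sketch, assuming the vector field is named `embedding` (as in the search example above) and Milvus is running locally (requires a live server, so it won't run standalone):

```python
from pymilvus import connections, Collection

# Connect to the same Milvus instance the storage module uses
connections.connect(host="localhost", port="19530")

collection = Collection("vcons")

# The collection must be released before its index can be dropped
collection.release()
collection.drop_index()

# Rebuild an IVF_FLAT index matching the configuration defaults above
collection.create_index(
    field_name="embedding",
    index_params={
        "index_type": "IVF_FLAT",
        "metric_type": "L2",
        "params": {"nlist": 128},
    },
)

# Reload the collection so it is searchable again
collection.load()
```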