fix: apply markdown linting fixes to demo documentation

yossiovadia · yossiovadia · commit af87950a96b7 · 2025-10-15T15:28:18.000-07:00
Signed-off-by: Yossi Ovadia <yovadia@redhat.com>
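
The hunks below fall into three patterns: a doubled blank line is removed, a blank line is added before a list, and a blank line is added around a fenced code block. As a rough illustration of what such checks look for, here is a minimal Python sketch. The commit does not say which linter was used; the rule labels (MD012, MD031, MD032) are borrowed from markdownlint as an assumption, and `check_markdown` is a hypothetical helper, far simpler than a real linter.

```python
import re

# Simplified patterns; real linters parse Markdown far more carefully.
LIST_ITEM = re.compile(r"^\s*(?:[-*+]|\d+\.)\s+")
FENCE = re.compile(r"^\s*(?:```|~~~)")


def check_markdown(text):
    """Return (line_number, rule) pairs for a small subset of blank-line rules.

    Rule labels are assumed to match markdownlint's numbering:
      MD012 - multiple consecutive blank lines
      MD031 - fenced code block not preceded by a blank line
      MD032 - list not preceded by a blank line
    """
    findings = []
    lines = text.split("\n")
    in_fence = False
    for i, line in enumerate(lines):
        prev = lines[i - 1] if i > 0 else ""
        if FENCE.match(line):
            # Only the opening fence is checked in this sketch.
            if not in_fence and prev.strip():
                findings.append((i + 1, "MD031"))
            in_fence = not in_fence
            continue
        if in_fence:
            continue  # Content inside a code fence is not linted.
        if not line.strip() and i > 0 and not prev.strip():
            findings.append((i + 1, "MD012"))
        elif LIST_ITEM.match(line) and prev.strip() and not LIST_ITEM.match(prev):
            findings.append((i + 1, "MD032"))
    return findings


sample = "**What it shows:**\n- item one\n- item two\n\n\n## Heading\n"
print(check_markdown(sample))
```

In the sample, the list directly under the bold label and the second of the two blank lines are reported, which mirrors the `+` and `-` lines in the hunks that follow.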
diff --git a/deploy/openshift/demo/CATEGORY-MODEL-MAPPING.md b/deploy/openshift/demo/CATEGORY-MODEL-MAPPING.md
@@ -51,7 +51,6 @@ These prompts have **100% classification accuracy** and route as follows:
 | **Health** | "How to maintain a healthy lifestyle?" | Model-B | ~0.221 |
 | **Health** | "What is a balanced diet?" | Model-B | ~0.268 |
 
-
 ---
 
 ## Reasoning Mode (Chain-of-Thought)
@@ -70,7 +69,6 @@ Categories with **reasoning enabled** use extended thinking for complex problems
 - **Fallback Category:** "other" (score: 0.7)
 - **Unmatched queries** route to Model-A with the "other" category system prompt
 
-
 ### Key Parameters:
 
 - **name:** Category identifier
@@ -81,7 +79,6 @@ Categories with **reasoning enabled** use extended thinking for complex problems
 
 ---
 
-
 ## Confidence Scores Explained
 
 **Why are confidence scores low (0.2-0.4)?**
@@ -92,6 +89,7 @@ Categories with **reasoning enabled** use extended thinking for complex problems
 4. **Highest score wins** - 0.326 for "math" means it beat all other 13 categories
 
 **What's important:**
+
 - ✅ Classification is **consistent** across multiple runs
 - ✅ Same prompt → same category every time
 - ✅ Confidence is **relative** to other categories, not absolute certainty
diff --git a/deploy/openshift/demo/DEMO-README.md b/deploy/openshift/demo/DEMO-README.md
@@ -13,6 +13,7 @@ Shows real-time classification, routing, and security decisions:
 ```
 
 **What it shows:**
+
 - 📨 **Incoming requests** with user prompts
 - 🛡️ **Security checks** (jailbreak detection)
 - 🔍 **Classification** (category detection with confidence)
@@ -33,17 +34,20 @@ python3 deploy/openshift/demo/demo-semantic-router.py
 ```
 
 **Features:**
+
 1. **Single Classification** - Tests random prompt from golden set
 2. **All Classifications** - Tests all 10 golden prompts
 3. **PII Detection Test** - Tests personal information filtering
 4. **Jailbreak Detection Test** - Tests security filtering
 5. **Run All Tests** - Executes all tests sequentially
 
 **Requirements:**
+
 - ✅ Must be logged into OpenShift (`oc login`)
 - URLs are discovered automatically from routes
 
 **What it does:**
+
 - Goes through Envoy (same path as OpenWebUI)
 - Shows routing decisions and response previews
 - **Appears in Grafana dashboard!**
@@ -76,9 +80,11 @@ python3 deploy/openshift/demo/demo-semantic-router.py
    - Show the architecture diagram
 
 2. **Run interactive demo** (Terminal 2)
+
    ```bash
    python3 deploy/openshift/demo/demo-semantic-router.py
    ```
+
    Choose option 2 (All Classifications)
 
 3. **Point to live logs** (Terminal 1)
@@ -102,26 +108,31 @@ python3 deploy/openshift/demo/demo-semantic-router.py
 ## Key Talking Points
 
 ### Classification Accuracy
+
 - **10 golden prompts** with 100% accuracy
 - Categories: Chemistry, History, Psychology, Health, Math
 - Shows consistent classification behavior
 
 ### Security Features
+
 - **Jailbreak detection** on every request
 - Shows "BENIGN" for safe requests
 - Confidence scores displayed
 
 ### Smart Routing
+
 - Automatic model selection based on content
 - Load balancing across Model-A and Model-B
 - Routing decisions visible in logs
 
 ### Performance
+
 - **Semantic caching** reduces latency
 - Cache hits shown in logs with similarity scores
 - Sub-second response times
 
 ### Observability
+
 - Real-time logs with structured JSON
 - Grafana metrics and dashboards
 - Request tracing and debugging
@@ -131,6 +142,7 @@ python3 deploy/openshift/demo/demo-semantic-router.py
 ## Troubleshooting
 
 ### Log viewer shows no output
+
 ```bash
 # Check if semantic-router pod is running
 oc get pods -n vllm-semantic-router-system | grep semantic-router
@@ -140,6 +152,7 @@ oc logs -n vllm-semantic-router-system deployment/semantic-router --tail=20
 ```
 
 ### Classification test fails
+
 ```bash
 # Verify Envoy route is accessible
 curl http://envoy-http-vllm-semantic-router-system.apps.cluster-pbd96.pbd96.sandbox5333.opentlc.com/v1/models
@@ -149,6 +162,7 @@ oc get pods -n vllm-semantic-router-system
 ```
 
 ### Grafana doesn't show metrics
+
 - Wait 15-30 seconds for metrics to appear
 - Refresh the dashboard
 - Check the time range (last 5 minutes)
@@ -158,13 +172,15 @@ oc get pods -n vllm-semantic-router-system
 ## Cache Management
 
 ### Check Cache Status
+
 ```bash
 ./deploy/openshift/demo/cache-management.sh status
 ```
 
 Shows recent cache activity and cached queries.
 
 ### Clear Cache (for demo)
+
 ```bash
 ./deploy/openshift/demo/cache-management.sh clear
 ```
@@ -176,22 +192,27 @@ Restarts semantic-router deployment to clear in-memory cache (~30 seconds).
 **Workflow to show caching in action:**
 
 1. Clear the cache:
+
    ```bash
    ./deploy/openshift/demo/cache-management.sh clear
    ```
 
 2. Run classification test (first time - no cache):
+
    ```bash
    python3 deploy/openshift/demo/demo-semantic-router.py
    ```
+
    Choose option 2 (All Classifications)
    - Processing time: ~3-4 seconds per query
    - Logs show queries going to model
 
 3. Run classification test again (second time - with cache):
+
    ```bash
    python3 deploy/openshift/demo/demo-semantic-router.py
    ```
+
    Choose option 2 (All Classifications) again
    - Processing time: ~400ms per query (10x faster!)
    - Logs show "💾 CACHE HIT" for all queries