Merge branch 'master' of https://github.com/ahhyoushh/FileSense

ahhyoushh · ahhyoushh · commit 156835dd9995 · 2025-12-15T19:48:11.000+05:30
diff --git a/wiki/metrics.md b/wiki/metrics.md
@@ -118,7 +118,7 @@ The RL agent has demonstrated that **Policy C** (Generation Disabled) provides o
 
 ### 2.5 Reference Model Comparison
 
-We conducted a head-to-head comparison of three embedding models to determine the optimal balance between speed and accuracy for the FileSense pipeline.
+I conducted a head-to-head comparison of three embedding models to determine the optimal balance between speed and accuracy for the FileSense pipeline.
 
 **Models Tested:**
 1.  **all-mpnet-base-v2** (110M params) - *The previous gold standard*
@@ -139,7 +139,7 @@ We conducted a head-to-head comparison of three embedding models to determine th
 *   **Robustness:** `bge-base` solved all edge cases where the other models failed (e.g., noisy PDF text extraction in `Ray optics.pdf` and `chem work.pdf`).
 *   **Confidence:** The similarity distribution shifted significantly higher (0.60+), reducing the system's reliance on fallback mechanisms.
 
-**Conclusion:** We have officially switched the default model to **BAAI/bge-base-en-v1.5** as of Dec 2025.
+**Conclusion:** switched the default model to **BAAI/bge-base-en-v1.5** as of Dec 2025.
 
 ---
 
@@ -238,7 +238,7 @@ The consistency of keyword superiority across NCERT and STEM datasets (academic
 1. **Dataset Size:** Evaluation limited to <100 files per dataset
 2. **Domain Coverage:** Primarily academic content
 3. **Language:** English-only evaluation
-4. **Model:** Single embedding model tested (all-mpnet-base-v2)
+
 
 ---