You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: wiki/metrics.md
+3-3Lines changed: 3 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -118,7 +118,7 @@ The RL agent has demonstrated that **Policy C** (Generation Disabled) provides o
118
118
119
119
### 2.5 Reference Model Comparison
120
120
121
-
We conducted a head-to-head comparison of three embedding models to determine the optimal balance between speed and accuracy for the FileSense pipeline.
121
+
I conducted a head-to-head comparison of three embedding models to determine the optimal balance between speed and accuracy for the FileSense pipeline.
@@ -139,7 +139,7 @@ We conducted a head-to-head comparison of three embedding models to determine th
139
139
***Robustness:**`bge-base` solved all edge cases where the other models failed (e.g., noisy PDF text extraction in `Ray optics.pdf` and `chem work.pdf`).
140
140
***Confidence:** The similarity distribution shifted significantly higher (0.60+), reducing the system's reliance on fallback mechanisms.
141
141
142
-
**Conclusion:**We have officially switched the default model to **BAAI/bge-base-en-v1.5** as of Dec 2025.
142
+
**Conclusion:** switched the default model to **BAAI/bge-base-en-v1.5** as of Dec 2025.
143
143
144
144
---
145
145
@@ -238,7 +238,7 @@ The consistency of keyword superiority across NCERT and STEM datasets (academic
238
238
1.**Dataset Size:** Evaluation limited to <100 files per dataset
239
239
2.**Domain Coverage:** Primarily academic content
240
240
3.**Language:** English-only evaluation
241
-
4.**Model:** Single embedding model tested (all-mpnet-base-v2)
0 commit comments