 - **Vector store size impact varies by model**: GPT-4.1 series shows minimal latency impact across vector store sizes, while GPT-5 series shows significant increases.
@@ -239,10 +233,6 @@ In addition to the above evaluations which use a 3 MB sized vector store, the ha
|| Extra Large (105 MB) | 0.636 | 0.528 | 0.528 | 0.528 |
**Key Insights:**
 - **Best Performance**: gpt-5-mini consistently achieves the highest ROC AUC scores across all vector store sizes (0.909-0.939)
-- **Best Latency**: gpt-4.1-nano shows the most consistent and lowest latency across all scales (4,171-4,809ms P50) but shows poor performance
+- **Best Latency**: gpt-4.1-mini shows the most consistent and lowest latency across all scales (6,661-7,374ms P50) while maintaining solid accuracy
 - **Most Stable**: gpt-4.1-mini (default) maintains relatively stable performance across vector store sizes with good accuracy-latency balance
 - **Scale Sensitivity**: gpt-5 shows the most variability in performance across vector store sizes, with performance dropping significantly at larger scales
 - **Performance vs Scale**: Most models show decreasing performance as vector store size increases, with gpt-5-mini being the most resilient
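
Reviewer note: the insights above compare models on two metrics, ROC AUC and P50 latency. As a rough illustration of what those numbers mean, the sketch below computes both from raw per-request data using only the Python standard library. The function names `p50_latency_ms` and `roc_auc` and the sample values are hypothetical and are not taken from the benchmark harness in this repository, which may compute these metrics differently.

```python
from statistics import median


def p50_latency_ms(latencies_ms):
    """Return the median (P50) of per-request latencies in milliseconds."""
    return median(latencies_ms)


def roc_auc(labels, scores):
    """Return ROC AUC via pairwise comparison of positives and negatives.

    This is the probability that a randomly chosen positive example
    receives a higher score than a randomly chosen negative example,
    with ties counted as half a win.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative label")

    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Illustrative values only (not measurements from this benchmark).
sample_latencies = [100, 300, 200]
sample_labels = [1, 0, 1, 0]
sample_scores = [0.9, 0.8, 0.7, 0.1]

print(p50_latency_ms(sample_latencies))       # 200
print(roc_auc(sample_labels, sample_scores))  # 0.75
```

The pairwise form is quadratic in the number of examples, which is fine for a small evaluation set; a rank-based implementation would be preferable for large ones.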