Skip to content

Commit 585b0db

Browse files
FroeMicclaude
andauthored
docs(score-analytics): update sampling, intervals, and metrics description (#2280)
- Add detailed sampling documentation explaining >100k threshold behavior - Update configurable intervals to match actual UI (5m, 30m, 1h, 3h, 1d, 7d, 30d, 90d, 1y) - Remove "industry-standard" qualifier from statistical metrics description 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Claude <[email protected]>
1 parent 8cede07 commit 585b0db

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

pages/docs/evaluation/evaluation-methods/score-analytics.mdx

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -19,7 +19,7 @@ Score Analytics complements Langfuse's [experiment SDK](/docs/evaluation/overvie
1919
- **Lightweight Setup**: No configuration needed—start analyzing scores immediately after they're ingested
2020
- **Quick Validation**: Compare scores from different sources (e.g., GPT-4 vs Gemini as judges) to measure agreement and ensure reliability
2121
- **Out-of-the-Box Insights**: Visualize distributions, track trends, and discover correlations without custom dashboard configuration
22-
- **Statistical Rigor**: Access industry-standard metrics like Pearson correlation, Cohen's Kappa, and F1 scores with built-in interpretation
22+
- **Statistical Rigor**: Access metrics like Pearson correlation, Cohen's Kappa, and F1 scores with built-in interpretation
2323

2424
For advanced analyses requiring custom metrics or complex comparisons, use the [experiment SDK](/docs/evaluation/overview) for deeper investigation.
2525

@@ -114,7 +114,7 @@ Score Analytics provides two views for understanding your data:
114114
### Time-Based Analysis
115115

116116
The Trend Over Time chart helps you monitor score patterns with:
117-
- **Configurable intervals**: From seconds to years (1s, 5s, 10s, 30s, 1m, 5m, 15m, 30m, 1h, 6h, 1d, 7d, 30d, 90d, 365d)
117+
- **Configurable intervals**: From minutes to years (5m, 30m, 1h, 3h, 1d, 7d, 30d, 90d, 1y)
118118
- **Automatic interval selection**: Smart defaults based on your selected time range
119119
- **Gap filling**: Missing time periods are filled with zeros for consistent visualization
120120
- **Average calculations**: Subtitle shows overall average for the time period
@@ -219,7 +219,7 @@ Score Analytics provides industry-standard statistical metrics with interpretati
219219
**Current Constraints**:
220220
- **Two scores maximum**: Currently supports comparing up to two scores at a time. For multi-way comparisons, perform pairwise analyses.
221221
- **Same data type only**: You can only compare scores of the same data type (numeric with numeric, categorical with categorical, boolean with boolean).
222-
- **Query truncation**: Very large queries (100k+ matched scores) may be truncated to maintain performance. Use time range or object type filters to narrow your analysis if needed.
222+
- **Sampling**: For performance optimization, queries expecting >100k scores (for either score1 or score2) automatically apply random sampling. This sampling approximates true random sampling and maintains statistical properties of your data. A visible indicator will show when sampling is active, and you can use time range or object type filters to narrow your analysis if you need the complete dataset.
223223
</Callout>
224224

225225
## Tips and Best Practices

0 commit comments

Comments
 (0)