Skip to content

Commit ef42113

Browse files
committed
changelog
1 parent 8c45b4b commit ef42113

1 file changed

Lines changed: 12 additions & 0 deletions

File tree

docs/changes/53.feature.md

Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -23,3 +23,15 @@
2323
- `plot_training_evaluation.py`: Energy resolution and residual distribution visualization
2424
- **Comprehensive test suites**: 20 new tests covering residual computation, standardization, energy weighting, apply inference
2525
- **Robust error handling**: Clear messages for missing scalers; guaranteed row-count preservation in apply pipeline
26+
27+
### Enhanced Diagnostic Pipeline
28+
29+
- **Generalization-gap metrics cached during training**: Train/test RMSE, gap %, and generalization ratio computed and cached in the model artifact, enabling fast overfitting assessment without recomputation
30+
- **Residual normality statistics cached during training**: Normality tests (Kolmogorov-Smirnov, Anderson-Darling), distribution shape metrics (skewness, kurtosis, Q-Q R²), and outlier counts computed once during training and cached for fast retrieval
31+
- **Diagnostic reconstruction from model metadata**: All regression diagnostics (generalization-gap, partial-dependence, residual-normality) now reconstruct the held-out test split from stored model metadata + input file list, enabling reproducibility and offline analysis without CSV exports
32+
- **Cache-first diagnostic workflows**: Diagnostic scripts load cached metrics first (fast) with graceful fallback to reconstruction if cache unavailable (backward compatible with older models)
33+
- **CLI entry points for all diagnostics**:
34+
- `eventdisplay-ml-diagnostic-generalization-gap`: Quantify overfitting via train/test RMSE comparison
35+
- `eventdisplay-ml-diagnostic-partial-dependence`: Validate model captures physics via partial dependence curves
36+
- `eventdisplay-ml-diagnostic-residual-normality`: Validate residual normality and detect outliers
37+
- **Fixed sklearn FutureWarning**: Partial dependence plots convert feature data to float64 to avoid integer dtype warnings in newer scikit-learn versions

0 commit comments

Comments
 (0)