Skip to content

📝 IQR outlier detection at the start of a timeseries#49

Merged
sambra95 merged 14 commits intomainfrom
docs_preprocess
Mar 1, 2026
Merged

📝 IQR outlier detection at the start of a timeseries#49
sambra95 merged 14 commits intomainfrom
docs_preprocess

Conversation

@enryH
Copy link
Collaborator

@enryH enryH commented Feb 28, 2026

No description provided.

@enryH enryH requested a review from sambra95 February 28, 2026 12:07
Copy link
Collaborator

@sambra95 sambra95 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I tested a set of different outlier detection methods (see outlier_detection.ipynb) on some real growth curves that had some outliers and I also added several synthetic outliers (Test data and labels are stored in test_labels.csv and test_traces.csv).

I refactored the outlier detection logic to have a single detect_outlier function that then takes a method argument to decide which method to use. I put you iqr based method (method=iqr) and two other promising methods (ecod and hampel) as options. ECOD means another dependency but on the other hand it is both much more accurate and >50x faster than original method (see my Teams message)

I have updated the tutorial accordingly.

@sambra95 sambra95 self-requested a review March 1, 2026 11:37
@sambra95 sambra95 merged commit 2fbaeb0 into main Mar 1, 2026
7 checks passed
@sambra95 sambra95 deleted the docs_preprocess branch March 1, 2026 11:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants