FEAT: Training window filtering by marcopeix · Pull Request #1344 · Nixtla/neuralforecast

marcopeix · 2025-06-19T20:32:04Z

Currently, we create the maximum number of training windows, meaning that we might have windows with only 1 available insample data point and 1 available outsample data point.

These are technically low quality windows.

This PR adds the parameter available_sample_fractions to control how many available insample and outsample data points should be available as a fraction of input size and horizon for insample and outsample respectively.

review-notebook-app · 2025-06-19T20:32:09Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

marcopeix · 2025-06-19T20:53:21Z

It's basically a rework of #1059 . Many thanks to @jasminerienecker for the initial idea!

elephaint

Nice work, few comments!

nbs/common.base_model.ipynb

nbs/models.autoformer.ipynb

…nd TSMixer

elephaint

Thanks, good work! 2 things:

It needs a proper test (the current one has no value)
Minor adjustment in the implementation is required for multivariate models I think (which becomes apparent in a test)

nbs/common.base_model.ipynb

elephaint

Great work! Good for me if you agree with the final changes I made (cosmetic and I changed the tests + added explanation to the tests)

elephaint

@marcopeix good to go once you verify/check my explanations in the test

nbs/core.ipynb

Incorrect

Antoine-Schwartz · 2025-07-24T12:10:13Z

Sampling quality is a really interesting subject. Thank you for taking the time to add this first option @marcopeix !

By the way, it would be great to have a dedicated documentation section.
Sampling with time series, and particularly the neuralforecast implementation with the mask system is not obvious to understand.
And if you also look at the interactions with other options such as start_padding_enable, you can quickly get lost :)

marcopeix added 2 commits June 19, 2025 15:19

Add new parameter to filter windows based on available data points

f6b7707

Add test for filtering training windows

015f146

Hide testing cell

b35987c

marcopeix marked this pull request as ready for review June 20, 2025 13:47

marcopeix requested a review from elephaint June 20, 2025 13:47

elephaint requested changes Jul 4, 2025

View reviewed changes

elephaint and others added 6 commits July 4, 2025 17:08

Merge branch 'main' into feature/training-window-filtering

a53a755

Olivier comments

631ca83

Olivier comments

9084f70

Adjust parameter name for each model

aef4c96

Add test for training_data_availability_threshold using LSTM, NHITS a…

15e6a5f

…nd TSMixer

I missed TimeMixer

dfb7fc2

marcopeix requested a review from elephaint July 9, 2025 16:16

marcopeix added 2 commits July 10, 2025 10:04

Merge branch 'main' into feature/training-window-filtering

2019ca5

Merge branch 'main' into feature/training-window-filtering

2153680

elephaint requested changes Jul 15, 2025

View reviewed changes

nbs/common.base_model.ipynb Show resolved Hide resolved

nbs/common.base_model.ipynb Outdated Show resolved Hide resolved

nbs/common.base_model.ipynb Outdated Show resolved Hide resolved

marcopeix added 6 commits July 15, 2025 16:18

Change test and multiply by n_series

52adad6

remove mps fallback for local tests

dd08474

Fix test

56a6631

Read data again to fix test

04f0ef9

Debug test

edcd73c

Make test less strict

39a6d52

marcopeix requested a review from elephaint July 16, 2025 17:14

change_tests

cbeeede

elephaint previously approved these changes Jul 17, 2025

View reviewed changes

elephaint reviewed Jul 17, 2025

View reviewed changes

nbs/core.ipynb Show resolved Hide resolved

nbs/core.ipynb Show resolved Hide resolved

elephaint self-requested a review July 17, 2025 08:43

Adjust numbers in case where assert fails

7a76cd4

elephaint approved these changes Jul 17, 2025

View reviewed changes

marcopeix merged commit f0eeab6 into main Jul 17, 2025
18 checks passed

marcopeix deleted the feature/training-window-filtering branch July 17, 2025 14:56

marcopeix mentioned this pull request Jul 18, 2025

add option to remove windows with poor data quality #1059

Closed

W057 mentioned this pull request Feb 12, 2026

Q/feature req: how to only train selected days rows? #1411

Open

Conversation

marcopeix commented Jun 19, 2025

Uh oh!

review-notebook-app bot commented Jun 19, 2025

Uh oh!

marcopeix commented Jun 19, 2025

Uh oh!

elephaint left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

elephaint left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

elephaint left a comment

Choose a reason for hiding this comment

Uh oh!

elephaint left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Antoine-Schwartz commented Jul 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants