Important technical question: Global versus Local models #705

philip-ndikum · 2023-07-26T01:17:03Z

philip-ndikum
Jul 26, 2023

Hi - in your documentation here - your have stated the following:

_NeuralForecast contains user-friendly implementations of neural forecasting models that allow for easy transition of computing capabilities (GPU/CPU), computation parallelization, and hyperparameter tuning.

All the NeuralForecast models are “global” because we train them with all the series from the input pd.DataFrame data Y_df, yet the optimization objective is, momentarily, “univariate” as it does not consider the interaction between the output predictions across time series. Like the StatsForecast library, core.NeuralForecast allows you to explore collections of models efficiently and contains functions for convenient wrangling of input and output pd.DataFrames predictions._

This is a bit unclear, so if you are prediction let's say for e-commerce and the unique_id is the clothing type - are you saying that the neural network is not considering the interactions for different unique_ids - so you couldn't do something like segment your data-set by t-shirts versus electronics for example and produce a different neural network weight so when you do inference the model has learned about those interactions given the unique_id?

Or in the lower level code are you filtering out on a unique_id level then training and doing inference for each unique_id - I'm a bit confused here.

Answered by cchallu

Aug 30, 2023

Hi @anonymous-engineering! Sorry for the late reply.

By global model, we mean that the same model (only one set of weights) is used for all the time series (distinguished by the unique_id column) in your dataset. The actual values of the unique_id column are not used at all by the models to learn any specific characteristic for a group of ids (like the difference between t-shirts and electronics). If you want separate models for each group, you need to set separate pipelines and select the relevant set of time series in each dataset (not recommended in general for deep-learning models). The best solution to learning different dynamics for each group is to add static exogenous variables, f…

View full answer

cchallu · 2023-08-30T17:26:23Z

cchallu
Aug 30, 2023
Maintainer

Hi @anonymous-engineering! Sorry for the late reply.

By global model, we mean that the same model (only one set of weights) is used for all the time series (distinguished by the unique_id column) in your dataset. The actual values of the unique_id column are not used at all by the models to learn any specific characteristic for a group of ids (like the difference between t-shirts and electronics). If you want separate models for each group, you need to set separate pipelines and select the relevant set of time series in each dataset (not recommended in general for deep-learning models). The best solution to learning different dynamics for each group is to add static exogenous variables, for example, one-hot encoding the category.

0 replies

philip-ndikum · 2023-08-30T17:32:29Z

philip-ndikum
Aug 30, 2023
Author

Awesome yes I figured out a few weeks back looking at the underlying code. Thank you so much!

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Important technical question: Global versus Local models #705

Uh oh!

{{title}}

Uh oh!

Replies: 2 comments

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Important technical question: Global versus Local models #705

Uh oh!

philip-ndikum Jul 26, 2023

Replies: 2 comments

Uh oh!

cchallu Aug 30, 2023 Maintainer

Uh oh!

philip-ndikum Aug 30, 2023 Author

philip-ndikum
Jul 26, 2023

cchallu
Aug 30, 2023
Maintainer

philip-ndikum
Aug 30, 2023
Author