Skip to content

Commit fd3bf0f

Browse files
authored
Merge pull request #183376 from nibaccam/forecast-valid
AutoML | forecasting notes
2 parents de8765a + 5b3b4f8 commit fd3bf0f

File tree

1 file changed

+8
-0
lines changed

1 file changed

+8
-0
lines changed

articles/machine-learning/how-to-configure-cross-validation-data-splits.md

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -44,6 +44,9 @@ For this article you need,
4444

4545
Use the [AutoMLConfig](/python/api/azureml-train-automl-client/azureml.train.automl.automlconfig.automlconfig) object to define your experiment and training settings. In the following code snippet, notice that only the required parameters are defined, that is the parameters for `n_cross_validations` or `validation_data` are **not** included.
4646

47+
> [!NOTE]
48+
> The default data splits and cross-validation are not supported in forecasting scenarios.
49+
4750
```python
4851
data = "https://automlsamplenotebookdata.blob.core.windows.net/automl-sample-notebook-data/creditcard.csv"
4952

@@ -121,6 +124,7 @@ To perform k-fold cross-validation, include the `n_cross_validations` parameter
121124

122125
> [!NOTE]
123126
> The `n_cross_validations` parameter is not supported in classification scenarios that use deep neural networks.
127+
> For forecasting scenarios, see how cross validation is applied in [Set up AutoML to train a time-series forecasting model](how-to-auto-train-forecast.md#training-and-validation-data).
124128
125129
In the following code, five folds for cross-validation are defined. Hence, five different trainings, each training using 4/5 of the data, and each validation using 1/5 of the data with a different holdout fold each time.
126130

@@ -169,6 +173,10 @@ automl_config = AutoMLConfig(compute_target = aml_remote_compute,
169173

170174
You can also provide your own cross-validation (CV) data folds. This is considered a more advanced scenario because you are specifying which columns to split and use for validation. Include custom CV split columns in your training data, and specify which columns by populating the column names in the `cv_split_column_names` parameter. Each column represents one cross-validation split, and is filled with integer values 1 or 0--where 1 indicates the row should be used for training and 0 indicates the row should be used for validation.
171175

176+
> [!NOTE]
177+
> The `cv_split_column_names` parameter is not supported in forecasting scenarios.
178+
179+
172180
The following code snippet contains bank marketing data with two CV split columns 'cv1' and 'cv2'.
173181

174182
```python

0 commit comments

Comments
 (0)