You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The `ws` object is used throughout the rest of the code in this tutorial.
172
172
173
-
## Split the data into train and test sets
173
+
## Split data into train and test sets
174
174
175
175
Split the data into training and test sets by using the `train_test_split` function in the _scikit-learn_ library. This function segregates the data into the x (**features**) data set for model training and the y (**values to predict**) data set for testing.
The purpose of this step is to prepare data points to test the finished model that aren't used to train the model. These points are used to measure true accuracy. A well-trained model is one that can make accurate predictions from unseen data. You now have data prepared for autotraining a machine learning model.
188
188
189
-
## Automatically train a model
189
+
## Automatically train model
190
190
191
191
To automatically train a model, take the following steps:
> Automated ML pre-processing steps (feature normalization, handling missing data, converting text to numeric, and so on) become part of the underlying model. When you use the model for predictions, the same pre-processing steps applied during training are applied to your input data automatically.
241
241
242
-
### Train the automatic regression model
242
+
### Train automatic regression model
243
243
244
244
Create an experiment object in your workspace. An experiment acts as a container for your individual jobs. Pass the defined `automl_config` object to the experiment, and set the output to _True_ to view progress during the job.
245
245
@@ -294,7 +294,7 @@ BEST: The best observed score thus far.
294
294
19 StackEnsemble 0:00:27 0.9463 0.9471
295
295
```
296
296
297
-
## Explore the results
297
+
## Explore results
298
298
299
299
Explore the results of automatic training with a [Jupyter widget](/python/api/azureml-widgets/azureml.widgets). The widget allows you to see a graph and table of all individual job iterations, along with training accuracy metrics and metadata. Additionally, you can filter on different accuracy metrics than your primary metric with the dropdown selector.
300
300
@@ -313,7 +313,7 @@ The plot chart for the Jupyter widget:
313
313
314
314
:::image type="content" source="./media/how-to-auto-train-models/automl-chart-output.png" alt-text="Screenshot that shows the Jupyter widget plot diagram in Azure Machine Learning studio." lightbox="./media/how-to-auto-train-models/automl-chart-output.png":::
315
315
316
-
### Retrieve the best model
316
+
### Retrieve best model
317
317
318
318
The following code lets you select the best model from your iterations. The `get_output` function returns the best run and the fitted model for the last fit invocation. By using the overloads on the `get_output` function, you can retrieve the best run and fitted model for any logged metric or a particular iteration.
319
319
@@ -323,7 +323,7 @@ print(best_run)
323
323
print(fitted_model)
324
324
```
325
325
326
-
### Test the best model accuracy
326
+
### Test best model accuracy
327
327
328
328
Use the best model to run predictions on the test data set to predict taxi fares. The `predict` function uses the best model and predicts the values of y, **trip cost**, from the `x_test` data set.
0 commit comments