Skip to content

Commit e28c489

Browse files
authored
Merge pull request #107040 from nibaccam/automl-refresh
Auto ML | freshness review
2 parents efa287d + 972f680 commit e28c489

6 files changed

+47
-46
lines changed

articles/machine-learning/how-to-auto-train-forecast.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@ ms.service: machine-learning
99
ms.subservice: core
1010
ms.reviewer: trbye
1111
ms.topic: conceptual
12-
ms.date: 11/04/2019
12+
ms.date: 03/09/2020
1313
---
1414

1515
# Auto-train a time-series forecast model
@@ -23,7 +23,7 @@ In this article, you learn how to train a time-series forecasting regression mod
2323

2424
> [!VIDEO https://www.microsoft.com/videoplayer/embed/RE2X1GW]
2525
26-
You can use automated ML to combine techniques and approaches and get a recommended, high-quality time-series forecast. An automated time-series experiment is treated as a multivariate regression problem. Past time-series values are pivoted to become additional dimensions for the regressor together with other predictors.
26+
You can use automated ML to combine techniques and approaches and get a recommended, high-quality time-series forecast. An automated time-series experiment is treated as a multivariate regression problem. Past time-series values are "pivoted" to become additional dimensions for the regressor together with other predictors.
2727

2828
This approach, unlike classical time series methods, has an advantage of naturally incorporating multiple contextual variables and their relationship to one another during training. In real-world forecasting applications, multiple factors can influence a forecast. For example, when forecasting sales, interactions of historical trends, exchange rate and price all jointly drive the sales outcome. A further benefit is that all recent innovations in regression models apply immediately to forecasting.
2929

articles/machine-learning/how-to-auto-train-remote.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ ms.service: machine-learning
1010
ms.subservice: core
1111
ms.workload: data-services
1212
ms.topic: conceptual
13-
ms.date: 11/04/2019
13+
ms.date: 03/09/2020
1414

1515
#Customer intent: As a professional data scientist, I can use automated machine learning (automated ML) functionality to build a model on an Azure Machine Learning remote compute target.
1616
---

articles/machine-learning/how-to-configure-auto-train.md

Lines changed: 7 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@ services: machine-learning
99
ms.service: machine-learning
1010
ms.subservice: core
1111
ms.topic: conceptual
12-
ms.date: 11/04/2019
12+
ms.date: 03/09/2020
1313
ms.custom: seodec18
1414
---
1515

@@ -125,8 +125,8 @@ Use custom validation dataset if random split is not acceptable, usually time se
125125
## Compute to run experiment
126126

127127
Next determine where the model will be trained. An automated machine learning training experiment can run on the following compute options:
128-
* Your local machine such as a local desktop or laptop – Generally when you have small dataset and you are still in the exploration stage.
129-
* A remote machine in the cloud – [Azure Machine Learning Managed Compute](concept-compute-target.md#amlcompute) is a managed service that enables the ability to train machine learning models on clusters of Azure virtual machines.
128+
* Your local machine such as a local desktop or laptop – Generally when you have small dataset and you are still in the exploration stage.
129+
* A remote machine in the cloud – [Azure Machine Learning Managed Compute](concept-compute-target.md#amlcompute) is a managed service that enables the ability to train machine learning models on clusters of Azure virtual machines.
130130

131131
See this [GitHub site](https://github.com/Azure/MachineLearningNotebooks/tree/master/how-to-use-azureml/automated-machine-learning) for examples of notebooks with local and remote compute targets.
132132

@@ -142,7 +142,7 @@ There are several options that you can use to configure your automated machine l
142142

143143
Some examples include:
144144

145-
1. Classification experiment using AUC weighted as the primary metric with experiment timeout minutes set to 30 minutes and 2 cross-validation folds.
145+
1. Classification experiment using AUC weighted as the primary metric with experiment timeout minutes set to 30 minutes and 2 cross-validation folds.
146146

147147
```python
148148
automl_classifier=AutoMLConfig(
@@ -154,7 +154,7 @@ Some examples include:
154154
label_column_name=label,
155155
n_cross_validations=2)
156156
```
157-
2. Below is an example of a regression experiment set to end after 60 minutes with five validation cross folds.
157+
2. Below is an example of a regression experiment set to end after 60 minutes with five validation cross folds.
158158

159159
```python
160160
automl_regressor = AutoMLConfig(
@@ -367,7 +367,7 @@ Use these 2 APIs on the first step of fitted model to understand more. See [thi
367367
This list includes all engineered feature names.
368368

369369
>[!Note]
370-
>Use 'timeseriestransformer' for task=forecasting, else use 'datatransformer' for regression or classification task.
370+
>Use 'timeseriestransformer' for task='forecasting', else use 'datatransformer' for 'regression' or 'classification' task.
371371

372372
+ API 2: `get_featurization_summary()` returns featurization summary for all the input features.
373373

@@ -377,7 +377,7 @@ Use these 2 APIs on the first step of fitted model to understand more. See [thi
377377
```
378378

379379
>[!Note]
380-
>Use 'timeseriestransformer' for task=forecasting, else use 'datatransformer' for regression or classification task.
380+
>Use 'timeseriestransformer' for task='forecasting', else use 'datatransformer' for 'regression' or 'classification' task.
381381

382382
Output:
383383
```

articles/machine-learning/how-to-create-portal-experiments.md

Lines changed: 17 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
---
2-
title: Build & deploy automated ML models
2+
title: Use autoML to create models & deploy
33
titleSuffix: Azure Machine Learning
4-
description: Create, manage, and deploy automated machine learning experiments in Azure Machine Learning studio.
4+
description: Create, review, and deploy automated machine learning models with Azure Machine Learning.
55
services: machine-learning
66
ms.service: machine-learning
77
ms.subservice: core
@@ -10,26 +10,28 @@ ms.author: nibaccam
1010
author: tsikiksr
1111
manager: cgronlun
1212
ms.reviewer: nibaccam
13-
ms.date: 02/04/2020
13+
ms.date: 03/10/2020
1414

1515
---
1616

17-
# Create, explore, and deploy automated machine learning experiments with Azure Machine Learning studio
17+
# Create, review, and deploy automated machine learning models with Azure Machine Learning
1818
[!INCLUDE [applies-to-skus](../../includes/aml-applies-to-enterprise-sku.md)]
1919

20-
In this article, you learn how to create, explore, and deploy automated machine learning experiments in Azure Machine Learning studio without a single line of code. Automated machine learning automates the process of selecting the best algorithm to use for your specific data, so you can generate a machine learning model quickly. [Learn more about automated machine learning](concept-automated-ml.md).
20+
In this article, you learn how to create, explore, and deploy automated machine learning models without a single line of code in Azure Machine Learning's studio interface. Automated machine learning is a process in which the best machine learning algorithm to use for your specific data is selected for you. This process enables you to generate machine learning models quickly. [Learn more about automated machine learning](concept-automated-ml.md).
21+
22+
For an end to end example, try the [tutorial for creating a classification model with Azure Machine Learning's automated ML interface](tutorial-first-experiment-automated-ml.md).
2123

22-
If you prefer a more code-based experience, you can also [configure your automated machine learning experiments in Python](how-to-configure-auto-train.md) with the [Azure Machine Learning SDK](https://docs.microsoft.com/python/api/overview/azure/ml/intro?view=azure-ml-py).
24+
For a Python code-based experience, [configure your automated machine learning experiments](how-to-configure-auto-train.md) with the Azure Machine Learning SDK.
2325

2426
## Prerequisites
2527

26-
* An Azure subscription. If you dont have an Azure subscription, create a free account before you begin. Try the [free or paid version of Azure Machine Learning](https://aka.ms/AMLFree) today.
28+
* An Azure subscription. If you don't have an Azure subscription, create a free account before you begin. Try the [free or paid version of Azure Machine Learning](https://aka.ms/AMLFree) today.
2729

2830
* An Azure Machine Learning workspace with a type of **Enterprise edition**. See [Create an Azure Machine Learning workspace](how-to-manage-workspace.md). To upgrade an existing workspace to Enterprise edition, see [Upgrade to Enterprise edition](how-to-manage-workspace.md#upgrade).
2931

3032
## Get started
3133

32-
1. Sign in to [Azure Machine Learning studio](https://ml.azure.com).
34+
1. Sign in to Azure Machine Learning at https://ml.azure.com.
3335

3436
1. Select your subscription and workspace.
3537

@@ -164,7 +166,7 @@ Automated machine learning offers preprocessing and data guardrails automaticall
164166
|Impute missing values|For numerical features, impute with average of values in the column.<br/><br/>For categorical features, impute with most frequent value.|
165167
|Generate additional features|For DateTime features: Year, Month, Day, Day of week, Day of year, Quarter, Week of the year, Hour, Minute, Second.<br/><br/>For Text features: Term frequency based on unigrams, bi-grams, and tri-character-grams.|
166168
|Transform and encode |Numeric features with few unique values are transformed into categorical features.<br/><br/>One-hot encoding is performed for low cardinality categorical; for high cardinality, one-hot-hash encoding.|
167-
|Word embeddings|Text featurizer that converts vectors of text tokens into sentence vectors using a pre-trained model. Each words embedding vector in a document is aggregated together to produce a document feature vector.|
169+
|Word embeddings|Text featurizer that converts vectors of text tokens into sentence vectors using a pre-trained model. Each word's embedding vector in a document is aggregated together to produce a document feature vector.|
168170
|Target encodings|For categorical features, maps each category with averaged target value for regression problems, and to the class probability for each class for classification problems. Frequency-based weighting and k-fold cross validation is applied to reduce over fitting of the mapping and noise caused by sparse data categories.|
169171
|Text target encoding|For text input, a stacked linear model with bag-of-words is used to generate the probability of each class.|
170172
|Weight of Evidence (WoE)|Calculates WoE as a measure of correlation of categorical columns to the target column. It is calculated as the log of the ratio of in-class vs out-of-class probabilities. This step outputs one numerical feature column per class and removes the need to explicitly impute missing values and outlier treatment.|
@@ -178,10 +180,10 @@ The following table describes the currently supported data guardrails, and the a
178180

179181
Guardrail|Status|Condition&nbsp;for&nbsp;trigger
180182
---|---|---
181-
Missing&nbsp;values&nbsp;imputation |**Passed** <br> <br> **Fixed**| No missing value in any of the input&nbsp;columns <br> <br> Some columns have missing values
183+
Missing&nbsp;values&nbsp;imputation |**Passed** <br> <br> **Fixed**| No missing value in any of the input&nbsp;columns <br> <br> Some columns have missing values
182184
Cross validation|**Done**|If no explicit validation set is provided
183-
High&nbsp;cardinality&nbsp;feature&nbsp;detection| **Passed** <br> <br>**Done**| No high cardinality features were detected <br><br> High cardinality input columns were detected
184-
Class balance detection |**Passed** <br><br><br>**Alerted** |Classes are balanced in the training data; A dataset is considered balanced if each class has good representation in the dataset, as measured by number and ratio of samples <br> <br> Classes in the training data are imbalanced
185+
High&nbsp;cardinality&nbsp;feature&nbsp;detection| **Passed** <br> <br>**Done**| No high cardinality features were detected <br><br> High cardinality input columns were detected
186+
Class balance detection |**Passed** <br><br><br>**Alerted** |Classes are balanced in the training data; A dataset is considered balanced if each class has good representation in the dataset, as measured by number and ratio of samples <br> <br> Classes in the training data are imbalanced
185187
Time-series data consistency|**Passed** <br><br><br><br> **Fixed** |<br> The selected {horizon, lag, rolling window} value(s) were analyzed, and no potential out-of-memory issues were detected. <br> <br>The selected {horizon, lag, rolling window} values were analyzed and will potentially cause your experiment to run out of memory. The lag or rolling window has been turned off.
186188

187189
## Run experiment and view results
@@ -232,11 +234,10 @@ Automated ML helps you with deploying the model without writing code:
232234

233235
1. Select **Deploy**. Deployment can take about 20 minutes to complete.
234236

235-
Now you have an operational web service to generate predictions! You can test the predictions by querying the service from [Power BIs built in Azure Machine Learning support](how-to-consume-web-service.md#consume-the-service-from-power-bi).
237+
Now you have an operational web service to generate predictions! You can test the predictions by querying the service from [Power BI's built in Azure Machine Learning support](how-to-consume-web-service.md#consume-the-service-from-power-bi).
236238

237239
## Next steps
238240

239-
* Try the end to end [tutorial for creating your first automated ML experiment with Azure Machine Learning studio](tutorial-first-experiment-automated-ml.md).
240-
* [Learn more about automated machine learning](concept-automated-ml.md) and Azure Machine Learning.
241-
* [Understand automated machine learning results](how-to-understand-automated-ml.md).
242241
* [Learn how to consume a web service](https://docs.microsoft.com/azure/machine-learning/how-to-consume-web-service).
242+
* [Understand automated machine learning results](how-to-understand-automated-ml.md).
243+
* [Learn more about automated machine learning](concept-automated-ml.md) and Azure Machine Learning.

articles/machine-learning/how-to-train-ml-models.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@ ms.service: machine-learning
99
ms.subservice: core
1010
ms.topic: conceptual
1111
ms.reviewer: sgilley
12-
ms.date: 11/08/2019
12+
ms.date: 03/09/2020
1313
ms.custom: seodec18
1414

1515
---
@@ -34,7 +34,7 @@ This article focuses on steps 4-5. For steps 1-3, refer to the [train a model tu
3434

3535
### Single-node training
3636

37-
Use an `Estimator` for a single-node training run on remote compute in Azure for a scikit-learn model. You should have already created your [compute target](how-to-set-up-training-targets.md#amlcompute) object `compute_target` and your [datastore](how-to-access-data.md) object `ds`.
37+
Use an `Estimator` for a single-node training run on remote compute in Azure for a scikit-learn model. You should have already created your [compute target](how-to-set-up-training-targets.md#amlcompute) object `compute_target` and your [FileDataset](how-to-create-register-datasets.md) object `ds`.
3838

3939
```Python
4040
from azureml.train.estimator import Estimator

0 commit comments

Comments
 (0)