Commit 150cd08

Merge pull request #88973 from PeterCLu/plu-samples-to-how-to-working
AMLs Convert visual interface samples to how-tos
2 parents d6cbf9e + c86f808 commit 150cd08

39 files changed: 140 additions and 140 deletions

articles/machine-learning/index.yml

Lines changed: 3 additions & 3 deletions
@@ -82,11 +82,11 @@ landingContent:
8282
- linkListType: how-to-guide
8383
links:
8484
- text: Predict credit risk
85-
url: service/ui-sample-classification-predict-credit-risk-cost-sensitive.md
85+
url: service/how-to-ui-sample-classification-predict-credit-risk-cost-sensitive.md
8686
- text: Predict delays
87-
url: service/ui-sample-classification-predict-flight-delay.md
87+
url: service/how-to-ui-sample-classification-predict-flight-delay.md
8888
- text: Predict churn
89-
url: service/ui-sample-classification-predict-churn.md
89+
url: service/how-to-ui-sample-classification-predict-churn.md
9090

9191
# MLOps
9292
- title: Deploy & manage models
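
The renames in this commit follow one pattern: each `ui-sample-*.md` article gains a `how-to-` prefix, and every link to it is updated to match. A minimal Python sketch of that mapping follows. The helper name `add_how_to_prefix` is hypothetical and is not part of the commit; it only illustrates the file-name rule visible in the changed `url:` lines.

```python
import posixpath

PREFIX = "how-to-"


def add_how_to_prefix(path: str) -> str:
    """Prepend 'how-to-' to a file name that starts with 'ui-sample-'.

    Paths whose file name does not start with 'ui-sample-' (including ones
    that were already renamed) are returned unchanged.
    """
    directory, name = posixpath.split(path)
    if name.startswith("ui-sample-"):
        name = PREFIX + name
    return posixpath.join(directory, name)


old_url = "service/ui-sample-classification-predict-churn.md"
new_url = add_how_to_prefix(old_url)
print(new_url)
```

The sketch only handles article file names. The media folders renamed in the same commit (for example `media/ui-sample-classification-predict-churn/`) change a directory component instead, so they would need a separate rule.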

articles/machine-learning/service/ui-sample-classification-predict-churn.md renamed to articles/machine-learning/service/how-to-ui-sample-classification-predict-churn.md

Lines changed: 17 additions & 17 deletions
@@ -18,47 +18,47 @@ Learn how to build a complex machine learning experiment without writing a singl
1818

1919
This experiment trains three, **two-class boosted decision tree** classifiers to predict common tasks for customer relationship management (CRM) systems: churn, appetency, and up-selling. The data values and labels are split across multiple data sources and scrambled to anonymize customer information, however, we can still use the visual interface to combine data sets and train a model using the scrambled values.
2020

21-
Because we're trying to answer the question "Which one?" this is called a classification problem. However, you can apply the same steps in this experiment to tackle any type of machine learning problem whether it be regression, classification, clustering, and so on.
21+
Because you're trying to answer the question "Which one?" this is called a classification problem, but you can apply the same logic in this project to tackle any type of machine learning problem whether it be regression, classification, clustering, and so on.
2222

2323
Here's the completed graph for this experiment:
2424

25-
![Experiment graph](./media/ui-sample-classification-predict-churn/experiment-graph.png)
25+
![Experiment graph](./media/how-to-ui-sample-classification-predict-churn/experiment-graph.png)
2626

2727
## Prerequisites
2828

2929
[!INCLUDE [aml-ui-prereq](../../../includes/aml-ui-prereq.md)]
3030

3131
4. Select the **Open** button for the Sample 5 experiment.
3232

33-
![Open the experiment](media/ui-sample-classification-predict-churn/open-sample5.png)
33+
![Open the experiment](media/how-to-ui-sample-classification-predict-churn/open-sample5.png)
3434

3535
## Data
3636

37-
The data we use for this experiment is from KDD Cup 2009. The dataset has 50,000 rows and 230 feature columns. The task is to predict churn, appetency, and up-selling for customers who use these features. For more information about the data and the task, see the [KDD website](https://www.kdd.org/kdd-cup/view/kdd-cup-2009).
37+
The data for this experiment is from KDD Cup 2009. It has 50,000 rows and 230 feature columns. The task is to predict churn, appetency, and up-selling for customers who use these features. For more information about the data and the task, see the [KDD website](https://www.kdd.org/kdd-cup/view/kdd-cup-2009).
3838

3939
## Experiment summary
4040

4141
This visual interface sample experiment shows binary classifier prediction of churn, appetency, and up-selling, a common task for customer relationship management (CRM).
4242

43-
First, we do some simple data processing.
43+
First, do some simple data processing.
4444

45-
- The raw dataset contains lots of missing values. We use the **Clean Missing Data** module to replace the missing values with 0.
45+
- The raw dataset contains lots of missing values. Use the **Clean Missing Data** module to replace the missing values with 0.
4646

47-
![Clean the dataset](./media/ui-sample-classification-predict-churn/cleaned-dataset.png)
47+
![Clean the dataset](./media/how-to-ui-sample-classification-predict-churn/cleaned-dataset.png)
4848

49-
- The features and the corresponding churn, appetency, and up-selling labels are in different datasets. We use the **Add Columns** module to append the label columns to the feature columns. The first column, **Col1**, is the label column. The rest of the columns, **Var1**, **Var2**, and so on, are the feature columns.
49+
- The features and the corresponding churn, appetency, and up-selling labels are in different datasets. Use the **Add Columns** module to append the label columns to the feature columns. The first column, **Col1**, is the label column. The rest of the columns, **Var1**, **Var2**, and so on, are the feature columns.
5050

51-
![Add the column dataset](./media/ui-sample-classification-predict-churn/added-column1.png)
51+
![Add the column dataset](./media/how-to-ui-sample-classification-predict-churn/added-column1.png)
5252

53-
- We use the **Split Data** module to split the dataset into train and test sets.
53+
- Use the **Split Data** module to split the dataset into train and test sets.
5454

55-
We then use the Boosted Decision Tree binary classifier with the default parameters to build the prediction models. We build one model per task, that is, one model each to predict up-selling, appetency, and churn.
55+
Then use the Boosted Decision Tree binary classifier with the default parameters to build the prediction models. Build one model per task, that is, one model each to predict up-selling, appetency, and churn.
5656

5757
## Results
5858

5959
Visualize the output of the **Evaluate Model** module to see the performance of the model on the test set. For the up-selling task, the ROC curve shows that the model does better than a random model. The area under the curve (AUC) is 0.857. At threshold 0.5, the precision is 0.7, the recall is 0.463, and the F1 score is 0.545.
6060

61-
![Evaluate the results](./media/ui-sample-classification-predict-churn/evaluate-result.png)
61+
![Evaluate the results](./media/how-to-ui-sample-classification-predict-churn/evaluate-result.png)
6262

6363
You can move the **Threshold** slider and see the metrics change for the binary classification task.
6464

@@ -70,8 +70,8 @@ Visualize the output of the **Evaluate Model** module to see the performance of
7070

7171
Explore the other samples available for the visual interface:
7272

73-
- [Sample 1 - Regression: Predict an automobile's price](ui-sample-regression-predict-automobile-price-basic.md)
74-
- [Sample 2 - Regression: Compare algorithms for automobile price prediction](ui-sample-regression-predict-automobile-price-compare-algorithms.md)
75-
- [Sample 3 - Classification: Predict credit risk](ui-sample-classification-predict-credit-risk-basic.md)
76-
- [Sample 4 - Classification: Predict credit risk (cost sensitive)](ui-sample-classification-predict-credit-risk-cost-sensitive.md)
77-
- [Sample 6 - Classification: Predict flight delays](ui-sample-classification-predict-flight-delay.md)
73+
- [Sample 1 - Regression: Predict an automobile's price](how-to-ui-sample-regression-predict-automobile-price-basic.md)
74+
- [Sample 2 - Regression: Compare algorithms for automobile price prediction](how-to-ui-sample-regression-predict-automobile-price-compare-algorithms.md)
75+
- [Sample 3 - Classification: Predict credit risk](how-to-ui-sample-classification-predict-credit-risk-basic.md)
76+
- [Sample 4 - Classification: Predict credit risk (cost sensitive)](how-to-ui-sample-classification-predict-credit-risk-cost-sensitive.md)
77+
- [Sample 6 - Classification: Predict flight delays](how-to-ui-sample-classification-predict-flight-delay.md)
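
For readers who want to see the data processing from the churn article's experiment summary outside the visual interface, here is a rough plain-Python sketch of the three steps: replace missing values with 0, append the label column to the feature columns, and split into train and test sets. It does not reproduce the **Clean Missing Data**, **Add Columns**, or **Split Data** modules. The function names, the toy rows, the label values, and the 0.75 train fraction are all assumptions made for illustration; the article does not state the split ratio.

```python
import random


def clean_missing(rows, fill=0):
    """Replace missing values (None) with `fill` in every row."""
    return [[fill if value is None else value for value in row] for row in rows]


def add_columns(labels, features):
    """Build rows with the label first, followed by the feature values."""
    if len(labels) != len(features):
        raise ValueError("labels and features must have the same number of rows")
    return [[label] + list(row) for label, row in zip(labels, features)]


def split_data(rows, train_fraction=0.7, seed=0):
    """Shuffle rows reproducibly, then split them into train and test sets."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


# Toy stand-ins for the feature and label datasets (hypothetical values).
features = [[1.5, None], [None, 2.0], [0.3, 0.4], [None, None]]
labels = [1, -1, -1, 1]

dataset = add_columns(labels, clean_missing(features))
train, test = split_data(dataset, train_fraction=0.75)
print(len(train), len(test))
```

Placing the label in the first position mirrors the article's description of **Col1** as the label column, followed by the feature columns.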

articles/machine-learning/service/ui-sample-classification-predict-credit-risk-basic.md renamed to articles/machine-learning/service/how-to-ui-sample-classification-predict-credit-risk-basic.md

Lines changed: 14 additions & 18 deletions
@@ -16,33 +16,33 @@ ms.date: 05/10/2019
1616

1717
Learn how to build a machine learning classifier without writing a single line of code using the visual interface. This sample trains a **two-class boosted decision tree** to predict credit risk (high or low) based on credit application information such as credit history, age, and number of credit cards.
1818

19-
Because we're trying to answer the question "Which one?" this is called a classification problem. However, you can apply the same fundamental process to tackle any type of machine learning problem whether it be regression, classification, clustering, and so on.
19+
Because the question is answering "Which one?" this is called a classification problem. However, you can apply the same fundamental process to tackle any type of machine learning problem whether it be regression, classification, clustering, and so on.
2020

21-
Here's the completed graph for this experiment:
21+
Here's the final experiment graph for this sample:
2222

23-
![Graph of the experiment](media/ui-sample-classification-predict-credit-risk-basic/overall-graph.png)
23+
![Graph of the experiment](media/how-to-ui-sample-classification-predict-credit-risk-basic/overall-graph.png)
2424

2525
## Prerequisites
2626

2727
[!INCLUDE [aml-ui-prereq](../../../includes/aml-ui-prereq.md)]
2828

2929
4. Select the **Open** button for the Sample 3 experiment:
3030

31-
![Open the experiment](media/ui-sample-classification-predict-credit-risk-basic/open-sample3.png)
31+
![Open the experiment](media/how-to-ui-sample-classification-predict-credit-risk-basic/open-sample3.png)
3232

3333
## Related sample
3434

35-
[Sample 4 - Classification: Credit Risk Prediction (Cost Sensitive)](ui-sample-classification-predict-credit-risk-cost-sensitive.md)
36-
provides an advanced experiment that solves the same problem as this experiment. It shows how to perform _cost sensitive_ classification by using an **Execute Python Script** module and compare the performance of two binary classification algorithms. Refer to it if you want to learn more about how to build classification experiments.
35+
[Sample 4 - Classification: Credit Risk Prediction (Cost Sensitive)](how-to-ui-sample-classification-predict-credit-risk-cost-sensitive.md)
36+
provides an advanced experiment that solves the same problem as this experiment. It shows how to perform *cost sensitive* classification by using an **Execute Python Script** module and compare the performance of two binary classification algorithms. Refer to it if you want to learn more about how to build classification pipelines.
3737

3838
## Data
3939

40-
We use the German Credit Card dataset from the UC Irvine repository.
40+
The sample uses the German Credit Card dataset from the UC Irvine repository.
4141
The dataset contains 1,000 samples with 20 features and 1 label. Each sample represents a person. The features include numerical and categorical features. See the [UCI website](https://archive.ics.uci.edu/ml/datasets/Statlog+%28German+Credit+Data%29) for the meaning of the categorical features. The last column is the label, which denotes the credit risk and has only two possible values: high credit risk = 2, and low credit risk = 1.
4242

4343
## Experiment summary
4444

45-
We follow these steps to create the experiment:
45+
Follow these steps to create the experiment:
4646

4747
1. Drag the German Credit Card UCI Data dataset module into the experiment's canvas.
4848
1. Add an **Edit Metadata** module so we can add meaningful names for each column.
@@ -52,13 +52,9 @@ We follow these steps to create the experiment:
5252
1. Add a **Score Model** module and connect the **Train Model** module to it. Then add the test set (the right port of the **Split Data**) to the **Score Model**. The **Score Model** will make the predictions. You can select its output port to see the predictions and the positive class probabilities.
5353
1. Add an **Evaluate Model** module and connect the scored dataset to its left input port. To see the evaluation results, select the output port of the **Evaluate Model** module and select **Visualize**.
5454

55-
Here's the complete experiment graph:
56-
57-
![Graph of the experiment](media/ui-sample-classification-predict-credit-risk-basic/overall-graph.png)
58-
5955
## Results
6056

61-
![Evaluate the results](media/ui-sample-classification-predict-credit-risk-basic/evaluate-result.png)
57+
![Evaluate the results](media/how-to-ui-sample-classification-predict-credit-risk-basic/evaluate-result.png)
6258

6359
In the evaluation results, you can see that the AUC of the model is 0.776. At threshold 0.5, the precision is 0.621, the recall is 0.456, and the F1 score is 0.526.
6460

@@ -70,8 +66,8 @@ In the evaluation results, you can see that the AUC of the model is 0.776. At th
7066

7167
Explore the other samples available for the visual interface:
7268

73-
- [Sample 1 - Regression: Predict an automobile's price](ui-sample-regression-predict-automobile-price-basic.md)
74-
- [Sample 2 - Regression: Compare algorithms for automobile price prediction](ui-sample-regression-predict-automobile-price-compare-algorithms.md)
75-
- [Sample 4 - Classification: Predict credit risk (cost sensitive)](ui-sample-classification-predict-credit-risk-cost-sensitive.md)
76-
- [Sample 5 - Classification: Predict churn](ui-sample-classification-predict-churn.md)
77-
- [Sample 6 - Classification: Predict flight delays](ui-sample-classification-predict-flight-delay.md)
69+
- [Sample 1 - Regression: Predict an automobile's price](how-to-ui-sample-regression-predict-automobile-price-basic.md)
70+
- [Sample 2 - Regression: Compare algorithms for automobile price prediction](how-to-ui-sample-regression-predict-automobile-price-compare-algorithms.md)
71+
- [Sample 4 - Classification: Predict credit risk (cost sensitive)](how-to-ui-sample-classification-predict-credit-risk-cost-sensitive.md)
72+
- [Sample 5 - Classification: Predict churn](how-to-ui-sample-classification-predict-churn.md)
73+
- [Sample 6 - Classification: Predict flight delays](how-to-ui-sample-classification-predict-flight-delay.md)
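
The F1 score quoted in the credit risk results is the harmonic mean of precision and recall, so it can be recomputed from the other two reported figures. A short Python sketch, using the standard F1 definition and the precision (0.621) and recall (0.456) from the article; the function name `f1_score` is chosen here for illustration:

```python
def f1_score(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall (0.0 if both are 0)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Figures reported for the credit risk sample at threshold 0.5.
credit_risk_f1 = f1_score(0.621, 0.456)
print(round(credit_risk_f1, 3))  # prints 0.526
```

The result matches the F1 score of 0.526 stated in the article.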
