Skip to content

Commit 2e8d2b7

Browse files
authored
Merge pull request #99715 from likebupt/review-sample-docs
update screenshots in sample docs, correct some errors
2 parents a041fa2 + 1e60a2e commit 2e8d2b7

24 files changed

+31
-31
lines changed

articles/machine-learning/how-to-designer-sample-classification-churn.md

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -6,10 +6,10 @@ services: machine-learning
66
ms.service: machine-learning
77
ms.subservice: core
88
ms.topic: conceptual
9-
author: xiaoharper
10-
ms.author: zhanxia
9+
author: likebupt
10+
ms.author: keli19
1111
ms.reviewer: sgilley
12-
ms.date: 11/04/2019
12+
ms.date: 12/25/2019
1313
---
1414

1515
# Use boosted decision tree to predict churn with Azure Machine Learning designer
@@ -46,11 +46,11 @@ First, some simple data processing.
4646

4747
- The raw dataset has many missing values. Use the **Clean Missing Data** module to replace the missing values with 0.
4848

49-
![Clean the dataset](./media/how-to-designer-sample-classification-churn/cleaned-dataset.png)
49+
![Clean the dataset](media/how-to-designer-sample-classification-churn/sample5-dataset-1225.png)
5050

5151
- The features and the corresponding churn are in different datasets. Use the **Add Columns** module to append the label columns to the feature columns. The first column, **Col1**, is the label column. From the visualization result we can see the dataset is unbalanced. There way more negative (-1) examples than positive examples (+1). We will use **SMOTE** module to increase underrepresented cases later.
5252

53-
![Add the column dataset](./media/how-to-designer-sample-classification-churn/added-column1.png)
53+
![Add the column dataset](./media/how-to-designer-sample-classification-churn/sample5-addcol-1225.png)
5454

5555

5656

@@ -64,7 +64,7 @@ First, some simple data processing.
6464

6565
Visualize the output of the **Evaluate Model** module to see the performance of the model on the test set.
6666

67-
![Evaluate the results](./media/how-to-designer-sample-classification-churn/evaluate-result.png)
67+
![Evaluate the results](./media/how-to-designer-sample-classification-churn/sample5-evaluate-1225.png)
6868

6969
You can move the **Threshold** slider and see the metrics change for the binary classification task.
7070

articles/machine-learning/how-to-designer-sample-classification-credit-risk-cost-sensitive.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -6,10 +6,10 @@ services: machine-learning
66
ms.service: machine-learning
77
ms.subservice: core
88
ms.topic: conceptual
9-
author: xiaoharper
10-
ms.author: zhanxia
9+
author: likebupt
10+
ms.author: keli19
1111
ms.reviewer: peterlu
12-
ms.date: 11/04/2019
12+
ms.date: 12/25/2019
1313
---
1414
# Build a classifier & use Python scripts to predict credit risk using Azure Machine Learning designer
1515

@@ -138,7 +138,7 @@ def azureml_main(dataframe1 = None, dataframe2 = None):
138138

139139
To view the results of the pipeline, you can right-click the Visualize output of the last **Select Columns in Dataset** module.
140140

141-
![Visualize output](./media/how-to-designer-sample-classification-credit-risk-cost-sensitive/result.png)
141+
![Visualize output](media/how-to-designer-sample-classification-credit-risk-cost-sensitive/sample4-lastselect-1225.png)
142142

143143
The first column lists the machine learning algorithm used to generate the model.
144144

articles/machine-learning/how-to-designer-sample-classification-flight-delay.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -6,10 +6,10 @@ services: machine-learning
66
ms.service: machine-learning
77
ms.subservice: core
88
ms.topic: conceptual
9-
author: xiaoharper
10-
ms.author: zhanxia
9+
author: likebupt
10+
ms.author: keli19
1111
ms.reviewer: peterlu
12-
ms.date: 11/04/2019
12+
ms.date: 12/25/2019
1313
---
1414
# Build a classifier & use R to predict flight delays with Azure Machine Learning designer
1515

@@ -109,7 +109,7 @@ Finally, to test the quality of the results, add the **Evaluate Model** module t
109109
## Evaluate
110110
The logistic regression model has AUC of 0.631 on the test set.
111111

112-
![evaluate](./media/how-to-designer-sample-classification-flight-delay/evaluate.png)
112+
![evaluate](media/how-to-designer-sample-classification-flight-delay/sample6-evaluate-1225.png)
113113

114114
## Next steps
115115

articles/machine-learning/how-to-designer-sample-classification-predict-income.md

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -6,10 +6,10 @@ services: machine-learning
66
ms.service: machine-learning
77
ms.subservice: core
88
ms.topic: conceptual
9-
author: xiaoharper
10-
ms.author: zhanxia
9+
author: likebupt
10+
ms.author: keli19
1111
ms.reviewer: peterlu
12-
ms.date: 11/04/2019
12+
ms.date: 12/25/2019
1313
---
1414
# Build a classifier & use feature selection to predict income with Azure Machine Learning designer
1515

@@ -19,7 +19,7 @@ ms.date: 11/04/2019
1919

2020
Learn how to build a machine learning classifier without writing a single line of code using the designer (preview). This sample trains a **two-class boosted decision tree** to predict adult census income (>=50K or <=50K).
2121

22-
Because the question is answering "Which one?" this is called a classification problem. However, you can apply the same fundamental process to tackle any type of machine learning problem whether it be regression, classification, clustering, and so on.
22+
Because the question is answering "Which one?" this is called a classification problem. However, you can apply the same fundamental process to tackle any type of machine learning problem - regression, classification, clustering, and so on.
2323

2424
Here's the final pipeline graph for this sample:
2525

@@ -36,7 +36,7 @@ Here's the final pipeline graph for this sample:
3636
## Data
3737

3838
The dataset contains 14 features and one label column. There are multiple types of features, including numerical and categorical. The following diagram shows an excerpt from the dataset:
39-
![data](./media/how-to-designer-sample-classification-predict-income/data.png)
39+
![data](media/how-to-designer-sample-classification-predict-income/sample3-dataset-1225.png)
4040

4141

4242

@@ -60,7 +60,7 @@ Follow these steps to create the pipeline:
6060

6161
## Results
6262

63-
![Evaluate the results](./media/how-to-designer-sample-classification-predict-income/evaluate-result.png)
63+
![Evaluate the results](media/how-to-designer-sample-classification-predict-income/sample3-evaluate-1225.png)
6464

6565
In the evaluation results, you can see that the curves like ROC, Precision-recall and confusion metrics.
6666

articles/machine-learning/how-to-designer-sample-regression-automobile-price-basic.md

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -6,10 +6,10 @@ services: machine-learning
66
ms.service: machine-learning
77
ms.subservice: core
88
ms.topic: conceptual
9-
author: xiaoharper
10-
ms.author: zhanxia
9+
author: likebupt
10+
ms.author: keli19
1111
ms.reviewer: peterlu
12-
ms.date: 11/04/2019
12+
ms.date: 12/25/2019
1313
---
1414
# Use regression to predict car prices with Azure Machine Learning designer
1515

@@ -19,7 +19,7 @@ ms.date: 11/04/2019
1919

2020
Learn how to build a machine learning regression model without writing a single line of code using the designer (preview).
2121

22-
This pipeline trains a **decision forest regressor** to predict a car's price based on technical features such as make, model, horsepower, and size. Because you're trying to answer the question "How much?" this is called a regression problem. However, you can apply the same fundamental steps in this example to tackle any type of machine learning problem whether it be regression, classification, clustering, and so on.
22+
This pipeline trains a **linear regressor** to predict a car's price based on technical features such as make, model, horsepower, and size. Because you're trying to answer the question "How much?" this is called a regression problem. However, you can apply the same fundamental steps in this example to tackle any type of machine learning problem whether it be regression, classification, clustering, and so on.
2323

2424
The fundamental steps of a training machine learning model are:
2525

@@ -67,11 +67,11 @@ After the model is trained, you can use the **Score Model** and **Evaluate Model
6767

6868
**Score Model** generates predictions for the test dataset by using the trained model. To check the result, select the output port of **Score Model** and then select **Visualize**.
6969

70-
![Score result](./media/how-to-designer-sample-regression-automobile-price-basic/score-result.png)
70+
![Score result](./media/how-to-designer-sample-regression-automobile-price-basic/sample1-score-1225.png)
7171

7272
Pass the scores to the **Evaluate Model** module to generate evaluation metrics. To check the result, select the output port of the **Evaluate Model** and then select **Visualize**.
7373

74-
![Evaluate result](./media/how-to-designer-sample-regression-automobile-price-basic/evaluate-result.png)
74+
![Evaluate result](./media/how-to-designer-sample-regression-automobile-price-basic/sample1-evaluate-1225.png)
7575

7676
## Clean up resources
7777

articles/machine-learning/how-to-designer-sample-regression-automobile-price-compare-algorithms.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -6,10 +6,10 @@ services: machine-learning
66
ms.service: machine-learning
77
ms.subservice: core
88
ms.topic: conceptual
9-
author: xiaoharper
10-
ms.author: zhanxia
9+
author: likebupt
10+
ms.author: keli19
1111
ms.reviewer: peterlu
12-
ms.date: 11/04/2019
12+
ms.date: 12/25/2019
1313
---
1414
# Train & compare multiple regression models to predict car prices with Azure Machine Learning designer
1515

@@ -76,7 +76,7 @@ Here are the results:
7676

7777
These results show that the model built with **Boosted Decision Tree Regression** has a lower root mean squared error than the model built on **Decision Forest Regression**.
7878

79-
Both algorithms have a lower error on the training dataset than on the unseen testing dataset.
79+
8080

8181
## Clean up resources
8282

84.1 KB
Loading
76.2 KB
Loading
73.3 KB
Loading
18.3 KB
Loading

0 commit comments

Comments
 (0)