articles/machine-learning/algorithm-cheat-sheet.md
Lines changed: 8 additions & 6 deletions
@@ -15,7 +15,9 @@ ms.date: 11/04/2019
The **Azure Machine Learning Algorithm Cheat Sheet** helps you choose the right algorithm for a predictive analytics model.
-
Azure Machine Learning has a large library of algorithms from the ***classification***, ***recommender systems***, ***clustering***, ***anomaly detection***, ***regression*** and ***text analytics*** families. Each is designed to address a different type of machine learning problem.
+
Azure Machine Learning has a large library of algorithms from the ***classification***, ***recommender systems***, ***clustering***, ***anomaly detection***, ***regression***, and ***text analytics*** families. Each is designed to address a different type of machine learning problem.
+
+
For additional guidance, see [How to select algorithms](how-to-select-algorithms.md)
@@ -27,7 +29,7 @@ Download and print the Machine Learning Algorithm Cheat Sheet in tabloid size to
## How to use the Machine Learning Algorithm Cheat Sheet
-
The suggestions offered in this algorithm cheat sheet are approximate rules-of-thumb. Some can be bent, and some can be flagrantly violated. This is intended to suggest a starting point. Don’t be afraid to run a head-to-head competition between several algorithms on your data. There is simply no substitute for understanding the principles of each algorithm and the system that generated your data.
+
The suggestions offered in this algorithm cheat sheet are approximate rules-of-thumb. Some can be bent, and some can be flagrantly violated. This cheat sheet is intended to suggest a starting point. Don’t be afraid to run a head-to-head competition between several algorithms on your data. There is simply no substitute for understanding the principles of each algorithm and the system that generated your data.
Every machine learning algorithm has its own style or inductive bias. For a specific problem, several algorithms may be appropriate, and one algorithm may be a better fit than others. But it's not always possible to know beforehand which is the best fit. In cases like these, several algorithms are listed together in the cheat sheet. An appropriate strategy would be to try one algorithm, and if the results are not yet satisfactory, try the others.
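The "head-to-head competition" strategy described above can be sketched in a few lines. This is a minimal illustration, not part of the cheat sheet or the designer: the toy data, the two candidate models, and the helper names (`fit_mean`, `fit_line`, `head_to_head`) are all assumptions made for the example. Each candidate is fitted on the same training data and scored on the same held-out data, and the one with the lowest error is reported.

```python
from statistics import mean

def fit_mean(xs, ys):
    """Baseline candidate: always predict the mean of the training targets."""
    m = mean(ys)
    return lambda x: m

def fit_line(xs, ys):
    """Second candidate: ordinary least squares for a single feature."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = my - slope * mx
    return lambda x: slope * x + intercept

def mse(model, xs, ys):
    """Mean squared error of a fitted model on (xs, ys)."""
    return mean((model(x) - y) ** 2 for x, y in zip(xs, ys))

def head_to_head(candidates, train, test):
    """Fit every candidate on train, score it on test, return (scores, best name)."""
    scores = {name: mse(fit(*train), *test) for name, fit in candidates.items()}
    return scores, min(scores, key=scores.get)

# Hypothetical toy data: a roughly linear relationship with a little noise.
train = ([0, 1, 2, 3, 4, 5], [1.1, 2.9, 5.2, 6.8, 9.1, 10.9])
test = ([6, 7, 8], [13.0, 15.1, 16.9])

scores, best = head_to_head(
    {"mean_baseline": fit_mean, "least_squares": fit_line}, train, test
)
print(scores, best)
```

Scoring on held-out data rather than on the training data is what makes the comparison fair: a candidate that only memorizes its training set gains no advantage.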
@@ -43,17 +45,17 @@ In supervised learning, each data point is labeled or associated with a category
### Unsupervised learning
-
In unsupervised learning, data points have no labels associated with them. Instead, the goal of an unsupervised learning algorithm is to organize the data in some way or to describe its structure. This can mean grouping it into clusters, as K-means does, or finding different ways of looking at complex data so that it appears simpler.
+
In unsupervised learning, data points have no labels associated with them. Instead, the goal of an unsupervised learning algorithm is to organize the data in some way or to describe its structure. Unsupervised learning groups data into clusters, as K-means does, or finds different ways of looking at complex data so that it appears simpler.
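To make the K-means reference concrete, here is a minimal sketch of the algorithm in plain Python. It is not the implementation behind the designer's K-Means Clustering module; the function name `k_means`, the toy points, and the choice of starting centroids are assumptions for illustration only. The loop alternates between assigning each unlabeled point to its nearest centroid and moving each centroid to the mean of its assigned points.

```python
from math import dist

def k_means(points, centroids, iterations=10):
    """Cluster points around the given starting centroids (Lloyd's algorithm)."""
    centroids = [tuple(c) for c in centroids]
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)), key=lambda i: dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for old, members in zip(centroids, clusters):
            if members:
                new_centroids.append(
                    tuple(sum(axis) / len(members) for axis in zip(*members))
                )
            else:
                new_centroids.append(old)  # keep an empty cluster's centroid in place
        if new_centroids == centroids:
            break  # assignments have stabilized
        centroids = new_centroids
    labels = [
        min(range(len(centroids)), key=lambda i: dist(p, centroids[i]))
        for p in points
    ]
    return centroids, labels

# Hypothetical unlabeled data: two visually separate groups.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 0.0), (8.0, 8.0), (9.0, 9.5), (8.5, 9.0)]
centroids, labels = k_means(points, centroids=[points[0], points[3]])
print(centroids, labels)
```

No labels are supplied at any point; the cluster indices in `labels` come entirely from the structure of the data and the starting centroids.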
### Reinforcement learning
-
In reinforcement learning, the algorithm gets to choose an action in response to each data point. It is a common approach in robotics, where the set of sensor readings at one point in time is a data point, and the algorithm must choose the robot’s next action. It's also a natural fit for Internet of Things applications. The learning algorithm also receives a reward signal a short time later, indicating how good the decision was. Based on this, the algorithm modifies its strategy in order to achieve the highest reward.
+
In reinforcement learning, the algorithm gets to choose an action in response to each data point. It is a common approach in robotics, where the set of sensor readings at one point in time is a data point, and the algorithm must choose the robot’s next action. It's also a natural fit for Internet of Things applications. The learning algorithm also receives a reward signal a short time later, indicating how good the decision was. Based on this signal, the algorithm modifies its strategy in order to achieve the highest reward.
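The act, receive reward, adjust loop described above can be sketched with an epsilon-greedy agent. This is a deliberately stateless simplification (a multi-armed bandit), so it ignores the sensor readings a real robot would use; the function name `epsilon_greedy`, the fixed payoff values, and the parameter defaults are assumptions for illustration, not anything provided by Azure Machine Learning.

```python
import random

def epsilon_greedy(payoffs, steps=1000, epsilon=0.1, seed=0):
    """Learn which action pays best from reward feedback alone."""
    rng = random.Random(seed)          # seeded so runs are repeatable
    estimates = [0.0] * len(payoffs)   # current estimate of each action's reward
    counts = [0] * len(payoffs)        # how often each action was taken
    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: occasionally try a random action.
            action = rng.randrange(len(payoffs))
        else:
            # Exploit: take the action with the highest estimated reward.
            action = max(range(len(payoffs)), key=estimates.__getitem__)
        reward = payoffs[action]       # reward signal (fixed per action in this toy)
        counts[action] += 1
        # Adjust the strategy: move the estimate toward the observed reward.
        estimates[action] += (reward - estimates[action]) / counts[action]
    return estimates, counts

# Hypothetical environment with three actions and fixed rewards.
estimates, counts = epsilon_greedy(payoffs=[0.2, 0.5, 0.8])
best_action = max(range(len(estimates)), key=estimates.__getitem__)
print(estimates, counts, best_action)
```

The small exploration rate matters: without it, the agent would settle on the first action that ever produced a reward and never discover better ones.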
## Next steps
-
*[Learn about studio in Azure Machine Learning and the Azure portal](overview-what-is-azure-ml.md).
+
*See additional guidance on [How to select algorithms](how-to-select-algorithms.md)
-
*See a list of algorithms and modules in the [Algorithm and module reference](algorithm-module-reference/module-reference.md).
+
*[Learn about studio in Azure Machine Learning and the Azure portal](overview-what-is-azure-ml.md).
*[Tutorial: Build a prediction model in Azure Machine Learning designer](tutorial-designer-automobile-price-train-score.md).
articles/machine-learning/algorithm-module-reference/module-reference.md
Lines changed: 19 additions & 5 deletions
@@ -17,25 +17,35 @@ This reference content provides the technical background on each of the machine
Each module represents a set of code that can run independently and perform a machine learning task, given the required inputs. A module might contain a particular algorithm, or perform a task that is important in machine learning, such as missing value replacement, or statistical analysis.
+
For help with choosing algorithms, see
+
*[How to select algorithms](../how-to-select-algorithms.md)
> In any pipeline in the designer, you can get information about a specific module. Select the module, then select the **more help** link in the **Quick Help** pane.
-
## Modules
+
## Data preparation modules
-
Modules are organized by functionality:
| Functionality | Description | Module |
| --- |--- | --- |
-
||**Data preparation**: ||
| Data input and output | Move data from cloud sources into your pipeline. Write your results or intermediate data to Azure Storage, a SQL database, or Hive, while running a pipeline, or use cloud storage to exchange data between pipelines. |[Enter Data Manually](enter-data-manually.md) <br/> [Export Data](export-data.md) <br/> [Import Data](import-data.md)|
| Data transformation | Operations on data that are unique to machine learning, such as normalizing or binning data, dimensionality reduction, and converting data among various file formats.|[Add Columns](add-columns.md) <br/> [Add Rows](add-rows.md) <br/> [Apply Math Operation](apply-math-operation.md) <br/> [Apply SQL Transformation](apply-sql-transformation.md) <br/> [Clean Missing Data](clean-missing-data.md) <br/> [Clip Values](clip-values.md) <br/> [Convert to CSV](convert-to-csv.md) <br/> [Convert to Dataset](convert-to-dataset.md) <br/> [Edit Metadata](edit-metadata.md) <br/> [Join Data](join-data.md) <br/> [Normalize Data](normalize-data.md) <br/> [Partition and Sample](partition-and-sample.md) <br/> [Remove Duplicate Rows](remove-duplicate-rows.md) <br/> [SMOTE](smote.md) <br/> [Select Columns Transform](select-columns-transform.md) <br/> [Select Columns in Dataset](select-columns-in-dataset.md) <br/> [Split Data](split-data.md)|
| Feature Selection | Select a subset of relevant, useful features to use in building an analytical model. |[Filter Based Feature Selection](filter-based-feature-selection.md) <br/> [Permutation Feature Importance](permutation-feature-importance.md)|
| Statistical Functions | Provide a wide variety of statistical methods related to data science. |[Summarize Data](summarize-data.md)|
| Clustering | Group data together.| [K-Means Clustering](k-means-clustering.md)|
| Classification | Predict a class. Choose from binary (two-class) or multiclass algorithms.|[Multiclass Boosted Decision Tree](multiclass-boosted-decision-tree.md) <br/> [Multiclass Decision Forest](multiclass-decision-forest.md) <br/> [Multiclass Logistic Regression](multiclass-logistic-regression.md) <br/> [Multiclass Neural Network](multiclass-neural-network.md) <br/> [One vs. All Multiclass](one-vs-all-multiclass.md) <br/> [Two-Class Averaged Perceptron](two-class-averaged-perceptron.md) <br/> [Two-Class Boosted Decision Tree](two-class-boosted-decision-tree.md) <br/> [Two-Class Decision Forest](two-class-decision-forest.md) <br/> [Two-Class Logistic Regression](two-class-logistic-regression.md) <br/> [Two-Class Neural Network](two-class-neural-network.md) <br/> [Two Class Support Vector Machine](two-class-support-vector-machine.md)|
-
||**Build and evaluate models**: ||
+
+
## Modules for building and evaluating models
+
+
| Functionality | Description | Module |
+
| --- |--- | --- |
| Model training | Run data through the algorithm. |[Train Clustering Model](train-clustering-model.md) <br/> [Train Model](train-model.md) <br/> [Tune Model Hyperparameters](tune-model-hyperparameters.md)|
| Model Scoring and Evaluation | Measure the accuracy of the trained model. |[Apply Transformation](apply-transformation.md) <br/> [Assign Data to Clusters](assign-data-to-clusters.md) <br/> [Cross Validate Model](cross-validate-model.md) <br/> [Evaluate Model](evaluate-model.md) <br/> [Score Model](score-model.md)|
| Python language | Write code and embed it in a module to integrate Python with your pipeline. |[Create Python Model](create-python-model.md) <br/> [Execute Python Script](execute-python-script.md)|
@@ -46,3 +56,7 @@ Modules are organized by functionality:
## Error messages
Learn about the [error messages and exception codes](designer-error-codes.md) you might encounter using modules in Azure Machine Learning designer.
+
+
## Next steps
+
+
*[Tutorial: Build a model in designer to predict auto prices](../tutorial-designer-automobile-price-train-score.md)