Skip to content

Commit e7fa1da

Browse files
authored
Merge pull request #102351 from cjgronlund/algo-x-refs
Formatting & xrefs: algo ref
2 parents e0aac26 + c4121df commit e7fa1da

File tree

2 files changed

+27
-11
lines changed

2 files changed

+27
-11
lines changed

articles/machine-learning/algorithm-cheat-sheet.md

Lines changed: 8 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -15,7 +15,9 @@ ms.date: 11/04/2019
1515

1616
The **Azure Machine Learning Algorithm Cheat Sheet** helps you choose the right algorithm for a predictive analytics model.
1717

18-
Azure Machine Learning has a large library of algorithms from the ***classification***, ***recommender systems***, ***clustering***, ***anomaly detection***, ***regression*** and ***text analytics*** families. Each is designed to address a different type of machine learning problem.
18+
Azure Machine Learning has a large library of algorithms from the ***classification***, ***recommender systems***, ***clustering***, ***anomaly detection***, ***regression***, and ***text analytics*** families. Each is designed to address a different type of machine learning problem.
19+
20+
For additional guidance, see [How to select algorithms](how-to-select-algorithms.md)
1921

2022
## Download: Machine Learning Algorithm Cheat Sheet
2123

@@ -27,7 +29,7 @@ Download and print the Machine Learning Algorithm Cheat Sheet in tabloid size to
2729

2830
## How to use the Machine Learning Algorithm Cheat Sheet
2931

30-
The suggestions offered in this algorithm cheat sheet are approximate rules-of-thumb. Some can be bent, and some can be flagrantly violated. This is intended to suggest a starting point. Don’t be afraid to run a head-to-head competition between several algorithms on your data. There is simply no substitute for understanding the principles of each algorithm and the system that generated your data.
32+
The suggestions offered in this algorithm cheat sheet are approximate rules-of-thumb. Some can be bent, and some can be flagrantly violated. This cheat sheet is intended to suggest a starting point. Don’t be afraid to run a head-to-head competition between several algorithms on your data. There is simply no substitute for understanding the principles of each algorithm and the system that generated your data.
3133

3234
Every machine learning algorithm has its own style or inductive bias. For a specific problem, several algorithms may be appropriate, and one algorithm may be a better fit than others. But it's not always possible to know beforehand which is the best fit. In cases like these, several algorithms are listed together in the cheat sheet. An appropriate strategy would be to try one algorithm, and if the results are not yet satisfactory, try the others.
3335

@@ -43,17 +45,17 @@ In supervised learning, each data point is labeled or associated with a category
4345

4446
### Unsupervised learning
4547

46-
In unsupervised learning, data points have no labels associated with them. Instead, the goal of an unsupervised learning algorithm is to organize the data in some way or to describe its structure. This can mean grouping it into clusters, as K-means does, or finding different ways of looking at complex data so that it appears simpler.
48+
In unsupervised learning, data points have no labels associated with them. Instead, the goal of an unsupervised learning algorithm is to organize the data in some way or to describe its structure. Unsupervised learning groups data into clusters, as K-means does, or finds different ways of looking at complex data so that it appears simpler.
4749

4850
### Reinforcement learning
4951

50-
In reinforcement learning, the algorithm gets to choose an action in response to each data point. It is a common approach in robotics, where the set of sensor readings at one point in time is a data point, and the algorithm must choose the robot’s next action. It's also a natural fit for Internet of Things applications. The learning algorithm also receives a reward signal a short time later, indicating how good the decision was. Based on this, the algorithm modifies its strategy in order to achieve the highest reward.
52+
In reinforcement learning, the algorithm gets to choose an action in response to each data point. It is a common approach in robotics, where the set of sensor readings at one point in time is a data point, and the algorithm must choose the robot’s next action. It's also a natural fit for Internet of Things applications. The learning algorithm also receives a reward signal a short time later, indicating how good the decision was. Based on this signal, the algorithm modifies its strategy in order to achieve the highest reward.
5153

5254
## Next steps
5355

54-
* [Learn about studio in Azure Machine Learning and the Azure portal](overview-what-is-azure-ml.md).
56+
* See additional guidance on [How to select algorithms](how-to-select-algorithms.md)
5557

56-
* See a list of algorithms and modules in the [Algorithm and module reference](algorithm-module-reference/module-reference.md).
58+
* [Learn about studio in Azure Machine Learning and the Azure portal](overview-what-is-azure-ml.md).
5759

5860
* [Tutorial: Build a prediction model in Azure Machine Learning designer](tutorial-designer-automobile-price-train-score.md).
5961

articles/machine-learning/algorithm-module-reference/module-reference.md

Lines changed: 19 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -17,25 +17,35 @@ This reference content provides the technical background on each of the machine
1717

1818
Each module represents a set of code that can run independently and perform a machine learning task, given the required inputs. A module might contain a particular algorithm, or perform a task that is important in machine learning, such as missing value replacement, or statistical analysis.
1919

20+
For help with choosing algorithms, see
21+
* [How to select algorithms](../how-to-select-algorithms.md)
22+
* [Azure Machine Learning Algorithm Cheat Sheet](../algorithm-cheat-sheet.md)
23+
2024
> [!TIP]
2125
> In any pipeline in the designer, you can get information about a specific module. Select the module, then select the **more help** link in the **Quick Help** pane.
2226
23-
## Modules
27+
## Data preparation modules
2428

25-
Modules are organized by functionality:
2629

2730
| Functionality | Description | Module |
2831
| --- |--- | --- |
29-
| | **Data preparation**: | |
3032
| Data input and output | Move data from cloud sources into your pipeline. Write your results or intermediate data to Azure Storage, a SQL database, or Hive, while running a pipeline, or use cloud storage to exchange data between pipelines. | [Enter Data Manually](enter-data-manually.md) <br/> [Export Data](export-data.md) <br/> [Import Data](import-data.md) |
3133
| Data transformation | Operations on data that are unique to machine learning, such as normalizing or binning data, dimensionality reduction, and converting data among various file formats.| [Add Columns](add-columns.md) <br/> [Add Rows](add-rows.md) <br/> [Apply Math Operation](apply-math-operation.md) <br/> [Apply SQL Transformation](apply-sql-transformation.md) <br/> [Clean Missing Data](clean-missing-data.md) <br/> [Clip Values](clip-values.md) <br/> [Convert to CSV](convert-to-csv.md) <br/> [Convert to Dataset](convert-to-dataset.md) <br/> [Edit Metadata](edit-metadata.md) <br/> [Join Data](join-data.md) <br/> [Normalize Data](normalize-data.md) <br/> [Partition and Sample](partition-and-sample.md) <br/> [Remove Duplicate Rows](remove-duplicate-rows.md) <br/> [SMOTE](smote.md) <br/> [Select Columns Transform](select-columns-transform.md) <br/> [Select Columns in Dataset](select-columns-in-dataset.md) <br/> [Split Data](split-data.md) |
3234
| Feature Selection | Select a subset of relevant, useful features to use in building an analytical model. | [Filter Based Feature Selection](filter-based-feature-selection.md) <br/> [Permutation Feature Importance](permutation-feature-importance.md) |
3335
| Statistical Functions | Provide a wide variety of statistical methods related to data science. | [Summarize Data](summarize-data.md)|
34-
| | **Machine learning algorithms**: | |
36+
37+
## Machine learning algorithms
38+
39+
| Functionality | Description | Module |
40+
| --- |--- | --- |
3541
| Regression | Predict a value. | [Boosted Decision Tree Regression](boosted-decision-tree-regression.md) <br/> [Decision Forest Regression](decision-forest-regression.md) <br/> [Linear Regression](linear-regression.md) <br/> [Neural Network Regression](neural-network-regression.md) <br/> |
3642
| Clustering | Group data together.| [K-Means Clustering](k-means-clustering.md)
3743
| Classification | Predict a class. Choose from binary (two-class) or multiclass algorithms.| [Multiclass Boosted Decision Tree](multiclass-boosted-decision-tree.md) <br/> [Multiclass Decision Forest](multiclass-decision-forest.md) <br/> [Multiclass Logistic Regression](multiclass-logistic-regression.md) <br/> [Multiclass Neural Network](multiclass-neural-network.md) <br/> [One vs. All Multiclass](one-vs-all-multiclass.md) <br/> [Two-Class Averaged Perceptron](two-class-averaged-perceptron.md) <br/> [Two-Class Boosted Decision Tree](two-class-boosted-decision-tree.md) <br/> [Two-Class Decision Forest](two-class-decision-forest.md) <br/> [Two-Class Logistic Regression](two-class-logistic-regression.md) <br/> [Two-Class Neural Network](two-class-neural-network.md) <br/> [Two Class Support Vector Machine](two-class-support-vector-machine.md) |
38-
| | **Build and evaluate models**: | |
44+
45+
## Modules for building and evaluating models
46+
47+
| Functionality | Description | Module |
48+
| --- |--- | --- |
3949
| Model training | Run data through the algorithm. | [Train Clustering Model](train-clustering-model.md) <br/> [Train Model](train-model.md) <br/> [Tune Model Hyperparameters](tune-model-hyperparameters.md) |
4050
| Model Scoring and Evaluation | Measure the accuracy of the trained model. | [Apply Transformation](apply-transformation.md) <br/> [Assign Data to Clusters](assign-data-to-clusters.md) <br/> [Cross Validate Model](cross-validate-model.md) <br/> [Evaluate Model](evaluate-model.md) <br/> [Score Model](score-model.md) |
4151
| Python language | Write code and embed it in a module to integrate Python with your pipeline. | [Create Python Model](create-python-model.md) <br/> [Execute Python Script](execute-python-script.md) |
@@ -46,3 +56,7 @@ Modules are organized by functionality:
4656
## Error messages
4757

4858
Learn about the [error messages and exception codes](designer-error-codes.md) you might encounter using modules in Azure Machine Learning designer.
59+
60+
## Next steps
61+
62+
* [Tutorial: Build a model in designer to predict auto prices](../tutorial-designer-automobile-price-train-score.md)

0 commit comments

Comments
 (0)