Skip to content

Commit 04510c1

Browse files
authored
Merge pull request #57558 from garyericson/11-08-tdsp-selectors-2
Removing selectors from TDSP articles - 2
2 parents 4b269ef + b10c230 commit 04510c1

File tree

4 files changed

+8
-17
lines changed

4 files changed

+8
-17
lines changed

articles/machine-learning/team-data-science-process/create-features-hive.md

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -25,9 +25,7 @@ The operations needed to create features can be memory intensive. The performanc
2525

2626
Examples of the queries that are presented are specific to the [NYC Taxi Trip Data](http://chriswhong.com/open-data/foil_nyc_taxi/) scenarios are also provided in [GitHub repository](https://github.com/Azure/Azure-MachineLearning-DataScience/tree/master/Misc/DataScienceProcess/DataScienceScripts). These queries already have data schema specified and are ready to be submitted to run. In the final section, parameters that users can tune so that the performance of Hive queries can be improved are also discussed.
2727

28-
[!INCLUDE [cap-create-features-data-selector](../../../includes/cap-create-features-selector.md)]
29-
30-
This **menu** links to topics that describe how to create features for data in various environments. This task is a step in the [Team Data Science Process (TDSP)](https://azure.microsoft.com/documentation/learning-paths/cortana-analytics-process/).
28+
This task is a step in the [Team Data Science Process (TDSP)](https://azure.microsoft.com/documentation/learning-paths/cortana-analytics-process/).
3129

3230
## Prerequisites
3331
This article assumes that you have:

articles/machine-learning/team-data-science-process/create-features-sql-server.md

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -21,9 +21,7 @@ ms.author: deguhath
2121
# Create features for data in SQL Server using SQL and Python
2222
This document shows how to generate features for data stored in a SQL Server VM on Azure that help algorithms learn more efficiently from the data. You can use SQL or a programming language like Python to accomplish this task. Both approaches are demonstrated here.
2323

24-
[!INCLUDE [cap-create-features-data-selector](../../../includes/cap-create-features-selector.md)]
25-
26-
This **menu** links to topics that describe how to create features for data in various environments. This task is a step in the [Team Data Science Process (TDSP)](https://azure.microsoft.com/documentation/learning-paths/cortana-analytics-process/).
24+
This task is a step in the [Team Data Science Process (TDSP)](https://azure.microsoft.com/documentation/learning-paths/cortana-analytics-process/).
2725

2826
> [!NOTE]
2927
> For a practical example, you can consult the [NYC Taxi dataset](http://www.andresmh.com/nyctaxitrips/) and refer to the IPNB titled [NYC Data wrangling using IPython Notebook and SQL Server](https://github.com/Azure/Azure-MachineLearning-DataScience/blob/master/Misc/DataScienceProcess/iPythonNotebooks/machine-Learning-data-science-process-sql-walkthrough.ipynb) for an end-to-end walk-through.

articles/machine-learning/team-data-science-process/create-features.md

Lines changed: 6 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -21,9 +21,7 @@ ms.author: deguhath
2121
# Feature engineering in data science
2222
This article explains the purposes of feature engineering and provides examples of its role in the data enhancement process of machine learning. The examples used to illustrate this process are drawn from Azure Machine Learning Studio.
2323

24-
[!INCLUDE [cap-create-features-data-selector](../../../includes/cap-create-features-selector.md)]
25-
26-
This **menu** links to articles that describe how to create features for data in various environments. This task is a step in the [Team Data Science Process (TDSP)](https://azure.microsoft.com/documentation/learning-paths/cortana-analytics-process/).
24+
This task is a step in the [Team Data Science Process (TDSP)](https://azure.microsoft.com/documentation/learning-paths/cortana-analytics-process/).
2725

2826
Feature engineering attempts to increase the predictive power of learning algorithms by creating features from raw data that help facilitate the learning process. The engineering and selection of features is one part of the TDSP outlined in the [What is the Team Data Science Process lifecycle?](overview.md) Feature engineering and selection are parts of the **Develop features** step of the TDSP.
2927

@@ -34,6 +32,11 @@ Normally **feature engineering** is applied first to generate additional feature
3432

3533
The training data used in machine learning can often be enhanced by extraction of features from the raw data collected. An example of an engineered feature in the context of learning how to classify the images of handwritten characters is creation of a bit density map constructed from the raw bit distribution data. This map can help locate the edges of the characters more efficiently than simply using the raw distribution directly.
3634

35+
To create features for data in specific environments, see the following articles:
36+
37+
* [Create features for data in SQL Server](create-features-sql-server.md)
38+
* [Create features for data in a Hadoop cluster using Hive queries](create-features-hive.md)
39+
3740
[!INCLUDE [machine-learning-free-trial](../../../includes/machine-learning-free-trial.md)]
3841

3942
## Create features from your data - feature engineering

includes/cap-create-features-selector.md

Lines changed: 0 additions & 8 deletions
This file was deleted.

0 commit comments

Comments
 (0)