Skip to content

Commit f4ea45f

Browse files
authored
Merge pull request #98330 from j-martens/patch-768
Update prepare-data.md
2 parents cefaa48 + 56bc98a commit f4ea45f

File tree

1 file changed

+4
-4
lines changed

1 file changed

+4
-4
lines changed

articles/machine-learning/team-data-science-process/prepare-data.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
---
2-
title: Clean and prepare data for Azure Machine Learning - Team Data Science Process
2+
title: Prep data for ML Studio (classic) - Team Data Science Process
33
description: Pre-process and clean data to prepare it to be used effectively for machine learning.
44
services: machine-learning
55
author: marktab
@@ -17,9 +17,9 @@ Pre-processing and cleaning data are important tasks that typically must be cond
1717

1818
Pre-processing and cleaning tasks, like the data exploration task, can be carried out in a wide variety of environments, such as SQL or Hive or Azure Machine Learning Studio, and with various tools and languages, such as R or Python, depending where your data is stored and how it is formatted. Since TDSP is iterative in nature, these tasks can take place at various steps in the workflow of the process.
1919

20-
This article introduces various data processing concepts and tasks that can be undertaken either before or after ingesting data into Azure Machine Learning.
20+
This article introduces various data processing concepts and tasks that can be undertaken either before or after ingesting data into Azure ML Studio (classic).
2121

22-
For an example of data exploration and pre-processing done inside Azure Machine Learning studio, see the [Pre-processing data in Azure Machine Learning Studio](https://azure.microsoft.com/documentation/videos/preprocessing-data-in-azure-ml-studio/) video.
22+
For an example of data exploration and pre-processing done inside Azure ML Studio (classic)., see the [Pre-processing data](https://azure.microsoft.com/documentation/videos/preprocessing-data-in-azure-ml-studio/) video.
2323

2424
## Why pre-process and clean data?
2525
Real world data is gathered from various sources and processes and it may contain irregularities or corrupt data compromising the quality of the dataset. The typical data quality issues that arise are:
@@ -45,7 +45,7 @@ We can check the general quality of data by checking:
4545

4646
When you find issues with data, **processing steps** are necessary which often involves cleaning missing values, data normalization, discretization, text processing to remove and/or replace embedded characters which may affect data alignment, mixed data types in common fields, and others.
4747

48-
**Azure Machine Learning consumes well-formed tabular data**. If the data is already in tabular form, data pre-processing can be performed directly with Azure Machine Learning in the Machine Learning Studio. If data is not in tabular form, say it is in XML, parsing may be required in order to convert the data to tabular form.
48+
**Azure Machine Learning consumes well-formed tabular data**. If the data is already in tabular form, data pre-processing can be performed directly with Azure ML Studio (classic) in the Machine Learning. If data is not in tabular form, say it is in XML, parsing may be required in order to convert the data to tabular form.
4949

5050
## What are some of the major tasks in data pre-processing?
5151
* **Data cleaning**: Fill in or missing values, detect and remove noisy data and outliers.

0 commit comments

Comments
 (0)