Skip to content

Commit 76a1d41

Browse files
authored
Merge pull request #111321 from dagiro/freshness50
freshness50
2 parents 8380d35 + e741660 commit 76a1d41

File tree

1 file changed

+9
-9
lines changed

1 file changed

+9
-9
lines changed

articles/hdinsight/interactive-query/interactive-query-tutorial-analyze-flight-data.md

Lines changed: 9 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -1,19 +1,19 @@
11
---
22
title: 'Tutorial: ETL operations with Interactive Query - Azure HDInsight'
3-
description: Tutorial - Learn how to extract data from a raw CSV dataset, transform it using Interactive Query on HDInsight, and then load the transformed data into Azure SQL database by using Apache Sqoop.
3+
description: Tutorial - Learn how to extract data from a raw CSV dataset. Transform it using Interactive Query on HDInsight. Then load the transformed data into Azure SQL database by using Apache Sqoop.
44
author: hrasheed-msft
5+
ms.author: hrasheed
56
ms.reviewer: jasonh
67
ms.service: hdinsight
78
ms.topic: tutorial
8-
ms.date: 07/02/2019
9-
ms.author: hrasheed
109
ms.custom: hdinsightactive,mvc
10+
ms.date: 07/02/2019
1111
#Customer intent: As a data analyst, I need to load some data using Interactive Query, transform, and then export it to an Azure SQL database
1212
---
1313

1414
# Tutorial: Extract, transform, and load data using Interactive Query in Azure HDInsight
1515

16-
In this tutorial, you take a raw CSV data file of publicly available flight data, import it into HDInsight cluster storage, and then transform the data using Interactive Query in Azure HDInsight. Once the data is transformed, you load that data into an Azure SQL database using [Apache Sqoop](https://sqoop.apache.org/).
16+
In this tutorial, you download a raw CSV data file of publicly available flight data. Import it into HDInsight cluster storage, and then transform the data using Interactive Query in Azure HDInsight. Once the data is transformed, you load that data into an Azure SQL database using [Apache Sqoop](https://sqoop.apache.org/).
1717

1818
This tutorial covers the following tasks:
1919

@@ -42,7 +42,7 @@ This tutorial covers the following tasks:
4242
| --- | --- |
4343
| Filter Year |2019 |
4444
| Filter Period |January |
45-
| Fields |Year, FlightDate, Reporting_Airline, DOT_ID_Reporting_Airline, Flight_Number_Reporting_Airline, OriginAirportID, Origin, OriginCityName, OriginState, DestAirportID, Dest, DestCityName, DestState, DepDelayMinutes, ArrDelay, ArrDelayMinutes, CarrierDelay, WeatherDelay, NASDelay, SecurityDelay, LateAircraftDelay. |
45+
| Fields |`Year, FlightDate, Reporting_Airline, DOT_ID_Reporting_Airline, Flight_Number_Reporting_Airline, OriginAirportID, Origin, OriginCityName, OriginState, DestAirportID, Dest, DestCityName, DestState, DepDelayMinutes, ArrDelay, ArrDelayMinutes, CarrierDelay, WeatherDelay, NASDelay, SecurityDelay, LateAircraftDelay`. |
4646

4747
3. Select **Download**. You get a .zip file with the data fields you selected.
4848

@@ -56,7 +56,7 @@ There are many ways to upload data to the storage associated with an HDInsight c
5656
scp FILENAME.zip [email protected]:FILENAME.zip
5757
```
5858
59-
If you are asked to enter yes or no to proceed, type yes at the command prompt and press enter. The text is not visible in the window as you type.
59+
Enter yes or no to continue if prompted. The text isn't visible in the window as you type.
6060
6161
2. After the upload has finished, connect to the cluster by using SSH. Edit the command below by replacing `CLUSTERNAME` with the name of the HDInsight cluster. Then enter the following command:
6262
@@ -280,13 +280,13 @@ In the previous sections, you copied the transformed data at `/tutorials/flightd
280280
GO
281281
```
282282
283-
You should see a listing of data in the table. The table includes the city name and the average flight delay time for that city.
283+
You should see a listing of data in the table. The table includes the city name and the average flight delay time for that city.
284284
285285
Type `exit` to exit the tsql utility.
286286
287287
## Clean up resources
288288
289-
After you complete the tutorial, you may want to delete the cluster. With HDInsight, your data is stored in Azure Storage, so you can safely delete a cluster when it is not in use. You are also charged for an HDInsight cluster, even when it is not in use. Since the charges for the cluster are many times more than the charges for storage, it makes economic sense to delete clusters when they are not in use.
289+
After you complete the tutorial, you may want to delete the cluster. With HDInsight, your data is stored in Azure Storage, so you can safely delete a cluster when it isn't in use. You're also charged for an HDInsight cluster, even when it isn't in use. Since the charges for the cluster are many times more than the charges for storage, it makes economic sense to delete clusters when they aren't in use.
290290
291291
To delete a cluster, see [Delete an HDInsight cluster using your browser, PowerShell, or the Azure CLI](../hdinsight-delete-cluster.md).
292292
@@ -295,4 +295,4 @@ To delete a cluster, see [Delete an HDInsight cluster using your browser, PowerS
295295
In this tutorial, you took a raw CSV data file, imported it into an HDInsight cluster storage, and then transformed the data using Interactive Query in Azure HDInsight. Advance to the next tutorial to learn about the Apache Hive Warehouse Connector.
296296
297297
> [!div class="nextstepaction"]
298-
>[Integrate Apache Spark and Apache Hive with the Hive Warehouse Connector](./apache-hive-warehouse-connector.md)
298+
> [Integrate Apache Spark and Apache Hive with the Hive Warehouse Connector](./apache-hive-warehouse-connector.md)

0 commit comments

Comments
 (0)