You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/lab-services/classroom-labs/class-type-big-data-analytics.md
+5-5Lines changed: 5 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,9 +1,9 @@
1
1
---
2
-
title: Set up a lab to teach big data analytics | Microsoft Docs
2
+
title: Set up a lab to teach big data analytics using Azure Lab Services | Microsoft Docs
3
3
description: Learn how to set up a lab to teach the big data analytics using Docker deployment of Hortonworks Data Platform (HDP).
4
4
services: lab-services
5
5
documentationcenter: na
6
-
author: emaher
6
+
author: nicolela
7
7
manager:
8
8
editor: ''
9
9
@@ -18,7 +18,7 @@ ms.author: nicolela
18
18
---
19
19
# Set up a lab to big data analytics using Docker deployment of HortonWorks Data Platform
20
20
21
-
This article shows you how to set up a lab to teach a big data analytics class. With this type of class, students learn how to handle large volumes of data and apply machine\statistical learning algorithms to derive data insights. A key objective for students is to learn to use data analytics tools, such as [Apache Hadoop's open-source software package](https://hadoop.apache.org/) which provides tools for storing, managing, and processing big data.
21
+
This article shows you how to set up a lab to teach a big data analytics class. With this type of class, students learn how to handle large volumes of data and apply machine and statistical learning algorithms to derive data insights. A key objective for students is to learn to use data analytics tools, such as [Apache Hadoop's open-source software package](https://hadoop.apache.org/) which provides tools for storing, managing, and processing big data.
22
22
23
23
In this lab, students will use a popular commercial version of Hadoop provided by [Cloudera](https://www.cloudera.com/), called [Hortonworks Data Platform (HDP)](https://www.cloudera.com/products/hdp.html). Specifically, students will use [HDP Sandbox 3.0.1](https://www.cloudera.com/tutorials/getting-started-with-hdp-sandbox/1.html) which is a simplified, easy-to-use version of the platform that is free of cost and intended for learning and experimentation. Although this class may use either Windows or Linux virtual machines (VM) with HDP Sandbox deployed, this article will show how to use Windows.
24
24
@@ -44,7 +44,7 @@ Use the settings in the table below when setting up a classroom lab. For more i
44
44
45
45
| Lab settings | Value/instructions |
46
46
| ------------ | ------------------ |
47
-
|Virtual Machine Size| Medium (Nested Virtualization). This VM size is best suited for relational databases, in-memory caching, and analytics. This size also supports nested virtualization.
47
+
|Virtual Machine Size| Medium (Nested Virtualization). This VM size is best suited for relational databases, in-memory caching, and analytics. This size also supports nested virtualization.|
48
48
|Virtual Machine Image| Windows 10 Pro|
49
49
50
50
> [!NOTE] We need to use Medium (Nested Virtualization) since deploying HDP Sandbox using Docker requires:
@@ -100,7 +100,7 @@ To provide an easy to use experience for students, we'll use a PowerShell script
100
100
- Starts the HDP Sandbox Docker containers when a student starts and connects to their lab VM.
101
101
- Launches the browser and navigates to the Sandbox Welcome Page.
102
102
We'll also use Windows Task Scheduler to automatically run this script when a student logs into their VM.
103
-
To set this up, follow these steps: [Big Data Analytics scripting](TODO link to)
103
+
To set this up, follow these steps: [Big Data Analytics scripting](https://github.com/Azure/azure-devtestlab/tree/master/samples/ClassroomLabs/Scripts/BigDataAnalytics/).
0 commit comments