articles/hdinsight/hadoop/apache-hadoop-deep-dive-advanced-analytics.md
Lines changed: 4 additions & 3 deletions
@@ -10,6 +10,7 @@ ms.topic: conceptual
 ms.date: 11/14/2017
 ms.author: ashishth
 ---
+
 # Deep dive - advanced analytics
 
 ## What is advanced analytics for HDInsight?
@@ -28,7 +29,7 @@ After you create, load, test, and evaluate your model, the next step is to deplo
 
 Advanced analytics solutions provide a set of machine learning algorithms. Here is a summary of the categories of algorithms and associated common business use cases.
 
-
+
 
 Along with selecting the best-fitting algorithm(s), you need to consider whether or not you need to provide data for training. Machine learning algorithms are categorized as follows:
 
@@ -93,7 +94,7 @@ There are three key tasks in this advanced analytics scenario:
 
 This example uses the CIFAR-10 image set compiled and distributed by Alex Krizhevsky, Vinod Nair, and Geoffrey Hinton. The CIFAR-10 dataset contains 60,000 32×32 color images belonging to 10 mutually exclusive classes:
 For more details on the dataset, see Alex Krizhevsky’s [Learning Multiple Layers of Features from Tiny Images](https://www.cs.toronto.edu/~kriz/learning-features-2009-TR.pdf).
 
@@ -106,7 +107,7 @@ The dataset was partitioned into a training set of 50,000 images and a test set
 
 The entire preprocessing/scoring of the 10,000 images takes less than one minute on a cluster with 4 worker nodes. The model accurately predicts the labels of ~9,100 (91%) images. A confusion matrix illustrates the most common classification errors. For example, the matrix shows that mislabeling dogs as cats and vice versa occurs more frequently than for other label pairs.
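
The confusion-matrix reading described in the context line above can be illustrated with a minimal Python sketch. It is not taken from the article or the HDInsight example: the helper names (`confusion_matrix`, `accuracy`, `most_confused_pair`) and the ten toy labels are assumptions made purely for illustration, and the real scenario scores 10,000 CIFAR-10 images across 10 classes.

```python
from collections import Counter

def confusion_matrix(true_labels, predicted_labels, classes):
    """Return a nested dict where matrix[true][predicted] is a count."""
    counts = Counter(zip(true_labels, predicted_labels))
    return {t: {p: counts[(t, p)] for p in classes} for t in classes}

def accuracy(matrix):
    """Fraction of samples on the diagonal (correctly labeled)."""
    correct = sum(matrix[c][c] for c in matrix)
    total = sum(sum(row.values()) for row in matrix.values())
    return correct / total

def most_confused_pair(matrix):
    """Return the off-diagonal (true, predicted) pair with the most errors."""
    errors = {
        (t, p): n
        for t, row in matrix.items()
        for p, n in row.items()
        if t != p
    }
    return max(errors, key=errors.get)

# Hypothetical toy labels, NOT the CIFAR-10 results from the article.
classes = ["cat", "dog", "ship"]
true_labels = ["cat", "cat", "cat", "dog", "dog", "dog",
               "ship", "ship", "ship", "ship"]
predicted_labels = ["cat", "dog", "cat", "dog", "cat", "cat",
                    "ship", "ship", "ship", "ship"]

matrix = confusion_matrix(true_labels, predicted_labels, classes)
toy_accuracy = accuracy(matrix)
worst_pair = most_confused_pair(matrix)
```

In this toy data the diagonal holds 7 of 10 samples, and the largest off-diagonal cell is dogs labeled as cats, the same kind of pattern the article reports for the real model.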
articles/hdinsight/hadoop/apache-hadoop-emulator-get-started.md
Lines changed: 12 additions & 8 deletions
@@ -1,5 +1,5 @@
 ---
-title: Learn using an Apache Hadoop sandbox - emulator - Azure HDInsight
+title: Learn to use an Apache Hadoop sandbox - emulator - Azure HDInsight
 description: 'To start learning about using the Apache Hadoop ecosystem, you can set up a Hadoop sandbox from Hortonworks on an Azure virtual machine. '
 # Get started with an Apache Hadoop sandbox, an emulator on a virtual machine
 
 Learn how to install the Apache Hadoop sandbox from Hortonworks on a virtual machine to learn about the Hadoop ecosystem. The sandbox provides a local development environment to learn about Hadoop, Hadoop Distributed File System (HDFS), and job submission. Once you are familiar with Hadoop, you can start using Hadoop on Azure by creating an HDInsight cluster. For more information on how to get started, see [Get started with Hadoop on HDInsight](apache-hadoop-linux-tutorial-get-started.md).
 
 ## Prerequisites
-*[Oracle VirtualBox](https://www.virtualbox.org/). Download and install it from [here](https://www.virtualbox.org/wiki/Downloads).
 
+*[Oracle VirtualBox](https://www.virtualbox.org/). Download and install it from [here](https://www.virtualbox.org/wiki/Downloads).
 
 ## Download and install the virtual machine
+
 1. Browse to the [Cloudera downloads](https://www.cloudera.com/downloads/hortonworks-sandbox/hdp.html).
 
-2. Click **VIRTUALBOX** under **Choose Installation Type** to download the latest Hortonworks Sandbox on a VM. Sign in or complete the product interest form.
+1. Click **VIRTUALBOX** under **Choose Installation Type** to download the latest Hortonworks Sandbox on a VM. Sign in or complete the product interest form.
 
 1. Click the button **HDP SANDBOX (LATEST)** to begin the download.
 
@@ -34,11 +35,12 @@ To download an older HDP version sandbox, see the links under **Older Versions**
 ## Start the virtual machine
 
 1. Open Oracle VM VirtualBox.
-2. From the **File** menu, click **Import Appliance**, and then specify the Hortonworks Sandbox image.
+1. From the **File** menu, click **Import Appliance**, and then specify the Hortonworks Sandbox image.
 1. Select the Hortonworks Sandbox, click **Start**, and then **Normal Start**. Once the virtual machine has finished the boot process, it displays login instructions.