Skip to content

Commit 4bb1a3e

Browse files
committed
cats53
1 parent eaab682 commit 4bb1a3e

File tree

6 files changed

+16
-11
lines changed

6 files changed

+16
-11
lines changed

articles/hdinsight/hadoop/apache-hadoop-deep-dive-advanced-analytics.md

Lines changed: 4 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -10,6 +10,7 @@ ms.topic: conceptual
1010
ms.date: 11/14/2017
1111
ms.author: ashishth
1212
---
13+
1314
# Deep dive - advanced analytics
1415

1516
## What is advanced analytics for HDInsight?
@@ -28,7 +29,7 @@ After you create, load, test, and evaluate your model, the next step is to deplo
2829

2930
Advanced analytics solutions provide a set of machine learning algorithms. Here is a summary of the categories of algorithms and associated common business use cases.
3031

31-
![Machine Learning Use Cases](./media/apache-hadoop-deep-dive-advanced-analytics/ml-use-cases.png)
32+
![Machine Learning Use Cases](./media/apache-hadoop-deep-dive-advanced-analytics/machine-learning-use-cases.png)
3233

3334
Along with selecting the best-fitting algorithm(s), you need to consider whether or not you need to provide data for training. Machine learning algorithms are categorized as follows:
3435

@@ -93,7 +94,7 @@ There are three key tasks in this advanced analytics scenario:
9394

9495
This example uses the CIFAR-10 image set compiled and distributed by Alex Krizhevsky, Vinod Nair, and Geoffrey Hinton. The CIFAR-10 dataset contains 60,000 32×32 color images belonging to 10 mutually exclusive classes:
9596

96-
![Images](./media/apache-hadoop-deep-dive-advanced-analytics/ml-images.png)
97+
![Images](./media/apache-hadoop-deep-dive-advanced-analytics/machine-learning-images.png)
9798

9899
For more details on the dataset, see Alex Krizhevsky’s [Learning Multiple Layers of Features from Tiny Images](https://www.cs.toronto.edu/~kriz/learning-features-2009-TR.pdf).
99100

@@ -106,7 +107,7 @@ The dataset was partitioned into a training set of 50,000 images and a test set
106107

107108
The entire preprocessing/scoring of the 10,000 images takes less than one minute on a cluster with 4 worker nodes. The model accurately predicts the labels of ~9,100 (91%) images. A confusion matrix illustrates the most common classification errors. For example, the matrix shows that mislabeling dogs as cats and vice versa occurs more frequently than for other label pairs.
108109

109-
![Results](./media/apache-hadoop-deep-dive-advanced-analytics/ml-results.png)
110+
![Results](./media/apache-hadoop-deep-dive-advanced-analytics/machine-learning-results.png)
110111

111112
### Try it Out!
112113

articles/hdinsight/hadoop/apache-hadoop-emulator-get-started.md

Lines changed: 12 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
---
2-
title: Learn using an Apache Hadoop sandbox - emulator - Azure HDInsight
2+
title: Learn to use an Apache Hadoop sandbox - emulator - Azure HDInsight
33
description: 'To start learning about using the Apache Hadoop ecosystem, you can set up a Hadoop sandbox from Hortonworks on an Azure virtual machine. '
44
keywords: hadoop emulator,hadoop sandbox
55
ms.reviewer: jasonh
@@ -10,20 +10,21 @@ ms.custom: hdinsightactive,hdiseo17may2017
1010
ms.topic: conceptual
1111
ms.date: 05/29/2019
1212
ms.author: hrasheed
13-
1413
---
14+
1515
# Get started with an Apache Hadoop sandbox, an emulator on a virtual machine
1616

1717
Learn how to install the Apache Hadoop sandbox from Hortonworks on a virtual machine to learn about the Hadoop ecosystem. The sandbox provides a local development environment to learn about Hadoop, Hadoop Distributed File System (HDFS), and job submission. Once you are familiar with Hadoop, you can start using Hadoop on Azure by creating an HDInsight cluster. For more information on how to get started, see [Get started with Hadoop on HDInsight](apache-hadoop-linux-tutorial-get-started.md).
1818

1919
## Prerequisites
20-
* [Oracle VirtualBox](https://www.virtualbox.org/). Download and install it from [here](https://www.virtualbox.org/wiki/Downloads).
2120

21+
* [Oracle VirtualBox](https://www.virtualbox.org/). Download and install it from [here](https://www.virtualbox.org/wiki/Downloads).
2222

2323
## Download and install the virtual machine
24+
2425
1. Browse to the [Cloudera downloads](https://www.cloudera.com/downloads/hortonworks-sandbox/hdp.html).
2526

26-
2. Click **VIRTUALBOX** under **Choose Installation Type** to download the latest Hortonworks Sandbox on a VM. Sign in or complete the product interest form.
27+
1. Click **VIRTUALBOX** under **Choose Installation Type** to download the latest Hortonworks Sandbox on a VM. Sign in or complete the product interest form.
2728

2829
1. Click the button **HDP SANDBOX (LATEST)** to begin the download.
2930

@@ -34,11 +35,12 @@ To download an older HDP version sandbox, see the links under **Older Versions**
3435
## Start the virtual machine
3536

3637
1. Open Oracle VM VirtualBox.
37-
2. From the **File** menu, click **Import Appliance**, and then specify the Hortonworks Sandbox image.
38+
1. From the **File** menu, click **Import Appliance**, and then specify the Hortonworks Sandbox image.
3839
1. Select the Hortonworks Sandbox, click **Start**, and then **Normal Start**. Once the virtual machine has finished the boot process, it displays login instructions.
3940

40-
![Normal start](./media/apache-hadoop-emulator-get-started/normal-start.png)
41-
2. Open a web browser and navigate to the URL displayed (usually `http://127.0.0.1:8888`).
41+
![Normal start](./media/apache-hadoop-emulator-get-started/virtualbox-normal-start.png)
42+
43+
1. Open a web browser and navigate to the URL displayed (usually `http://127.0.0.1:8888`).
4244

4345
## Set Sandbox passwords
4446

@@ -68,7 +70,9 @@ To download an older HDP version sandbox, see the links under **Older Versions**
6870
select * from sample_07 limit 10;
6971

7072
## Next steps
73+
7174
* [Learn how to use Visual Studio with the Hortonworks Sandbox](../hdinsight-hadoop-emulator-visual-studio.md)
75+
7276
* [Learning the ropes of the Hortonworks Sandbox](https://hortonworks.com/hadoop-tutorial/learning-the-ropes-of-the-hortonworks-sandbox/)
73-
* [Hadoop tutorial - Getting started with HDP](https://hortonworks.com/hadoop-tutorial/hello-world-an-introduction-to-hadoop-hcatalog-hive-and-pig/)
7477

78+
* [Hadoop tutorial - Getting started with HDP](https://hortonworks.com/hadoop-tutorial/hello-world-an-introduction-to-hadoop-hcatalog-hive-and-pig/)

0 commit comments

Comments
 (0)