Skip to content

Commit 185100e

Browse files
Merge pull request #107341 from rolyon/rolyon-landing-data-lake-store
[Data Lake Storage Gen1] landing page update
2 parents 9923458 + 3970456 commit 185100e

File tree

5 files changed

+119
-219
lines changed

5 files changed

+119
-219
lines changed

articles/azure-databricks/what-is-azure-databricks.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,7 @@ Azure Databricks is an Apache Spark-based analytics platform optimized for the M
1818

1919
![What is Azure Databricks?](./media/what-is-azure-databricks/azure-databricks-overview.png "What is Azure Databricks?")
2020

21-
Azure Databricks is a fast, easy, and collaborative Apache Spark-based analytics service. For a big data pipeline, the data (raw or structured) is ingested into Azure through Azure Data Factory in batches, or streamed near real-time using Kafka, Event Hub, or IoT Hub. This data lands in a data lake for long term persisted storage, in Azure Blob Storage or Azure Data Lake Storage. As part of your analytics workflow, use Azure Databricks to read data from multiple data sources such as [Azure Blob Storage](../storage/blobs/storage-blobs-introduction.md), [Azure Data Lake Storage](../data-lake-store/index.md), [Azure Cosmos DB](../cosmos-db/index.yml), or [Azure SQL Data Warehouse](../sql-data-warehouse/index.yml) and turn it into breakthrough insights using Spark.
21+
Azure Databricks is a fast, easy, and collaborative Apache Spark-based analytics service. For a big data pipeline, the data (raw or structured) is ingested into Azure through Azure Data Factory in batches, or streamed near real-time using Kafka, Event Hub, or IoT Hub. This data lands in a data lake for long term persisted storage, in Azure Blob Storage or Azure Data Lake Storage. As part of your analytics workflow, use Azure Databricks to read data from multiple data sources such as [Azure Blob Storage](../storage/blobs/storage-blobs-introduction.md), [Azure Data Lake Storage](../data-lake-store/index.yml), [Azure Cosmos DB](../cosmos-db/index.yml), or [Azure SQL Data Warehouse](../sql-data-warehouse/index.yml) and turn it into breakthrough insights using Spark.
2222

2323
![Databricks pipeline](./media/what-is-azure-databricks/databricks-pipeline.png)
2424

articles/data-lake-store/TOC.yml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
1-
- name: Data Lake Storage Gen1 Documentation
2-
href: index.md
1+
- name: Data Lake Storage Gen1 documentation
2+
href: index.yml
33
- name: Switch to Data Lake Storage Gen2 documentation
44
href: https://docs.microsoft.com/azure/storage/data-lake-storage/introduction
55
- name: Overview

articles/data-lake-store/index.md

Lines changed: 0 additions & 214 deletions
This file was deleted.

articles/data-lake-store/index.yml

Lines changed: 114 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,114 @@
1+
### YamlMime:Landing
2+
3+
title: Azure Data Lake Storage Gen1 documentation
4+
summary: Learn how to set up, manage, and access a hyper-scale, Hadoop-compatible data lake repository for analytics on data of any size, type, and ingestion speed. Azure Data Lake Storage Gen2 is generally available. We recommend that you start using it today.
5+
6+
metadata:
7+
title: Azure Data Lake Storage Gen1 documentation
8+
description: Learn how to set up, manage, and access a hyper-scale, Hadoop-compatible data lake repository for analytics on data of any size, type, and ingestion speed.
9+
ms.service: data-lake-store
10+
ms.topic: landing-page
11+
author: twooley
12+
ms.author: twooley
13+
ms.date: 03/11/2020
14+
15+
# linkListType: architecture | concept | deploy | download | get-started | how-to-guide | learn | overview | quickstart | reference | sample | tutorial | video | whats-new
16+
17+
landingContent:
18+
# Card
19+
- title: About Azure Data Lake Storage Gen1
20+
linkLists:
21+
- linkListType: overview
22+
links:
23+
- text: What is Azure Data Lake Storage Gen1?
24+
url: data-lake-store-overview.md
25+
- text: Compare with Azure Storage
26+
url: data-lake-store-comparison-with-blob-storage.md
27+
- text: Switch to Data Lake Storage Gen2 documentation
28+
url: ../storage/data-lake-storage/introduction.md
29+
30+
# Card
31+
- title: Get started
32+
linkLists:
33+
- linkListType: get-started
34+
links:
35+
- text: Azure portal
36+
url: data-lake-store-get-started-portal.md
37+
- text: Azure PowerShell
38+
url: data-lake-store-get-started-powershell.md
39+
- text: Azure CLI
40+
url: data-lake-store-get-started-cli-2.0.md
41+
42+
# Card
43+
- title: Load and move data
44+
linkLists:
45+
- linkListType: how-to-guide
46+
links:
47+
- text: Using Azure Data Factory
48+
url: ../data-factory/load-azure-data-lake-store.md
49+
- text: Using Storage Explorer
50+
url: data-lake-store-in-storage-explorer.md
51+
- text: Using AdlCopy
52+
url: data-lake-store-copy-data-azure-storage-blob.md
53+
- text: Using DistCp
54+
url: data-lake-store-copy-data-wasb-distcp.md
55+
- text: Using Sqoop
56+
url: data-lake-store-data-transfer-sql-sqoop.md
57+
58+
# Card
59+
- title: Secure data
60+
linkLists:
61+
- linkListType: overview
62+
links:
63+
- text: Security overview
64+
url: data-lake-store-security-overview.md
65+
- text: Access control
66+
url: data-lake-store-access-control.md
67+
- linkListType: how-to-guide
68+
links:
69+
- text: Secure stored data
70+
url: data-lake-store-secure-data.md
71+
- text: Encryption
72+
url: data-lake-store-encryption.md
73+
- text: Virtual network integration
74+
url: data-lake-store-network-security.md
75+
76+
# Card
77+
- title: Authentication
78+
linkLists:
79+
- linkListType: overview
80+
links:
81+
- text: Authentication options
82+
url: data-lakes-store-authentication-using-azure-active-directory.md
83+
- linkListType: how-to-guide
84+
links:
85+
- text: End-user authentication
86+
url: data-lake-store-end-user-authenticate-using-active-directory.md
87+
- text: Service-to-service authentication
88+
url: data-lake-store-service-to-service-authenticate-using-active-directory.md
89+
90+
# Card
91+
- title: Filesystem operations
92+
linkLists:
93+
- linkListType: how-to-guide
94+
links:
95+
- text: .NET SDK
96+
url: data-lake-store-data-operations-net-sdk.md
97+
- text: Java SDK
98+
url: data-lake-store-get-started-java-sdk.md
99+
- text: REST API
100+
url: data-lake-store-data-operations-rest-api.md
101+
- text: Python
102+
url: data-lake-store-data-operations-python.md
103+
104+
# Card
105+
- title: Account management operations
106+
linkLists:
107+
- linkListType: how-to-guide
108+
links:
109+
- text: .NET SDK
110+
url: data-lake-store-get-started-net-sdk.md
111+
- text: REST API
112+
url: data-lake-store-get-started-rest-api.md
113+
- text: Python
114+
url: data-lake-store-get-started-python.md

articles/hdinsight/hadoop/apache-hadoop-on-premises-migration-best-practices-storage.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -47,7 +47,7 @@ You can create [blob snapshots](https://docs.microsoft.com/rest/api/storageservi
4747
4848
The following methods can be used to import certificates into the Java trust store:
4949

50-
Download the Azure Blob ssl cert to a file
50+
Download the Azure Blob SSL cert to a file
5151

5252
```bash
5353
echo -n | openssl s_client -connect <storage-account>.blob.core.windows.net:443 | sed -ne '/-BEGIN CERTIFICATE-/,/-END CERTIFICATE-/p' > Azure_Storage.cer
@@ -87,7 +87,7 @@ For more information, see the following articles:
8787

8888
Azure Data Lake Storage Gen2 is the latest storage offering. It unifies the core capabilities from the first generation of Azure Data Lake Storage with a Hadoop compatible file system endpoint directly integrated into Azure Blob Storage. This enhancement combines the scale and cost benefits of object storage with the reliability and performance typically associated only with on-premises file systems.
8989

90-
ADLS Gen 2 is built on top of [Azure Blob storage](../../storage/blobs/storage-blobs-introduction.md) and allows you to interface with data using both file system and object storage paradigms. Features from [Azure Data Lake Storage Gen1](../../data-lake-store/index.md), such as file system semantics, file-level security, and scale are combined with low-cost, tiered storage, high availability/disaster recovery capabilities, and a large SDK/tooling ecosystem from [Azure Blob storage](../../storage/blobs/storage-blobs-introduction.md). In Data Lake Storage Gen2, all the qualities of object storage remain while adding the advantages of a file system interface optimized for analytics workloads.
90+
ADLS Gen 2 is built on top of [Azure Blob storage](../../storage/blobs/storage-blobs-introduction.md) and allows you to interface with data using both file system and object storage paradigms. Features from [Azure Data Lake Storage Gen1](../../data-lake-store/index.yml), such as file system semantics, file-level security, and scale are combined with low-cost, tiered storage, high availability/disaster recovery capabilities, and a large SDK/tooling ecosystem from [Azure Blob storage](../../storage/blobs/storage-blobs-introduction.md). In Data Lake Storage Gen2, all the qualities of object storage remain while adding the advantages of a file system interface optimized for analytics workloads.
9191

9292
A fundamental feature of Data Lake Storage Gen2 is the addition of a [hierarchical namespace](../../storage/data-lake-storage/namespace.md) to the Blob storage service, which organizes objects/files into a hierarchy of directories for performant data access. The hierarchical structure enables operations such as renaming or deleting a directory to be single atomic metadata operations on the directory rather than enumerating and processing all objects that share the name prefix of the directory.
9393

0 commit comments

Comments
 (0)