Commit b4bdda7

Merge pull request #279229 from spelluru/ehubfreshness0625
Freshness review of Event Hubs articles
2 parents 3bf19c7 + 13a8735 commit b4bdda7

1 file changed: +12 -11 lines changed

articles/event-hubs/event-hubs-capture-overview.md

Lines changed: 12 additions & 11 deletions
Original file line number | Diff line number | Diff line change
@@ -1,12 +1,13 @@
11
---
2-
title: Capture streaming events - Azure Event Hubs | Microsoft Docs
2+
title: Capture streaming events
33
description: This article provides an overview of the Capture feature that allows you to capture events streaming through Azure Event Hubs.
4-
ms.topic: article
5-
ms.date: 05/16/2023
4+
ms.topic: concept-article
5+
ms.date: 06/25/2024
6+
#customer intent: As a developer, I want to know how to capture events flowing through an event hub in Azure Blob Storage or Azure Data Lake Storage.
67
---
78

89
# Capture events through Azure Event Hubs in Azure Blob Storage or Azure Data Lake Storage
9-
Azure Event Hubs enables you to automatically capture the streaming data in Event Hubs in an [Azure Blob storage](https://azure.microsoft.com/services/storage/blobs/) or [Azure Data Lake Storage Gen 1 or Gen 2](https://azure.microsoft.com/services/data-lake-store/) account of your choice, with the added flexibility of specifying a time or size interval. Setting up Capture is fast, there are no administrative costs to run it, and it scales automatically with Event Hubs [throughput units](event-hubs-scalability.md#throughput-units) in the standard tier or [processing units](event-hubs-scalability.md#processing-units) in the premium tier. Event Hubs Capture is the easiest way to load streaming data into Azure, and enables you to focus on data processing rather than on data capture.
10+
Azure Event Hubs enables you to automatically capture the data streaming through Event Hubs in [Azure Blob storage](https://azure.microsoft.com/services/storage/blobs/) or [Azure Data Lake Storage Gen 1 or Gen 2](https://azure.microsoft.com/services/data-lake-store/) account of your choice. It also provides the flexibility for you to specify a time or a size interval. Enabling or setting up the Event Hubs Capture feature is fast. There are no administrative costs to run it, and it scales automatically with Event Hubs [throughput units](event-hubs-scalability.md#throughput-units) in the standard tier or [processing units](event-hubs-scalability.md#processing-units) in the premium tier. Event Hubs Capture is the easiest way to load streaming data into Azure, and enables you to focus on data processing rather than on data capture.
1011

1112
:::image type="content" source="./media/event-hubs-features/capture.png" alt-text="Image showing capturing of Event Hubs data into Azure Storage or Azure Data Lake Storage":::
1213

@@ -18,7 +19,7 @@ Event Hubs Capture enables you to process real-time and batch-based pipelines on
1819
> [!IMPORTANT]
1920
> - The destination storage (Azure Storage or Azure Data Lake Storage) account must be in the same subscription as the event hub when not using managed identity for authentication.
2021
> - Event Hubs doesn't support capturing events in a premium storage account.
21-
> - Event Hubs capture supports any non-premium Azure storage account with support for block blobs.
22+
> - Event Hubs Capture supports any non-premium Azure storage account with support for block blobs.
2223
2324
## How Event Hubs Capture works
2425

@@ -45,11 +46,11 @@ The date values are padded with zeroes; an example filename might be:
4546
https://mystorageaccount.blob.core.windows.net/mycontainer/mynamespace/myeventhub/0/2017/12/08/03/03/17.avro
4647
```
4748

48-
If your Azure storage blob is temporarily unavailable, Event Hubs Capture will retain your data for the data retention period configured on your event hub and back fill the data once your storage account is available again.
49+
If your Azure storage blob is temporarily unavailable, Event Hubs Capture retains your data for the data retention period configured on your event hub and backfills the data once your storage account is available again.
4950

5051
### Scaling throughput units or processing units
5152

52-
In the standard tier of Event Hubs, the traffic is controlled by [throughput units](event-hubs-scalability.md#throughput-units) and in the premium tier Event Hubs, it's controlled by [processing units](event-hubs-scalability.md#processing-units). Event Hubs Capture copies data directly from the internal Event Hubs storage, bypassing throughput unit or processing unit egress quotas and saving your egress for other processing readers, such as Stream Analytics or Spark.
53+
In the standard tier of Event Hubs, [throughput units](event-hubs-scalability.md#throughput-units) control the traffic, and in the premium tier, [processing units](event-hubs-scalability.md#processing-units) control the traffic. Event Hubs Capture copies data directly from the internal Event Hubs storage, bypassing throughput unit or processing unit egress quotas and saving your egress for other processing readers, such as Stream Analytics or Spark.
5354

5455
Once configured, Event Hubs Capture runs automatically when you send your first event, and continues running. To make it easier for your downstream processing to know that the process is working, Event Hubs writes empty files when there's no data. This process provides a predictable cadence and marker that can feed your batch processors.
5556

@@ -65,7 +66,7 @@ You can configure Capture at the event hub creation time using the [Azure portal
6566
6667
## How Event Hubs Capture is charged
6768

68-
The capture feature is included in the premium tier so there is no additional charge for that tier. For the Standard tier, the feature is charged monthly, and the charge is directly proportional to the number of throughput units or processing units purchased for the namespace. As throughput units or processing units are increased and decreased, Event Hubs Capture meters increase and decrease to provide matching performance. The meters occur in tandem. For pricing details, see [Event Hubs pricing](https://azure.microsoft.com/pricing/details/event-hubs/).
69+
The capture feature is included in the premium tier so there's no extra charge for that tier. For the Standard tier, the feature is charged monthly, and the charge is directly proportional to the number of throughput units or processing units purchased for the namespace. As throughput units or processing units are increased and decreased, Event Hubs Capture meters increase and decrease to provide matching performance. The meters occur in tandem. For pricing details, see [Event Hubs pricing](https://azure.microsoft.com/pricing/details/event-hubs/).
6970

7071
Capture doesn't consume egress quota as it is billed separately.
7172

@@ -76,15 +77,15 @@ You can create an Azure Event Grid subscription with an Event Hubs namespace as
7677
To learn how to explore captured Avro files, see [Explore captured Avro files](explore-captured-avro-files.md).
7778

7879
## Azure Storage account as a destination
79-
To enable capture on an event hub with Azure Storage as the capture destination, or update properties on an event hub with Azure Storage as the capture destination, the user or service principal must have an RBAC role with the following permissions assigned at the storage account scope. 
80+
To enable capture on an event hub with Azure Storage as the capture destination, or update properties on an event hub with Azure Storage as the capture destination, the user or service principal must have a role-based access control (RBAC) role with the following permissions assigned at the storage account scope.
8081

8182
```
8283
Microsoft.Storage/storageAccounts/blobServices/containers/write
8384
Microsoft.Storage/storageAccounts/blobServices/containers/blobs/write
8485
```
8586

8687

87-
Without above permission, you will see below error: 
88+
Without these permissions, you see the following error:
8889

8990
```
9091
Generic: Linked access check failed for capture storage destination <StorageAccount Arm Id>.
@@ -95,7 +96,7 @@ TrackingId:<ID>, SystemTracker:mynamespace.servicebus.windows.net:myhub, Timesta
9596

9697
[Storage Blob Data Owner](../role-based-access-control/built-in-roles.md#storage-blob-data-owner) is a built-in role that includes the above permissions, so assign this role to the user account or the service principal.
9798

98-
## Next steps
99+
## Related content
99100
Event Hubs Capture is the easiest way to get data into Azure. Using Azure Data Lake, Azure Data Factory, and Azure HDInsight, you can perform batch processing and other analytics using familiar tools and platforms of your choosing, at any scale you need.
100101

101102
Learn how to enable this feature using the Azure portal and Azure Resource Manager template:
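
Reviewer note on the capture file naming shown in the diff context: the example URL (`.../mycontainer/mynamespace/myeventhub/0/2017/12/08/03/03/17.avro`) suggests a path layout of container, namespace, event hub, partition ID, then zero-padded year, month, day, hour, minute, and second. The following is a minimal Python sketch of how a downstream batch job might split such a URL into its parts. It assumes that layout holds (a custom naming format would break it), and `CaptureBlob` and `parse_capture_blob_url` are hypothetical names introduced here for illustration, not part of any Azure SDK. The timestamp is returned as a naive `datetime` because the diff does not state a time zone.

```python
from datetime import datetime
from typing import NamedTuple
from urllib.parse import urlparse


class CaptureBlob(NamedTuple):
    """Hypothetical container for the parts of a capture blob path."""
    container: str
    namespace: str
    event_hub: str
    partition_id: str
    window_time: datetime  # naive: no time zone is assumed


def parse_capture_blob_url(url: str) -> CaptureBlob:
    """Split a capture blob URL into its components.

    Assumes the layout seen in the example filename:
    container/namespace/eventhub/partition/YYYY/MM/DD/HH/mm/ss.avro
    """
    parts = urlparse(url).path.strip("/").split("/")
    if len(parts) != 10 or not parts[-1].endswith(".avro"):
        raise ValueError(f"Unexpected capture blob path: {url}")

    container, namespace, event_hub, partition_id = parts[:4]
    year, month, day, hour, minute = (int(p) for p in parts[4:9])
    second = int(parts[9][: -len(".avro")])  # strip the file extension

    return CaptureBlob(
        container=container,
        namespace=namespace,
        event_hub=event_hub,
        partition_id=partition_id,
        window_time=datetime(year, month, day, hour, minute, second),
    )


if __name__ == "__main__":
    example = (
        "https://mystorageaccount.blob.core.windows.net/"
        "mycontainer/mynamespace/myeventhub/0/2017/12/08/03/03/17.avro"
    )
    print(parse_capture_blob_url(example))
```

Keeping the partition ID as a string avoids assuming anything about its numeric range; a consumer that needs to sort by partition can convert it at that point.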
