Skip to content

Commit f9a6f11

Browse files
authored
Improved Acrolinx Score
Improved Acrolinx Score
1 parent 607ee60 commit f9a6f11

File tree

1 file changed

+8
-8
lines changed

1 file changed

+8
-8
lines changed

articles/hdinsight-aks/flink/join-stream-kafka-table-filesystem.md

Lines changed: 8 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
---
22
title: Enrich the events from Apache Kafka® with the attributes from FileSystem with Apache Flink®
3-
description: Learn how to join stream from Kafka with table from fileSystem using Apache Flink® DataStream API
3+
description: Learn how to join stream from Kafka with table from fileSystem using Apache Flink® DataStream API.
44
ms.service: hdinsight-aks
55
ms.topic: how-to
66
ms.date: 03/14/2024
@@ -10,13 +10,13 @@ ms.date: 03/14/2024
1010

1111
[!INCLUDE [feature-in-preview](../includes/feature-in-preview.md)]
1212

13-
In this article, you can learn how you can enrich the real time events by joining a stream from Kafka with table on ADLS Gen2 using Flink Streaming. We use Flink Streaming API to join events from HDInsight Kafka with attributes from ADLS Gen2, further we use attributes-joined events to sink into another Kafka topic.
13+
In this article, you can learn how you can enrich the real time events by joining a stream from Kafka with table on ADLS Gen2 using Flink Streaming. We use Flink Streaming API to join events from HDInsight Kafka with attributes from ADLS Gen2. Further we use attributes-joined events to sink into another Kafka topic.
1414

1515
## Prerequisites
1616

1717
* [Flink cluster on HDInsight on AKS](../flink/flink-create-cluster-portal.md)
1818
* [Kafka cluster on HDInsight](../../hdinsight/kafka/apache-kafka-get-started.md)
19-
* You're required to ensure the network settings are taken care as described on [Using Kafka on HDInsight](../flink/process-and-consume-data.md); that's to make sure HDInsight on AKS and HDInsight clusters are in the same VNet
19+
* Ensure the network settings are taken care as described on [Using Kafka on HDInsight](../flink/process-and-consume-data.md) to make sure HDInsight on AKS and HDInsight clusters are in the same VNet
2020
* For this demonstration, we're using a Window VM as maven project develop environment in the same VNet as HDInsight on AKS
2121

2222
## Kafka topic preparation
@@ -45,7 +45,7 @@ We're creating a topic called `user_events`.
4545

4646
## Prepare file on ADLS Gen2
4747

48-
We are creating a file called `item attributes` in our storage
48+
We're creating a file called `item attributes` in our storage
4949

5050
- The purpose is to read a batch of `item attributes` from a file on ADLS Gen2. Each item has the following fields:
5151
```
@@ -59,7 +59,7 @@ We are creating a file called `item attributes` in our storage
5959

6060
## Develop the Apache Flink job
6161

62-
In this step we perform the following activities
62+
In this step, we perform the following activities
6363
- Enrich the `user_events` topic from Kafka by joining with `item attributes` from a file on ADLS Gen2.
6464
- We push the outcome of this step, as an enriched user activity of events into a Kafka topic.
6565

@@ -259,7 +259,7 @@ public class KafkaJoinGen2Demo {
259259
}
260260
```
261261

262-
## Package jar and submit to Apache Flink
262+
## Package jar, and submit to Apache Flink
263263

264264
We're submitting the packaged jar to Flink:
265265

@@ -270,13 +270,13 @@ We're submitting the packaged jar to Flink:
270270

271271
### Produce real-time `user_events` topic on Kafka
272272

273-
We are able to produce real-time user behavior event `user_events` in Kafka.
273+
We're able to produce real-time user behavior event `user_events` in Kafka.
274274

275275
:::image type="content" source="./media/join-stream-kafka-table-filesystem/step-5-kafka-3-2.png" alt-text="Screenshot showing a real-time user behavior event on Kafka 3.2." border="true" lightbox="./media/join-stream-kafka-table-filesystem/step-5-kafka-3-2.png":::
276276

277277
### Consume the `itemAttributes` joining with `user_events` on Kafka
278278

279-
We are now using `itemAttributes` on filesystem join user activity events `user_events`.
279+
We're now using `itemAttributes` on filesystem join user activity events `user_events`.
280280

281281
:::image type="content" source="./media/join-stream-kafka-table-filesystem/step-6-kafka-3-2.png" alt-text="Screenshot showing Consume the item attributes-joined user activity events on Kafka 3.2." border="true" lightbox="./media/join-stream-kafka-table-filesystem/step-6-kafka-3-2.png":::
282282

0 commit comments

Comments
 (0)