articles/hdinsight/kafka/kafka-faq.md
Lines changed: 19 additions & 14 deletions
@@ -6,17 +6,17 @@ ms.author: hrasheed
6
6
ms.reviewer: jasonh
7
7
ms.service: hdinsight
8
8
ms.topic: conceptual
9
-
ms.date: 08/09/2019
9
+
ms.date: 08/14/2019
10
10
---
11
11
# Frequently asked questions about Apache Kafka in Azure HDInsight
12
12
13
13
This article addresses some common questions about using Apache Kafka on Azure HDInsight.
14
14
15
-
## What Kafka versions are supported by HDInsight
15
+
## What Kafka versions are supported by HDInsight?
16
16
17
17
Our officially supported component versions can be found [here](../hdinsight-component-versioning.md#supported-hdinsight-versions). We recommend always using the latest version to ensure the best possible performance and user experience.
18
18
19
-
## What resources are provided in an HDInsight Kafka cluster and what resources am I charged for
19
+
## What resources are provided in an HDInsight Kafka cluster and what resources am I charged for?
20
20
21
21
An HDInsight Kafka cluster includes the following resources:
22
22
@@ -30,23 +30,23 @@ All of these resources are charged based on our [HDInsight pricing model](https:
30
30
31
31
For a more detailed description of various node types, see [Azure HDInsight virtual network architecture](../hdinsight-virtual-network-architecture.md). Pricing is based on per minute node usage. Prices vary depending on node size, number of nodes, type of managed disk used, and region.
32
32
33
-
## Do Apache Kafka APIs work with HDInsight
33
+
## Do Apache Kafka APIs work with HDInsight?
34
34
35
35
Yes, HDInsight uses native Kafka APIs. Your client application code doesn't need to change. See [Tutorial: Use the Apache Kafka Producer and Consumer APIs](./apache-kafka-producer-consumer-api.md) to see how you can use Java-based producer/consumer APIs with your cluster.
36
36
37
-
## Can I change cluster configurations
37
+
## Can I change cluster configurations?
38
38
39
39
Yes, through the Ambari portal. Each component in the portal has a **configs** section, which can be used to change component configurations. Some changes may require broker restarts.
40
40
41
-
## What type of authentication does HDInsight Kafka support
41
+
## What type of authentication does HDInsight support for Apache Kafka?
42
42
43
43
Using [Enterprise Security Package (ESP)](../domain-joined/apache-domain-joined-architecture.md), customers can get topic-level security for their Kafka clusters. For more information, see [Tutorial: Configure Apache Kafka policies in HDInsight with Enterprise Security Package (Preview)](../domain-joined/apache-domain-joined-run-kafka.md).
44
44
45
-
## Is my data encrypted and can I use my own keys
45
+
## Is my data encrypted? Can I use my own keys?
46
46
47
-
All Kafka messages on the managed disks are encrypted with Azure Storage Service Encryption (SSE). Data-in-transit (for example, data being transmitted from clients to brokers and the other way around) isn't encrypted by default. It's possible to encrypt such traffic by setting up SSL on your own. Additionally, HDInsight allows customers to manage their own keys (BYOK) to encrypt the data at rest.
47
+
All Kafka messages on the managed disks are encrypted with [Azure Storage Service Encryption (SSE)](../../storage/common/storage-service-encryption.md). Data in transit (for example, data being transmitted from clients to brokers and the other way around) isn't encrypted by default. It's possible to encrypt such traffic by [setting up SSL on your own](./apache-kafka-ssl-encryption-authentication.md). Additionally, HDInsight allows customers to manage their own keys to encrypt the data at rest. For more information, see [Bring your own key for Apache Kafka on Azure HDInsight](apache-kafka-byok.md).
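As a rough illustration of what encrypting client-to-broker traffic involves on the client side, the sketch below renders a Kafka client properties file for an SSL connection. The property names are standard Kafka client configuration keys. The helper name, broker addresses, port, truststore path, and password are hypothetical placeholders, and the sketch assumes the brokers already expose an SSL listener.

```python
def ssl_client_properties(brokers, truststore_path, truststore_password):
    """Render Kafka client settings for an SSL-encrypted connection.

    All argument values are supplied by the caller; nothing here is
    specific to HDInsight (hypothetical helper for illustration).
    """
    settings = {
        # Comma-separated list of broker host:port pairs.
        "bootstrap.servers": ",".join(brokers),
        # Ask the client to encrypt traffic to the brokers.
        "security.protocol": "SSL",
        # Truststore holding the certificate the client uses to verify brokers.
        "ssl.truststore.location": truststore_path,
        "ssl.truststore.password": truststore_password,
    }
    # Kafka properties files are plain key=value lines.
    return "\n".join(f"{key}={value}" for key, value in settings.items())
```

The resulting text can be saved to a file and passed to a Kafka client or command-line tool that accepts a properties file.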
48
48
49
-
## How do I connect clients to my cluster
49
+
## How do I connect clients to my cluster?
50
50
51
51
For Kafka clients to communicate with Kafka brokers, they must be able to reach the brokers over the network. For HDInsight clusters, the Virtual Network (VNet) is the security boundary. Hence, the easiest way to connect clients to your HDInsight cluster is to create clients within the same VNet as the cluster. Other scenarios include:
52
52
@@ -56,11 +56,11 @@ For Kafka clients to communicate with Kafka brokers, they must be able to reach
56
56
57
57
* Creating a public endpoint for your Kafka service – If your enterprise security requirements allow it, you can deploy a public endpoint for your Kafka brokers, or a self-managed open-source REST endpoint with a public endpoint.
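Whichever connectivity option is used, a quick way to confirm that a client machine can reach the brokers is to attempt a plain TCP connection to each one. The sketch below uses only the Python standard library. The function names are hypothetical, and a successful connection shows only that the network path is open, not that Kafka requests will succeed.

```python
import socket


def parse_brokers(broker_list):
    """Split 'host1:9092,host2:9092' into [(host, port), ...]."""
    pairs = []
    for entry in broker_list.split(","):
        # rpartition splits on the last colon, so the port is always last.
        host, _, port = entry.strip().rpartition(":")
        pairs.append((host, int(port)))
    return pairs


def can_reach(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False
```

For example, calling `can_reach(host, port)` for each pair returned by `parse_brokers` on your own broker list shows which brokers are reachable from the client's network.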
58
58
59
-
## Can I add more disk space on an existing cluster
59
+
## Can I add more disk space on an existing cluster?
60
60
61
61
To increase the amount of space available for Kafka messages, you can increase the number of nodes. Currently, adding more disks to an existing cluster isn't supported.
62
62
63
-
## How can I have maximum data durability
63
+
## How can I have maximum data durability?
64
64
65
65
Data durability minimizes the risk of message loss. To achieve maximum data durability, we recommend the following settings:
66
66
@@ -72,14 +72,19 @@ Data durability allows you to achieve the lowest risk of message loss. In order
72
72
73
73
Configuring Kafka for higher data consistency reduces the availability of brokers for produce requests.
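The trade-off between consistency and availability can be made concrete with a small sketch. The setting names below are standard Kafka configuration keys that are commonly tuned for durability, but the values are illustrative assumptions and are not taken from the recommended list in this article. The helper function is hypothetical.

```python
# Illustrative values only; these are assumptions, not the article's list.
DURABILITY_SETTINGS = {
    # Producer waits for all in-sync replicas to acknowledge each write.
    "producer": {"acks": "all"},
    # Minimum in-sync replicas required before an acks=all write succeeds.
    "topic": {"min.insync.replicas": "2"},
    # Never elect an out-of-sync replica as leader.
    "broker": {"unclean.leader.election.enable": "false"},
}


def writable_replica_losses(replication_factor, min_insync_replicas):
    """Replicas that can fall out of sync before acks=all writes are rejected."""
    if not 1 <= min_insync_replicas <= replication_factor:
        raise ValueError("min_insync_replicas must be between 1 and replication_factor")
    return replication_factor - min_insync_replicas
```

With a replication factor of 3 and `min.insync.replicas` of 2, one replica can be unavailable before produce requests using `acks=all` start to fail. Raising `min.insync.replicas` to 3 leaves no such margin, which is the availability cost of stricter consistency.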
74
74
75
-
## Can I replicate my data to multiple clusters
75
+
## Can I replicate my data to multiple clusters?
76
76
77
77
Yes, data can be replicated to multiple clusters using Kafka MirrorMaker. Details on setting up MirrorMaker can be found [here](apache-kafka-mirroring.md). Additionally, other self-managed open-source technologies and vendors, such as [Brooklin](https://github.com/linkedin/Brooklin/), can help achieve replication to multiple clusters.
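As a minimal sketch of how a MirrorMaker run is typically assembled, the code below builds the two properties files and the argument list for the `kafka-mirror-maker.sh` script that ships with Apache Kafka. The flags shown are those of the original MirrorMaker tool, and flag names vary between Kafka versions, so check the script's help output for your version. The function names, file names, and group ID are hypothetical.

```python
def properties_text(settings):
    """Render a dict as key=value lines for a Kafka properties file."""
    return "\n".join(f"{key}={value}" for key, value in settings.items())


def consumer_properties(source_brokers, group_id):
    """Consumer side: reads from the source cluster."""
    return properties_text({
        "bootstrap.servers": ",".join(source_brokers),
        "group.id": group_id,
    })


def producer_properties(destination_brokers):
    """Producer side: writes to the destination cluster."""
    return properties_text({
        "bootstrap.servers": ",".join(destination_brokers),
    })


def mirror_maker_args(consumer_config, producer_config, topic_pattern, streams=1):
    """Build the command line for the MirrorMaker script (not executed here)."""
    return [
        "kafka-mirror-maker.sh",
        "--consumer.config", consumer_config,
        "--producer.config", producer_config,
        "--whitelist", topic_pattern,   # topics to mirror
        "--num.streams", str(streams),  # number of consumer threads
    ]
```

The argument list can be passed to `subprocess.run` on a machine that has the Kafka scripts on its path and network access to both clusters.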
78
78
79
-
## Can I upgrade my cluster/ How should I upgrade my cluster
79
+
## Can I upgrade my cluster? How should I upgrade my cluster?
80
80
81
81
We don't currently support in-place cluster version upgrades. To update your cluster to a higher Kafka version, create a new cluster with the version that you want and migrate your Kafka clients to use the new cluster.
82
82
83
-
## How do I monitor my Kafka cluster
83
+
## How do I monitor my Kafka cluster?
84
84
85
85
Use Azure Monitor to analyze your [Kafka logs](./apache-kafka-log-analytics-operations-management.md).
86
+
87
+
## Next steps
88
+
89
+
* [Set up Secure Sockets Layer (SSL) encryption and authentication for Apache Kafka in Azure HDInsight](./apache-kafka-ssl-encryption-authentication.md)
90
+
* [Use MirrorMaker to replicate Apache Kafka topics with Kafka on HDInsight](./apache-kafka-mirroring.md)