articles/hdinsight/kafka/kafka-faq.yml
Lines changed: 4 additions & 4 deletions
@@ -4,7 +4,7 @@ metadata:
4
4
description: Get answers to common questions about Apache Kafka on Azure HDInsight, a managed Hadoop cloud service.
5
5
ms.service: hdinsight
6
6
ms.topic: faq
7
-
ms.date: 12/21/2022
7
+
ms.date: 01/10/2024
8
8
title: Frequently asked questions about Apache Kafka in Azure HDInsight
9
9
summary: This article addresses some common questions about using Apache Kafka on Azure HDInsight.
10
10
@@ -57,7 +57,7 @@ sections:
57
57
answer: |
58
58
For Kafka clients to communicate with Kafka brokers, they must be able to reach the brokers over the network. For HDInsight clusters, the Virtual Network (VNet) is the security boundary. Hence, the easiest way to connect clients to your HDInsight cluster is to create clients within the same VNet as the cluster. Other scenarios include:
59
59
60
-
* Connecting clients in a different Azure VNet – Peer the cluster VNet and the client VNet and configure the cluster for [IP Advertising](apache-kafka-connect-vpn-gateway.md#configure-kafka-for-ip-advertising). When using IP advertising, Kafka clients must use Broker IP addresses to connect with the brokers, instead of Fully Qualified Domain Names (FQDNs).
60
+
* To connect clients in a different Azure VNet – Peer the cluster VNet and the client VNet and configure the cluster for [IP Advertising](apache-kafka-connect-vpn-gateway.md#configure-kafka-for-ip-advertising). When using IP advertising, Kafka clients must use Broker IP addresses to connect with the brokers, instead of Fully Qualified Domain Names (FQDNs).
61
61
62
62
* Connecting on-premises clients – Using a VPN network and setting up custom DNS servers as described in [Plan a virtual network for Azure HDInsight](../hdinsight-plan-virtual-network-deployment.md).
63
63
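The IP advertising requirement above means a client's bootstrap list has to contain broker IP addresses rather than FQDNs. A minimal sketch of building such a list follows. The helper name `bootstrap_servers`, the sample addresses, and the port 9092 are assumptions for illustration and are not taken from the FAQ; check the port against your cluster's listener configuration.

```python
from ipaddress import ip_address

# Assumed broker port; not stated in the FAQ text.
KAFKA_PORT = 9092


def bootstrap_servers(broker_ips, port=KAFKA_PORT):
    """Build a bootstrap.servers string from broker IP addresses.

    Hypothetical helper: raises ValueError if an entry is not an IP
    literal (for example an FQDN), since IP advertising requires
    clients to connect by IP address.
    """
    entries = []
    for raw in broker_ips:
        ip = ip_address(raw.strip())  # ValueError for hostnames/FQDNs
        host = f"[{ip}]" if ip.version == 6 else str(ip)
        entries.append(f"{host}:{port}")
    return ",".join(entries)


if __name__ == "__main__":
    # Sample private addresses, purely illustrative.
    print(bootstrap_servers(["10.0.0.4", "10.0.0.5", "10.0.0.6"]))
```

The resulting string is what a Kafka client would receive as its `bootstrap.servers` setting. Rejecting hostnames early is a design choice in this sketch, so a misconfigured client fails before it attempts a connection.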
@@ -71,7 +71,7 @@ sections:
71
71
- question: |
72
72
Can a Kafka cluster work with Databricks?
73
73
answer: |
74
-
Yes, Kafka clusters can work with Databricks so long as they are in the same VNet. To use a Kafka cluster with Databricks, create a VNet with an HDInsight Kafka cluster in it, then specify that VNet when you create your Databricks workspace and use VNet injection. For more information, see [Deploy Azure Databricks in your Azure Virtual Network (VNet Injection)](/azure/databricks/administration-guide/cloud-configurations/azure/vnet-inject). You will need to provide the bootstrap broker names of the Kafka cluster when creating the Databricks workspace. For information on retrieving the Kafka broker names, see [Get the Apache Zookeeper and Broker host information](./apache-kafka-get-started.md#getkafkainfo).
74
+
Yes, Kafka clusters can work with Databricks so long as they are in the same VNet. To use a Kafka cluster with Databricks, create a VNet with an HDInsight Kafka cluster in it, then specify that VNet when you create your Databricks workspace and use VNet injection. For more information, see [Deploy Azure Databricks in your Azure Virtual Network (VNet Injection)](/azure/databricks/administration-guide/cloud-configurations/azure/vnet-inject). You need to provide the bootstrap broker names of the Kafka cluster when creating the Databricks workspace. For information on retrieving the Kafka broker names, see [Get the Apache Zookeeper and Broker host information](./apache-kafka-get-started.md#getkafkainfo).
75
75
76
76
- question: |
77
77
How can I have maximum data durability?
@@ -81,7 +81,7 @@ sections:
81
81
* use a minimum replication factor of 3 in most regions
82
82
* use a minimum replication factor of 4 in regions with only two fault domains
83
83
* disable unclean leader elections
84
-
* set **min.insync.replicas** to 2 or more - this changes the number of replicas which must be completely in sync with the leader before a write can proceed
84
+
* set **min.insync.replicas** to 2 or more - this changes the number of replicas, which must be completely in sync with the leader before a write can proceed
85
85
* set the **acks** property to **all** - this property requires all replicas to acknowledge all messages
86
86
87
87
Configuring Kafka for higher data consistency affects the availability of brokers to produce requests.
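
The durability recommendations listed above can be sketched as plain configuration dictionaries. This is an illustration under stated assumptions, not the article's procedure: the function name `durability_settings` and its `fault_domains` parameter are hypothetical, and `default.replication.factor` is used here as one way to apply the recommended replication factor to newly created topics. The property names themselves are standard Kafka broker and producer settings.

```python
def durability_settings(fault_domains=3):
    """Return (broker_config, producer_config) favoring data durability.

    Hypothetical helper. fault_domains is the number of fault domains
    in the cluster's region; regions with only two fault domains get a
    replication factor of 4, all others get 3.
    """
    replication_factor = 4 if fault_domains == 2 else 3

    broker_config = {
        # Replication factor applied to automatically created topics.
        "default.replication.factor": replication_factor,
        # Never elect an out-of-sync replica as leader.
        "unclean.leader.election.enable": "false",
        # Replicas that must be in sync before a write can proceed.
        "min.insync.replicas": 2,
    }

    producer_config = {
        # Wait for all in-sync replicas to acknowledge each message.
        "acks": "all",
    }
    return broker_config, producer_config


if __name__ == "__main__":
    broker, producer = durability_settings(fault_domains=2)
    for key, value in {**broker, **producer}.items():
        print(f"{key}={value}")
```

Broker properties are set in the cluster configuration, while `acks` is set on each producer. As the answer notes, these settings trade broker availability for consistency.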