You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have a problem with my kafka cluster, running on a self-hosted kubernetes cluster.
Sometimes, when i node gets drained and restarted (due to automatic updates), once it comes back up, zookeeper fails to reconnect to the other zookeeper instances.
Looking at the failing instance i get the following log entries over and over again:
WARN Cannot open channel to 1 at election address flexogrow-kafka-zookeeper-0.flexogrow-kafka-zookeeper-nodes.flexogrow-kafka.svc/10.42.7.19:3888 (org.apache.zookeeper.server.quorum.QuorumCnxManager) [QuorumConnectionThread-[myid=3]-1]
java.net.ConnectException: Connection refused (Connection refused)
On the other instance i can't see any log entries refering to the connection attempt, however, restarting it fixes the problem.
I'm more or less out of ideas on how to fix this, can anyone help me?
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
-
Hello
I have a problem with my kafka cluster, running on a self-hosted kubernetes cluster.
Sometimes, when i node gets drained and restarted (due to automatic updates), once it comes back up, zookeeper fails to reconnect to the other zookeeper instances.
Looking at the failing instance i get the following log entries over and over again:
On the other instance i can't see any log entries refering to the connection attempt, however, restarting it fixes the problem.
I'm more or less out of ideas on how to fix this, can anyone help me?
Here is the configuration i currently use:
Beta Was this translation helpful? Give feedback.
All reactions