RabbitMQ pod stuck on termination with fedaration crash #6212
Replies: 2 comments
-
This exception simply says that an upstream channel terminated (can be due to a restart of a remote node). This is unlikely to be a root cause. See what |
Beta Was this translation helpful? Give feedback.
-
@michaelklishin Thanks Michael for a tip. After upgrading 10 clusters (each in 3 node config) from |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
On clusters where we have federation configured we have observed that sometimes one or more pods are stuck in
Terminating
state during the restart/upgrade.Before the cluster restart, the commands
rabbitmq-queues check_if_node_is_quorum_critical
andrabbitmq-queues check_if_node_is_mirror_sync_critical
executed for each of the nodes confirmed that all nodes are in sync with data, so there were no leftovers from some quorum queues.We are using RabbitMQ in version
3.11.1
running in k8s, provisioned by thecluster-operator
in version2.0.0
. We are using cluster configured with a 3 nodes with following config:On clusters where there is no federation configured this problem does not exists.
Some pods are shutting down quickly and without errors in logs, on others we are observing logs with:
Full log in the attachment.
rabbit-stuck-on-terminate.txt
Beta Was this translation helpful? Give feedback.
All reactions