Replies: 3 comments 16 replies
-
How are you running RabbitMQ: in what kind of environment, and on what type of disks and filesystem? Can you share full logs from all nodes or, ideally, the data directory? Have you made any recent changes to your infrastructure?
-
Since Kubernetes adds a ton of complexity and additional layers to debug, we need a way to reproduce this behavior outside of Kubernetes. "Install Kubernetes" is not a reproduction step because there are a good dozen flavors available, many of which differ in subtle ways.
-
All nodes log exactly the same exception that looks like this:
(the segment file names are different, of course). It looks quite likely that something is off on all nodes in this cluster, for example disk volume availability. Otherwise it would be very unusual to see the same local operation exception on all nodes, repeatedly.
-
Describe the bug
A 3-node cluster with quorum queues. One queue got into a bad state: consumers get Connection closed: 541 and the RabbitMQ server logs show crashes with the exception ra_log_segment_unexpected_eof.
Searching for this exception returns a single result on Google: 9733, but segment_max_entries = 4096 here, so it seems unrelated.
The RabbitMQ management UI showed '???' in the metrics for the queue (ready/acked messages...)
rabbitmq-queues quorum_status for the queue shows one node in the pre-vote state and two in timeout.
First attempt to delete the queue from the UI: failed with a timeout error.
Then tried delete_member for each node from the CLI:
Then tried deleting again from the UI and it worked...
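For anyone retracing these steps, the two CLI calls mentioned above (quorum_status and delete_member) can be sketched roughly as below. This is a minimal illustration and not the exact commands that were run: the queue name, virtual host and node names are hypothetical placeholders, and the script only builds and prints the command lines rather than executing them.

```python
import shlex
from typing import List


def quorum_status_cmd(queue: str, vhost: str = "/") -> List[str]:
    """Build the argv for inspecting the Raft state of a quorum queue's members."""
    return ["rabbitmq-queues", "quorum_status", "--vhost", vhost, queue]


def delete_member_cmds(queue: str, nodes: List[str], vhost: str = "/") -> List[List[str]]:
    """Build one delete_member argv per node, removing that node's replica of the queue."""
    return [
        ["rabbitmq-queues", "delete_member", "--vhost", vhost, queue, node]
        for node in nodes
    ]


if __name__ == "__main__":
    # Hypothetical names, for illustration only.
    queue = "my-queue"
    nodes = ["rabbit@node-0", "rabbit@node-1", "rabbit@node-2"]

    # Dry run: print the commands instead of running them against a live cluster.
    for cmd in [quorum_status_cmd(queue)] + delete_member_cmds(queue, nodes):
        print(shlex.join(cmd))
```

Printing first makes it easy to review the commands before running them by hand, which matters here because delete_member changes the queue's membership.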
I experienced a similar issue on another cluster a few days ago. The RabbitMQ management UI was showing numbers in the metrics and the queue was empty with 2 unacked messages, but rabbitmq-server was crashing with the same ra_log_segment_unexpected_eof exception.
Reproduction steps
...
Expected behavior
I expect the RabbitMQ server not to crash.
Additional context
No response