Ra 1.2: leader operations fail with a timeout #374
Replies: 6 comments
-
This means the cluster does not have an elected leader or a quorum of members online, or that it never had them.
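To make the quorum condition concrete, here is a minimal sketch of the majority arithmetic used by Raft-style consensus libraries such as Ra. The module name `quorum_sketch` and both function names are hypothetical and are not part of the Ra API. They only illustrate the rule that a leader can be elected when a strict majority of the cluster's members is online.

```erlang
%% Hypothetical helper module, not part of Ra: majority (quorum) arithmetic.
-module(quorum_sketch).
-export([quorum/1, has_quorum/2]).

%% Smallest number of members that forms a strict majority
%% of a cluster with ClusterSize members.
quorum(ClusterSize) when is_integer(ClusterSize), ClusterSize > 0 ->
    ClusterSize div 2 + 1.

%% True when Online members are enough to form a majority
%% of a cluster with ClusterSize members.
has_quorum(Online, ClusterSize) when is_integer(Online), Online >= 0 ->
    Online >= quorum(ClusterSize).
```

For example, `quorum_sketch:has_quorum(2, 3)` returns `true`, while `quorum_sketch:has_quorum(1, 2)` returns `false`: a cluster with exactly two members needs both of them online, so it cannot tolerate the loss of either one.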
-
We cannot suggest much with this amount of information. Consider gathering node logs and state. Ra 1.2.x is several minor releases, and possibly a major release, behind.
-
Without any code or logs, #179, #251, #264 look potentially relevant. #179 mentions how to enable logging, including debug logging.
-
Thanks so much for all this information. I will enable logging and try my best to provide more information. I appreciate your help.
-
Here is what I found:
-
There were many changes around member state, and some upcoming, so I don't think this is relevant any more in the 2.10.x era of Ra.
-
Hi,
I have 2 nodes and 3 clusters running on them. Node 1 was the leader in all 3 clusters. I stopped node 1; node 2 then became the leader of 2 clusters, but in the last cluster node 2 is in the 'pre_vote' state and seems stuck there.
If I run ra:members(cluster_3), I get {timeout, cluster_3}.
I then tried everything I could think of, but it stays stuck. I tried restart_server, stop_server/restart_server, and start_or_restart_cluster. I always got the error already_started.
I will try delete_cluster/start_cluster. Update: delete_cluster returned {error,{no_more_servers_to_try,[{timeout,{cluster_3, node_2}}]}}
Could you please suggest how I could recover from this situation? I'm on v1.1.2. I know it's a pretty old version, but that's what I have.
Thank you so much.
Jing