Replies: 1 comment 1 reply
-
|
Hello, |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hello,
I’m running into an issue when trying to remove a node from a cluster after it has already been shut down. I’m not sure if I’m doing something wrong or if this is a bug or missing feature.
🧩 Scenario
I have a cluster consisting of 3 nodes.
When a node shuts down (ASP.NET Core exit), it attempts to:
1. Send a signal to the cluster leader to gracefully unregister itself.
2. If the shutting-down node is the leader, it unregisters itself as leader und then sends a leave signal.
If this does not work, the leader runs a cleanup routine that should automatically remove the unreachable node after 15 seconds.
However — no matter what I try, the node simply does not get removed.
⚙️ Technical Setup
• 3-node cluster
• Graceful shutdown implemented
• Leader fallback cleanup routine for unreachable nodes
• Persistence engine enabled
• Everything else in the cluster works fine — only removing a member fails
❗ The Problem
The node cannot be removed during graceful shutdown nor by the leader’s cleanup logic.
It stays in the cluster state indefinitely, even though it is completely offline and unreachable.
❓ Question
What is the correct way to remove a node that is already unreachable?
And how should a node perform a proper graceful shutdown and ensure it is fully removed from the cluster?
🧭 Why this is needed
• High-availability scaling
• Handling random node failures safely
Thanks in advance for any guidance!
Beta Was this translation helpful? Give feedback.
All reactions