k3s 1.35.1 not starting flannel CNI after power cut #13675
Closed
BinaryMan32
started this conversation in
General
-
Not sure if I should have created an issue instead of a discussion; let me know if that would be more appropriate.
-
This sounds kind of like #13277 - maybe take a look at the causes people have identified there?
-
I have a 9-node cluster of Turing RK1s running k3s 1.35.1 with HA (3 control-plane servers). I started with 4 nodes about 2 years ago and have gradually grown the cluster to 9 nodes. A few days ago I was adding a 10th node and accidentally cut power to a couple of the existing nodes instead of shutting them down gracefully. When they restarted, the worker rejoined with no issues, but the control-plane node would not rejoin.
I see many messages in the logs similar to:
These are just symptoms though, not root cause.
On the last SUCCESSFUL k3s startup, I saw messages like this:
Now, these messages do not appear in the logs. I can confirm there are no cni or flannel interfaces:
On other working nodes I see:
I tried uninstalling and re-installing k3s on this node, but the behavior has not changed.
I don't see anything sensitive in the logs, so I'll attach the log of the failed startup in case something in there is interesting to diagnose the issue.
k3s-failed-startup-logs.txt
The only other brute-force approach I can think of is to reflash the operating system. I'll try that in a few days, but I'd like to understand what happened in case there's an easier way to fix this or to prevent it from happening again.
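The interface comparison between the failed node and the working nodes can also be scripted. This is only a minimal sketch, assuming the default k3s flannel VXLAN backend, where the interfaces are normally named `cni0` and `flannel.1`. The function names `missing_cni_interfaces` and `list_interfaces` are hypothetical helpers for illustration, not part of k3s or flannel.

```python
import os

# Interface names assumed for k3s's default flannel VXLAN backend.
# Other backends (e.g. wireguard-native, host-gw) use different names.
EXPECTED = ("cni0", "flannel.1")


def missing_cni_interfaces(present, expected=EXPECTED):
    """Return the expected interface names that are not in `present`."""
    present = set(present)
    return [name for name in expected if name not in present]


def list_interfaces(sysfs_net="/sys/class/net"):
    """List network interface names from sysfs (Linux only)."""
    return sorted(os.listdir(sysfs_net))


if __name__ == "__main__":
    missing = missing_cni_interfaces(list_interfaces())
    print("missing:", ", ".join(missing) if missing else "none")
```

Running this on both a working node and the failed node gives a quick side-by-side check without having to read through the full interface listing.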