@@ -10,7 +10,7 @@ author: jessegler
---
# Troubleshoot a KubernetesCluster with a node in NotReady state

- Follow this troubleshooting guide if you see a kubernetesCluster with a node in NotReady.
+ Follow this troubleshooting guide if you see a kubernetesCluster with a node in **NotReady**.

## Prerequisites
@@ -24,39 +24,44 @@ Follow this troubleshooting guide if you see a kubernetesCluster with a node in
## Procedure

- Delete the node by following the example below. This will allow the Cluster to attempt to reschedule and restart the node.
+ Delete the node by following the instructions below. This will allow the Cluster to attempt to reschedule and restart the node.
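
The procedure starts by spotting the node whose status is **NotReady**. As a minimal sketch of that filtering step, the helper below reads `kubectl get nodes` output and prints only the matching node names. The function name `notready_nodes` is hypothetical and not part of the original guide, and the sketch assumes the default column layout in which STATUS is the second column.

```bash
#!/usr/bin/env bash
# Hypothetical helper (an illustration, not part of the original guide).
# Reads `kubectl get nodes` output on stdin, skips the header row, and
# prints the NAME of every node whose STATUS column is exactly "NotReady".
# Nodes with a combined status such as "NotReady,SchedulingDisabled" are
# intentionally not matched by this exact comparison.
notready_nodes() {
  awk 'NR > 1 && $2 == "NotReady" { print $1 }'
}

# Usage against a live cluster (uncomment to run):
# kubectl get nodes -owide | notready_nodes
```

Review the printed names before passing any of them to `kubectl delete node`; the sketch only lists candidates and deletes nothing.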

- ~~~ bash
- # get list of nodes, wide

+ 1. Use kubectl to list the nodes using the wide flag. Observe the node in **NotReady** status.
+
+ ~~~ bash
$ kubectl get nodes -owide
NAME STATUS ROLES AGE VERSION INTERNAL-IP EXTERNAL-IP OS-IMAGE KERNEL-VERSION CONTAINER-RUNTIME
mytest-naks1-3b466a17-agentpool1-md-6bg5h-7qt2b Ready <none> 6d3h v1.27.3 10.4.74.30 <none> CBL-Mariner/Linux 5.15.153.1-2.cm2 containerd://1.6.26
mytest-naks1-3b466a17-agentpool1-md-6bg5h-dqmzw Ready <none> 6d3h v1.27.3 10.4.74.31 <none> CBL-Mariner/Linux 5.15.153.1-2.cm2 containerd://1.6.26
mytest-naks1-3b466a17-agentpool1-md-6bg5h-lkhhq NotReady <none> 6d3h v1.27.3 10.4.74.29 <none> CBL-Mariner/Linux 5.15.153.1-2.cm2 containerd://1.6.26
mytest-naks1-3b466a17-control-plane-6q7ns Ready control-plane 6d3h v1.27.3 10.4.74.14 <none> CBL-Mariner/Linux 5.15.153.1-2.cm2 containerd://1.6.26
mytest-naks1-3b466a17-control-plane-8qqvz Ready control-plane 6d3h v1.27.3 10.4.74.28 <none> CBL-Mariner/Linux 5.15.153.1-2.cm2 containerd://1.6.26
- mytest-naks1-3b466a17-control-plane-g42mh Ready control-plane 6d3h v1.27.3 10.4.74.32 <none> CBL-Mariner/Linux 5.15.153.1-2.cm2 containerd://1.6.26
+ mytest-naks1-3b466a17-control-plane-g42mh Ready control-plane 6d3h v1.27.3 10.4.74.32 <none> CBL-Mariner/Linux 5.15.153.1-2.cm2 containerd://1.6.26
+ ~~~

- # see NotReady node. issue kubectl command to delete it:
+ 1. Issue the kubectl command to delete the node.

+ ~~~ bash
$ kubectl delete node mytest-naks1-3b466a17-agentpool1-md-6bg5h-lkhhq
node "mytest-naks1-3b466a17-agentpool1-md-6bg5h-lkhhq" deleted
+ ~~~

+ 1. List the nodes again and see that the node is gone.

- # node list is now shows that node is gone:
-
+ ~~~ bash
$ kubectl get nodes -owide
NAME STATUS ROLES AGE VERSION INTERNAL-IP EXTERNAL-IP OS-IMAGE KERNEL-VERSION CONTAINER-RUNTIME
mytest-naks1-3b466a17-agentpool1-md-6bg5h-7qt2b Ready <none> 6d3h v1.27.3 10.4.74.30 <none> CBL-Mariner/Linux 5.15.153.1-2.cm2 containerd://1.6.26
mytest-naks1-3b466a17-agentpool1-md-6bg5h-dqmzw Ready <none> 6d3h v1.27.3 10.4.74.31 <none> CBL-Mariner/Linux 5.15.153.1-2.cm2 containerd://1.6.26
mytest-naks1-3b466a17-control-plane-6q7ns Ready control-plane 6d3h v1.27.3 10.4.74.14 <none> CBL-Mariner/Linux 5.15.153.1-2.cm2 containerd://1.6.26
mytest-naks1-3b466a17-control-plane-8qqvz Ready control-plane 6d3h v1.27.3 10.4.74.28 <none> CBL-Mariner/Linux 5.15.153.1-2.cm2 containerd://1.6.26
mytest-naks1-3b466a17-control-plane-g42mh Ready control-plane 6d3h v1.27.3 10.4.74.32 <none> CBL-Mariner/Linux 5.15.153.1-2.cm2 containerd://1.6.26
+ ~~~

+ 1. Wait 5-15 minutes for the node to be replaced. See that it has returned with a new name. It will show **NotReady** as it comes up.

- # wait 5-15 minutes while its replaced. see it returned with a new name:
-
+ ~~~ bash
$ kubectl get nodes -owide
NAME STATUS ROLES AGE VERSION INTERNAL-IP EXTERNAL-IP OS-IMAGE KERNEL-VERSION CONTAINER-RUNTIME
mytest-naks1-3b466a17-agentpool1-md-6bg5h-7qt2b Ready <none> 6d3h v1.27.3 10.4.74.30 <none> CBL-Mariner/Linux 5.15.153.1-2.cm2 containerd://1.6.26
@@ -65,10 +70,11 @@ mytest-naks1-3b466a17-agentpool1-md-6bg5h-nxkks NotReady <none> 42s
mytest-naks1-3b466a17-control-plane-6q7ns Ready control-plane 6d3h v1.27.3 10.4.74.14 <none> CBL-Mariner/Linux 5.15.153.1-2.cm2 containerd://1.6.26
mytest-naks1-3b466a17-control-plane-8qqvz Ready control-plane 6d3h v1.27.3 10.4.74.28 <none> CBL-Mariner/Linux 5.15.153.1-2.cm2 containerd://1.6.26
mytest-naks1-3b466a17-control-plane-g42mh Ready control-plane 6d3h v1.27.3 10.4.74.32 <none> CBL-Mariner/Linux 5.15.153.1-2.cm2 containerd://1.6.26
+ ~~~

+ 1. Wait a bit longer and the **NotReady** node becomes **Ready**.

- # wait a bit longer, while the new NotReady node becomes Ready:
-
+ ~~~ bash
$ kubectl get nodes -owide
NAME STATUS ROLES AGE VERSION INTERNAL-IP EXTERNAL-IP OS-IMAGE KERNEL-VERSION CONTAINER-RUNTIME
mytest-naks1-3b466a17-agentpool1-md-6bg5h-7qt2b Ready <none> 6d3h v1.27.3 10.4.74.30 <none> CBL-Mariner/Linux 5.15.153.1-2.cm2 containerd://1.6.26