K3S metrics server connection timed out #13491
-
|
Hello folks, This is the current scenario:
All these hosts are located in the same network. In addition, all nodes are showing error messages about v1beta1.metrics.k8s.io, here an example: Each node is able to establish TCP/10250 connection to another nodes using the remote host IP (10.10.10.5 -> 10.10.10.7). The ip address "10.42.1.34" exists in my scenario, it is reacheable just via ICMP from the three nodes. In the end, I am able to see metrics from just one of this node using the command line "kubectl top". How can I solve this issue? |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 2 replies
-
Yes, that is how it works. The metrics-server pods scrapes the nodes on the kubelet metrics port, and aggregates the data in memory. The apiserver proxies metrics requests to the metrics-server pod, which serves metrics. In this case, it appears that the apiserver on the nodes not running the metrics-server pod cannot reach the pod by its IP. Are you sure you have the correct ports open between nodes? Check the port requirements doc, and make sure you don't have any conflicting firewall/iptables rules blocking traffic between nodes. If flannel is working, you should be able to curl the metrics-server pod ( |
Beta Was this translation helpful? Give feedback.
-
|
Hi @brandond , Yes, the nodes not running the metrics-server are not able to reach the pod by its IP. Have I to install this component manually? |
Beta Was this translation helpful? Give feedback.

Hi @brandond
thanks for your help and your hints!
I decided to inspect the network traffic using tcpdump and Wireshark to investigate further and determine the cause of the TCP timeout connections.
ICMP communication appeared to be working correctly, but Wireshark detected anomalies; some ICMP requests were not receiving a response (not visible from RHEL CLI).
The 10.42.0.89 IP Address that you see in the picture was the metrics-server IP Address during the troubleshooting.
I searched for more information about flannel and found the solution in this thread: #5013
I am running RHEL 8.10 (updated) and it is still affected by this known bug.
To solve the issue I decided to setup a SystemD s…