Skip to content

Commit 36419dd

Browse files
authored
Merge pull request #28280 from chaitanyaenr/etcd-network-latency
Add etcd network peer latency recommendation
2 parents 4657dc7 + 27994e0 commit 36419dd

File tree

1 file changed

+8
-0
lines changed

1 file changed

+8
-0
lines changed

modules/recommended-etcd-practices.adoc

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -54,3 +54,11 @@ $ sudo docker run --volume /var/lib/etcd:/var/lib/etcd:Z quay.io/openshift-scale
5454

5555
The output reports whether the disk is fast enough to host etcd by comparing the 99th percentile
5656
of the fsync metric captured from the run to see if it is less than 10ms.
57+
58+
Etcd replicates the requests among all the members, so its performance strongly depends on network
59+
input/output (IO) latency. High network latencies result in etcd heartbeats taking longer than the
60+
election timeout, which leads to leader elections that are disruptive to the cluster. A key metric
61+
to monitor on a deployed {product-title} cluster is the 99th percentile of etcd network peer latency
62+
on each etcd cluster member. Use Prometheus to track the metric. `histogram_quantile(0.99, rate(etcd_network_peer_round_trip_time_seconds_bucket[2m]))`
63+
reports the round trip time for etcd to finish replicating the client requests between the members;
64+
it should be less than 50 ms.

0 commit comments

Comments
 (0)