Merge pull request #51709 from sheriff-rh/etcd-fix

adellape · web-flow · commit b0a4b49d25ef · 2022-10-14T10:45:13.000-06:00
diff --git a/modules/etcd-defrag.adoc b/modules/etcd-defrag.adoc
@@ -9,7 +9,7 @@
 
 For large and dense clusters, etcd can suffer from poor performance if the keyspace grows too large and exceeds the space quota. Periodically maintain and defragment etcd to free up space in the data store. Monitor Prometheus for etcd metrics and defragment it when required; otherwise, etcd can raise a cluster-wide alarm that puts the cluster into a maintenance mode that accepts only key reads and deletes.
 
-.Monitor these key metrics:
+Monitor these key metrics:
 
 * `etcd_server_quota_backend_bytes`, which is the current quota limit
 * `etcd_mvcc_db_total_size_in_use_in_bytes`, which indicates the actual database usage after a history compaction
diff --git a/modules/recommended-etcd-practices.adoc b/modules/recommended-etcd-practices.adoc
@@ -60,9 +60,9 @@ $ sudo docker run --volume /var/lib/etcd:/var/lib/etcd:Z quay.io/openshift-scale
 
 The output reports whether the disk is fast enough to host etcd by comparing the 99th percentile of the fsync metric captured from the run to see if it is less than 20 ms. A few of the most important etcd metrics that might affected by I/O performance are as follow:
 
-- `etcd_disk_wal_fsync_duration_seconds_bucket` metric reports the etcd's WAL fsync duration.
-- `etcd_disk_backend_commit_duration_seconds_bucket`  metric reports the etcd backend commit latency duration.
-- `etcd_server_leader_changes_seen_total` metric reports the leader changes.
+* `etcd_disk_wal_fsync_duration_seconds_bucket` metric reports the etcd's WAL fsync duration
+* `etcd_disk_backend_commit_duration_seconds_bucket`  metric reports the etcd backend commit latency duration
+* `etcd_server_leader_changes_seen_total` metric reports the leader changes
 
 Because etcd replicates the requests among all the members, its performance strongly depends on network input/output (I/O) latency. High network latencies result in etcd heartbeats taking longer than the election timeout, which results in leader elections that are disruptive to the cluster. A key metric to monitor on a deployed {product-title} cluster is the 99th percentile of etcd network peer latency on each etcd cluster member. Use Prometheus to track the metric.