@@ -24,14 +24,18 @@ The following metrics should be collected / monitored
   * Disk Space (free / used)
 
 * For ClickHouse:
-  * Connections (count)
+  * Connections (Number of queries running)
+  * DDL queue length
   * RWLocks
-  * Read / Write / Return (bytes)
-  * Read / Write / Return (rows)
-  * Zookeeper operations (count)
-  * Absolute delay
+  * Read / Write / Return (bytes/rows)
+  * Merges (queue length, memory used)
+  * Mutations
   * Query duration (optional)
-  * Replication parts and queue (count)
+  * Replication queue length and lag
+  * Read only tables
+  * ZooKeeper latencies
+  * Zookeeper operations (count)
+  * S3 errors (if used)
 
 * For Zookeeper:
   * [See separate article](../altinity-kb-zookeeper/zookeeper-monitoring/)
@@ -72,6 +76,7 @@ The following metrics should be collected / monitored
 ## Dashboard querying ClickHouse directly via Grafana plugin
 
 * https://grafana.com/blog/2022/05/05/introducing-the-official-clickhouse-plugin-for-grafana/
+* https://gist.github.com/filimonov/271e5b27c085356c67db3c1bf2204506
 
 ### Zabbix
 
@@ -339,3 +344,4 @@ See https://clickhouse.com/docs/en/operations/opentelemetry/
 * [Monitor ClickHouse with Datadog](https://www.datadoghq.com/blog/monitor-clickhouse/)
 * [Unsorted notes on monitor and Alerts](https://docs.google.com/spreadsheets/d/1K92yZr5slVQEvDglfZ88k_7bfsAKqahY9RPp_2tSdVU/edit#gid=521173956)
 * https://intl.cloud.tencent.com/document/product/1026/36887
+* [Tinybird experience (scroll to monitoring section)](https://www.tinybird.co/blog/what-i-learned-operating-clickhouse-part-ii)
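
Several items on the updated ClickHouse checklist in the first hunk could plausibly be read from ClickHouse system tables. The following is only a rough sketch under stated assumptions, not part of the change itself: the mapping keys, the `collect` and `breaches` helpers, and the thresholds are all hypothetical, and the queries assume the `system.metrics`, `system.distributed_ddl_queue`, `system.replication_queue`, `system.replicas`, `system.mutations` and `system.merges` tables with the column names shown, which may differ between ClickHouse versions. The query runner is passed in as a function, so no particular client library is assumed.

```python
from typing import Callable, Dict, List

# Hypothetical mapping: checklist item -> query returning one numeric value.
# Table and column names are assumptions; verify them on your ClickHouse version.
METRIC_QUERIES: Dict[str, str] = {
    "running_queries": "SELECT value FROM system.metrics WHERE metric = 'Query'",
    "ddl_queue_length": "SELECT count() FROM system.distributed_ddl_queue WHERE status != 'Finished'",
    "replication_queue_length": "SELECT count() FROM system.replication_queue",
    "replication_lag_seconds": "SELECT max(absolute_delay) FROM system.replicas",
    "readonly_tables": "SELECT countIf(is_readonly) FROM system.replicas",
    "unfinished_mutations": "SELECT count() FROM system.mutations WHERE is_done = 0",
    "merge_memory_bytes": "SELECT sum(memory_usage) FROM system.merges",
}


def collect(run_scalar: Callable[[str], float]) -> Dict[str, float]:
    """Run every query through the supplied runner and return name -> value."""
    return {name: float(run_scalar(sql)) for name, sql in METRIC_QUERIES.items()}


def breaches(values: Dict[str, float], thresholds: Dict[str, float]) -> List[str]:
    """Return the sorted names of metrics whose value exceeds its threshold."""
    return sorted(
        name for name, limit in thresholds.items() if values.get(name, 0.0) > limit
    )
```

In use, `run_scalar` would wrap whatever client is already in place (for example, a function that executes the SQL and returns the first column of the first row), and the result of `breaches` could feed an alerting rule.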