You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: kafka-observ-lib/signals/totalTime.libsonnet
+9-5Lines changed: 9 additions & 5 deletions
Original file line number
Diff line number
Diff line change
@@ -16,23 +16,27 @@ function(this)
16
16
signals: {
17
17
18
18
local commonRequestQueueDescription = |||
19
-
A high value can imply there aren't enough IO threads or the CPU is a bottleneck,
20
-
or the request queue isnt large enough. The request queue size should match the number of connections.
19
+
High values indicate insufficient IO threads, CPU bottlenecks, or undersized request queue.
20
+
Queue size should match connection count.
21
21
|||,
22
22
23
23
local commonLocalDescription = |||
24
-
In most cases, a high value can imply slow local storage or the storage is a bottleneck. One should also investigate LogFlushRateAndTimeMs to know how long page flushes are taking, which will also indicate a slow disk. In the case of FetchFollower requests, time spent in LocalTimeMs can be the result of a ZooKeeper write to change the ISR.
24
+
High values often indicate slow storage or disk bottlenecks.
25
+
Check LogFlushRateAndTimeMs for disk performance issues.
25
26
|||,
26
27
27
28
local commonRemoteDescription = |||
29
+
For fetch requests, high values may indicate caught-up consumers with no new data (normal if near max wait time).
30
+
Configure via replica.fetch.wait.max.ms and fetch.max.wait.ms.
28
31
|||,
29
32
30
33
local commonResponseQueueDescription = |||
31
-
A high value can imply there aren't enough network threads or the network cant dequeue responses quickly enough, causing back pressure in the response queue.
34
+
High values indicate insufficient network threads or slow network dequeue causing backpressure.
32
35
|||,
33
36
34
37
local commonResponseDescription = |||
35
-
A high value can imply the zero-copy from disk to the network is slow, or the network is the bottleneck because the network cant dequeue responses of the TCP socket as quickly as theyre being created. If the network buffer gets full, Kafka will block.
38
+
High values indicate slow zero-copy operations or network saturation.
0 commit comments