fix(metrics): add replication task latency histogram#7684
fix(metrics): add replication task latency histogram#7684zawadzkidiana merged 9 commits intocadence-workflow:masterfrom
Conversation
8db2480 to
59dd869
Compare
Code Review 👍 Approved with suggestions 0 resolved / 1 findingsClean observability enhancement adding exponential histograms for replication task-ack metrics. Follows established dual-emit patterns correctly; minor missing test for clampInt64ToInt helper remains from prior review. 💡 Quality: Missing unit tests for
|
| Auto-apply | Compact |
|
|
Was this helpful? React with 👍 / 👎 | Gitar
72a5c2b to
94f4f2c
Compare
e3ddec0 to
3669c81
Compare
811185a to
cfac903
Compare
b52020f to
cef3eb5
Compare
…ed diff Dual-emit exponential histograms alongside existing timers for ReplicationTasksLag, ReplicationTasksReturned, and ReplicationTasksReturnedDiff to support histogram migration. ReplicationTasksLag uses the already-defined ExponentialReplicationTasksLag (duration-based Mid1ms24h). Returned and ReturnedDiff use new integer histograms (Mid1To16k) with _counts suffix. All six metric names added to HistogramMigrationMetrics allowlist. Signed-off-by: Diana Zawadzki <dzawa@live.de>
Address review feedback by removing task_latency_ns histogram and using ExponentialTaskProcessingLatency in task_ack_manager. Also remove stale histogram-migration allowlist entries for task_latency/task_latency_ns. Signed-off-by: zawadzki <zawadzki@uber.com>
Signed-off-by: Diana Zawadzki <dzawa@live.de>
1d18b34 to
6a32d1c
Compare
Signed-off-by: Diana Zawadzki <dzawa@live.de>
Signed-off-by: Diana Zawadzki <dzawa@live.de>
Add integer histogram dual-emission for ReplicationTasksFetched and ReplicationTasksLagRaw, and include their names in HistogramMigrationMetrics for config consistency checks. Signed-off-by: Diana Zawadzki <dzawa@live.de>
…ager Signed-off-by: Diana Zawadzki <dzawa@live.de>
Signed-off-by: Diana Zawadzki <dzawa@live.de>
Signed-off-by: Diana Zawadzki <dzawa@live.de>
Code Review ✅ Approved 4 resolved / 4 findingsAll 5 previously-identified issues (wrong metric constant, merge conflicts, missing config entries, mismatched histogram method) have been resolved. Metric names, bucket types, and histogram emission methods are now consistent across config.go, defs.go, and task_ack_manager.go. ✅ 4 resolved✅ Bug: Unresolved merge conflict markers in config.go
✅ Bug: Histogram paired with wrong timer: TaskLatency vs TaskProcessingLatency
✅ Bug: Missing
|
| Auto-apply | Compact |
|
|
Was this helpful? React with 👍 / 👎 | Gitar
What changed?
Added an exponential histogram for replication task latency in TaskAckManager alongside the existing timer (task_latency_ns)
Why?
Replication task latency was tracked only via timers, which limits distribution analysis. Adding a histogram retains the existing timer while enabling bucketed/percentile insights in M3/Grafana without changing processing logic.
How did you test it?
Verified metric emission in uMonitor.
Potential risks
Low risk: additional metrics emission on the replication ack path; minimal overhead.
Release notes
Replication now emits task_latency_ns histogram for replication task latency. CadenceCDNC-17610
Documentation Changes
N/A