You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: content/integrate/redis-data-integration/observability.md
+23-1Lines changed: 23 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -113,6 +113,25 @@ RDI reports with their descriptions.
113
113
|`monitor_time_elapsed_created`| Gauge | Timestamp when the monitor time elapsed counter was created | Informational - no alerting needed |
114
114
|`rdi_incoming_entries`| Gauge | Count of incoming events by `data_source` and `operation` type (pending, inserted, updated, deleted, filtered, rejected) | Informational - monitor for trends, alert only on "rejected" > 0 |
115
115
|`rdi_stream_event_latency_ms`| Gauge | Latency in milliseconds of the oldest event in each data stream, labeled by `data_source`| Informational - monitor based on business SLA requirements |
116
+
|**Processor Performance Total Metrics**||||
117
+
|`rdi_processed_batches_total`| Counter | Total number of processed batches | Informational - use for data ingestion and load tracking |
118
+
|`rdi_processor_batch_size_total`| Counter | Total batch size across all processed batches | Informational - use for throughput analysis |
119
+
|`rdi_processor_read_time_ms_total`| Counter | Total read time in milliseconds across all batches | Informational - use for performance analysis |
120
+
|`rdi_processor_transform_time_ms_total`| Counter | Total transform time in milliseconds across all batches | Informational - use for performance analysis |
121
+
|`rdi_processor_write_time_ms_total`| Counter | Total write time in milliseconds across all batches | Informational - use for performance analysis |
122
+
|`rdi_processor_process_time_ms_total`| Counter | Total process time in milliseconds across all batches | Informational - use for performance analysis |
123
+
|`rdi_processor_ack_time_ms_total`| Counter | Total acknowledgment time in milliseconds across all batches | Informational - use for performance analysis |
124
+
|`rdi_processor_total_time_ms_total`| Counter | Sum of the total `read_time`, `process_time` and `ack_time` values in milliseconds across all batches | Informational - use for performance analysis |
125
+
|`rdi_processor_rec_per_sec_total`| Gauge | Total records per second across all batches | Informational - use for throughput analysis |
126
+
|**Processor Performance Last Batch Metrics**||||
127
+
|`rdi_processor_batch_size_last`| Gauge | Last batch size processed | Informational - use for real-time monitoring |
128
+
|`rdi_processor_read_time_ms_last`| Gauge | Last batch read time in milliseconds | Informational - use for real-time performance monitoring |
129
+
|`rdi_processor_transform_time_ms_last`| Gauge | Last batch transform time in milliseconds | Informational - use for real-time performance monitoring |
130
+
|`rdi_processor_write_time_ms_last`| Gauge | Last batch write time in milliseconds | Informational - use for real-time performance monitoring |
131
+
|`rdi_processor_process_time_ms_last`| Gauge | Last batch process time in milliseconds | Informational - use for real-time performance monitoring |
132
+
|`rdi_processor_ack_time_ms_last`| Gauge | Last batch acknowledgment time in milliseconds | Informational - use for real-time performance monitoring |
133
+
|`rdi_processor_total_time_ms_last`| Gauge | Last batch total time in milliseconds | Informational - use for real-time performance monitoring |
134
+
|`rdi_processor_rec_per_sec_last`| Gauge | Last batch records per second | Informational - use for real-time throughput monitoring |
116
135
117
136
{{< note >}}
118
137
**Additional information about stream processor metrics:**
@@ -121,11 +140,14 @@ RDI reports with their descriptions.
121
140
- Metrics with the `_created` suffix are automatically generated by Prometheus for counters and gauges to track when they were first created.
122
141
- The `rdi_incoming_entries` metric provides a detailed breakdown for each data source by operation type.
123
142
- The `rdi_stream_event_latency_ms` metric helps monitor data freshness and processing delays.
143
+
- The processor performance metrics are divided into two categories:
144
+
-**Total metrics**: Accumulate values across all processed batches for historical analysis
145
+
-**Last batch metrics**: Show real-time performance data for the most recently processed batch
124
146
{{< /note >}}
125
147
126
148
## Recommended alerting strategy
127
149
128
-
The alerting strategy described in the sections below focuses on system failures and data integrity issues that require immediate attention. Most ther metrics are informational, so you should monitor them for trends rather than trigger alerts.
150
+
The alerting strategy described in the sections below focuses on system failures and data integrity issues that require immediate attention. Most other metrics are informational, so you should monitor them for trends rather than trigger alerts.
0 commit comments