Skip to content

Commit 37e0fdc

Browse files
Replace rdi_processor_performance_batch_avg with multiple standalone
metrics and write a description for them
1 parent b17f5c4 commit 37e0fdc

File tree

1 file changed

+22
-3
lines changed

1 file changed

+22
-3
lines changed

content/integrate/redis-data-integration/observability.md

Lines changed: 22 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -115,7 +115,24 @@ RDI reports with their descriptions.
115115
| `rdi_stream_event_latency_ms` | Gauge | Latency in milliseconds of the oldest event in each data stream, labeled by `data_source` | Informational - monitor based on business SLA requirements |
116116
| `rdi_processed_batches_total` | Counter | Count of the total processed batches | Informational - use for data ingestion and load tracking |
117117
| `rdi_processed_batches_created` | Gauge | Timestamp when the processed batches counter was created | Informational - no alerting needed |
118-
| `rdi_processor_performance_batch_avg` | Gauge | Average processor performance for all batches | Informational - use for data ingestion tracking and debugging purposes |
118+
| **Processor Performance Total Metrics** | | | |
119+
| `rdi_processor_batch_size_total` | Counter | Total batch size across all processed batches | Informational - use for throughput analysis |
120+
| `rdi_processor_read_time_ms_total` | Counter | Total read time in milliseconds across all batches | Informational - use for performance analysis |
121+
| `rdi_processor_transform_time_ms_total` | Counter | Total transform time in milliseconds across all batches | Informational - use for performance analysis |
122+
| `rdi_processor_write_time_ms_total` | Counter | Total write time in milliseconds across all batches | Informational - use for performance analysis |
123+
| `rdi_processor_process_time_ms_total` | Counter | Total process time in milliseconds across all batches | Informational - use for performance analysis |
124+
| `rdi_processor_ack_time_ms_total` | Counter | Total acknowledgment time in milliseconds across all batches | Informational - use for performance analysis |
125+
| `rdi_processor_total_time_ms_total` | Counter | Total total time in milliseconds across all batches | Informational - use for performance analysis |
126+
| `rdi_processor_rec_per_sec_total` | Counter | Total records per second across all batches | Informational - use for throughput analysis |
127+
| **Processor Performance Last Batch Metrics** | | | |
128+
| `rdi_processor_batch_size_last` | Gauge | Last batch size processed | Informational - use for real-time monitoring |
129+
| `rdi_processor_read_time_ms_last` | Gauge | Last batch read time in milliseconds | Informational - use for real-time performance monitoring |
130+
| `rdi_processor_transform_time_ms_last` | Gauge | Last batch transform time in milliseconds | Informational - use for real-time performance monitoring |
131+
| `rdi_processor_write_time_ms_last` | Gauge | Last batch write time in milliseconds | Informational - use for real-time performance monitoring |
132+
| `rdi_processor_process_time_ms_last` | Gauge | Last batch process time in milliseconds | Informational - use for real-time performance monitoring |
133+
| `rdi_processor_ack_time_ms_last` | Gauge | Last batch acknowledgment time in milliseconds | Informational - use for real-time performance monitoring |
134+
| `rdi_processor_total_time_ms_last` | Gauge | Last batch total time in milliseconds | Informational - use for real-time performance monitoring |
135+
| `rdi_processor_rec_per_sec_last` | Gauge | Last batch records per second | Informational - use for real-time throughput monitoring |
119136

120137
{{< note >}}
121138
**Additional information about stream processor metrics:**
@@ -124,12 +141,14 @@ RDI reports with their descriptions.
124141
- Metrics with the `_created` suffix are automatically generated by Prometheus for counters and gauges to track when they were first created.
125142
- The `rdi_incoming_entries` metric provides a detailed breakdown for each data source by operation type.
126143
- The `rdi_stream_event_latency_ms` metric helps monitor data freshness and processing delays.
127-
- The `rdi_processor_performance_batch_avg` metric provides detailed performance insights.
144+
- The processor performance metrics are divided into two categories:
145+
- **Total metrics** (Counters): Accumulate values across all processed batches for historical analysis
146+
- **Last batch metrics** (Gauges): Show real-time performance data for the most recently processed batch
128147
{{< /note >}}
129148

130149
## Recommended alerting strategy
131150

132-
The alerting strategy described in the sections below focuses on system failures and data integrity issues that require immediate attention. Most ther metrics are informational, so you should monitor them for trends rather than trigger alerts.
151+
The alerting strategy described in the sections below focuses on system failures and data integrity issues that require immediate attention. Most other metrics are informational, so you should monitor them for trends rather than trigger alerts.
133152

134153
### Critical alerts (immediate response required)
135154

0 commit comments

Comments
 (0)