You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: administration/monitoring.md
+84Lines changed: 84 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -203,6 +203,7 @@ The following terms are key to understanding how Fluent Bit processes metrics:
203
203
|`fluentbit_output_retried_records_total`| name: the name or alias for the output instance | The number of log records that experienced a retry. This metric is calculated at the chunk level, the count increased when an entire chunk is marked for retry. An output plugin might perform multiple actions that generate many error messages when uploading a single chunk. | counter | records |
204
204
|`fluentbit_output_retries_failed_total`| name: the name or alias for the output instance | The number of times that retries expired for a chunk. Each plugin configures a `Retry_Limit`, which applies to chunks. When the `Retry_Limit` is exceeded, the chunk is discarded and this metric is incremented. | counter | chunks |
205
205
|`fluentbit_output_retries_total`| name: the name or alias for the output instance | The number of times this output instance requested a retry for a chunk. | counter | chunks |
206
+
|`fluentbit_output_latency_seconds`| input: the name of the input plugin instance, output: the name of the output plugin instance | End-to-end latency from chunk creation to successful delivery. Provides observability into chunk-level pipeline performance. | histogram | seconds |
206
207
|`fluentbit_uptime`| hostname: the hostname on running Fluent Bit | The number of seconds that Fluent Bit has been running. | counter | seconds |
207
208
|`fluentbit_process_start_time_seconds`| hostname: the hostname on running Fluent Bit | The Unix Epoch time stamp for when Fluent Bit started. | gauge | seconds |
208
209
|`fluentbit_build_info`| hostname: the hostname, version: the version of Fluent Bit, os: OS type | Build version information. The returned value is originated from initializing the Unix Epoch time stamp of configuration context. | gauge | seconds |
@@ -231,6 +232,89 @@ The following are detailed descriptions for the metrics collected by the storage
231
232
|`fluentbit_output_upstream_total_connections`| name: the name or alias for the output instance | The sum of the connection count of each output plugins. | gauge | bytes |
232
233
|`fluentbit_output_upstream_busy_connections`| name: the name or alias for the output instance | The sum of the connection count in a busy state of each output plugins. | gauge | bytes |
233
234
235
+
### Output latency metric
236
+
237
+
> note: feature introduced in v4.0.6.
238
+
239
+
The `fluentbit_output_latency_seconds` histogram metric captures end-to-end latency from the time a chunk is created by an input plugin until it is successfully delivered by an output plugin. This provides observability into chunk-level pipeline performance and helps identify slowdowns or bottlenecks in the output path.
240
+
241
+
#### Bucket configuration
242
+
243
+
The histogram uses the following default bucket boundaries, designed around Fluent Bit's typical flush interval of 1 second:
244
+
245
+
```
246
+
0.5, 1.0, 1.5, 2.5, 5.0, 10.0, 20.0, 30.0, +Inf
247
+
```
248
+
249
+
These boundaries provide:
250
+
-**High resolution around 1s latency**: Captures normal operation near the default flush interval
251
+
-**Small backpressure detection**: Identifies minor delays in the 1-2.5s range
252
+
-**Bottleneck identification**: Detects retry cycles, network stalls, or plugin bottlenecks in higher ranges
253
+
-**Complete coverage**: The `+Inf` bucket ensures all latencies are captured
254
+
255
+
#### Example output
256
+
257
+
When exposed via Fluent Bit's built-in HTTP server, the metric appears in Prometheus format:
258
+
259
+
```prometheus
260
+
# HELP fluentbit_output_latency_seconds End-to-end latency in seconds
0 commit comments