Incomplete log records in Kubernetes

**Describe the bug**

Fluent Bit emits incomplete (split) log records during container log file rotation managed by `containerd`.
When `containerd` splits a log record across two files at rotation time,
Fluent Bit forwards each fragment as a separate log record instead of joining them.
The result is malformed JSON records with missing fields that arrive at the destination silently broken.

The bug appears only under high load, since only in this case `containerd` splits a log record across two files.
Over 1 hour of testing at 10k logs/sec from 2 Pods, Fluent Bit produced 34 split records.

**To Reproduce**

1. Clone the benchmark repository:
```
git clone https://github.com/VictoriaMetrics/log-collectors-benchmark
cd log-collectors-benchmark
```

2. Create a `kind` Kubernetes cluster (requires `kubectl`, `kind`, `helm`, `docker`, `make`):
```
kind create cluster --name log-collectors-bench
```

3. Install [VictoriaLogs](https://docs.victoriametrics.com/victorialogs/) as the log storage backend:
```
helm repo add vm https://victoriametrics.github.io/helm-charts/

helm install vls vm/victoria-logs-single --namespace logging --create-namespace
```

4. Configure Fluent Bit to write to VictoriaLogs:
```
make set-endpoint VLS_HOST='vls-victoria-logs-single-server.logging.svc.cluster.local' VLS_PORT=9428
```

5. Deploy Fluent Bit:
```
make bench-up-fluent-bit
```

6. Start the load generator:
```
make bench-up-generator GENERATOR_REPLICAS=1 LOGS_PER_SECOND=10000 RAMP_UP=false
```

You can increase the number of the load generator replicas (`GENERATOR_REPLICAS`) to greater value if your machine is fast enough.
This will increase the load and a chance to reproduce the bug.

7. Forward the VictoriaLogs port to your local machine:
```
kubectl port-forward -n logging vls-victoria-logs-single-server-0 9428:9428
```

8. Wait approximately 30 minutes (the bug is intermittent and appears only under sustained load).

9. Query VictoriaLogs for malformed records using the expression `sequence_id:""` -
this finds all records missing the `sequence_id` field, which are the split fragments.

10. Clean up:
```
make bench-down-all
```

**Expected behavior**

Fluent Bit should detect that a log record was split at a file rotation boundary and reconstruct the complete record before forwarding it.

**Screenshots**

<img width="1719" height="682" alt="Image" src="https://github.com/user-attachments/assets/857100e4-b1a5-4ad9-af5b-1f295843d4ee" />

**Your Environment**

* Version used: **v4.2.3**
* Configuration: default config from the [official Helm chart](https://github.com/fluent/helm-charts). See full configuration here: https://github.com/VictoriaMetrics/log-collectors-benchmark/blob/11e1fa7760a4b53d9ed1f59a61d2195b627bd2f9/values/fluent-bit.yml
* Environment name and version: Kubernetes (`kind` v0.31.0), single-node cluster
* Server type and version: GCP `n2-highcpu-32` (32 vCPU, 32 GiB RAM, local SSD)
* Operating System and version: Ubuntu 22.04
* Filters and plugins: `tail` input plugin reading from `/var/log/pods`, JSON parser

**Additional context**

The root cause appears to be specific to the **last log record of a file** at rotation time.
The record is split across two files by `containerd` and is marked with the partial flag (`P` in CRI format),
even though its size does not exceed the standard 16 KiB threshold at which `containerd` normally splits long lines.
Fluent Bit forwards each part as a separate record instead of waiting for and joining the continuation from the new file.

We custom-modified our collector to verify that the issue is rotation-specific. Since other collectors don't encounter this, we've confirmed the application is writing logs properly and isn't the source of truncated or partial log lines.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Incomplete log records in Kubernetes #11602

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Incomplete log records in Kubernetes #11602

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions