feat: Apply upload criteria to memory buffering; Upload the memory buffer to S3 on exit. (#17)
Conversation
Co-authored-by: davidlion <davidlion2@protonmail.com>
davidlion
left a comment
The title seems to not really match the changes or the description. We probably need to either change the title or add more context to the description.
internal/irzstd/memory.go
// Checks if writer is empty. True if no events are buffered. Try to avoid calling this, as it
// will flush the Zstd writer, potentially creating unnecessary frames.
//
// Returns:
//   - empty: Boolean value that is true if buffer is empty
//   - err: nil error to comply with interface
func (w *memoryWriter) CheckEmpty() (bool, error) {
	w.zstdWriter.Flush()
The name of this method doesn't match what it does and to some degree doesn't really match its use case either.
the plugin now attempts to flush buffered data to S3 on shutdown. This is not needed for disk buffer mode since those files are recovered on restart (however, it may make sense to make this a configurable option in the future).
I feel it is confusing that during a graceful exit we don't upload to S3 when using disk buffering. Instead we wait until restart for recovery to handle this. If I stop fluentbit, I'd expect all the logs available to be uploaded.
I feel adding a flush method would fit the use case more.
What do you think?
Just to clarify — are you suggesting introducing a separate Flush() method on the writer, in addition to CheckEmpty()?
I understand the expectation, but I was hesitant to upload on graceful shutdown in disk buffer mode because it can introduce duplication or corruption issues.
If we upload but the process is forcefully exited before the disk files are properly truncated, recovery could upload them again and create duplicates. Forcing additional flushes from the IR file to the Zstd file during shutdown could also be slow and, if interrupted mid-write, leave files in an inconsistent state.
One of the reasons to select disk buffering mode is fault tolerance; exiting quickly and relying on upload on restart is the safe path.
If we want “upload everything on stop” behavior for disk buffering, we could consider making it a configurable option if that’s something users would expect.
Just to clarify — are you suggesting introducing a separate Flush() method on the writer, in addition to CheckEmpty()?
Yup
For the behaviour, I think it is fair as long as we document everything.
I feel the current title reflects the high-level change from a user perspective. Previously, memory mode sent each event directly to S3 as it was received. With this change, users can configure a specific upload size before data is flushed. Because the old mode did not buffer data, graceful shutdown handling wasn't necessary. Introducing buffering made that logic required. That said, I agree the description could be updated to better align with the title. We could also consider referencing S3 explicitly in the title if that would make the scope clearer. Let me know if you still feel the title doesn't capture the change.
// - readyToUpload: Boolean if upload criteria met or not
// - err: Error getting Zstd buffer size
func checkUploadCriteriaMet(eventManager *outctx.EventManager, uploadSizeMb int) (bool, error) {
	if !eventManager.Writer.GetUseDiskBuffer() {
this single change alters the buffering behaviour so that events are no longer uploaded immediately
davidlion
left a comment
Sorry for the slow review.
I think renaming CheckEmpty to Empty and adding a Flush method to the interface (and calling both in ToS3) should be fine.
I still feel it is confusing that in one mode it will upload on exit and in another it won't, but as long as it is documented I don't think it is a blocking problem (and I was too caught up on it).
You can update the readme in this PR or another, up to you.
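The rename-and-split suggested above (an `Empty` that only reports state, plus an explicit `Flush`) could be sketched roughly as follows. This is an illustrative toy, not the plugin's actual code: the real writer wraps a Zstd encoder, while here a second `bytes.Buffer` stands in for the compressed output.

```go
package main

import (
	"bytes"
	"fmt"
)

// Writer sketches the irzstd writer interface after the suggested change:
// Empty reports whether any events are buffered, and Flush forces pending
// data through the compressor. Both would be called from ToS3.
type Writer interface {
	Empty() (bool, error)
	Flush() error
}

// memoryWriter is a toy stand-in for the real type, which wraps a Zstd writer.
type memoryWriter struct {
	pending bytes.Buffer // events not yet flushed
	flushed bytes.Buffer // stand-in for the compressed output buffer
}

// Empty reports buffer state without flushing as a side effect,
// unlike the original CheckEmpty.
func (w *memoryWriter) Empty() (bool, error) {
	return w.pending.Len() == 0 && w.flushed.Len() == 0, nil
}

// Flush moves pending bytes to the output buffer; the real writer
// would flush the current Zstd frame here instead.
func (w *memoryWriter) Flush() error {
	_, err := w.pending.WriteTo(&w.flushed)
	return err
}

func main() {
	var w Writer = &memoryWriter{}
	empty, _ := w.Empty()
	fmt.Println(empty) // true: nothing buffered yet

	mw := w.(*memoryWriter)
	mw.pending.WriteString("log event")
	_ = w.Flush()
	empty, _ = w.Empty()
	fmt.Println(empty) // false: flushed data awaits upload
}
```

Keeping `Empty` side-effect free and flushing explicitly avoids the surprise frames the original `CheckEmpty` comment warns about.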
Description
PR adds support to buffer logs in memory rather than immediately flushing them when not using disk buffer. Note the functionality gain here is not really the point of this PR. In reality, this is a refactoring PR to address interface issues in a different PR - see comment. Once this change is complete, the behaviour of disk and memory modes will be more similar, and we can remove separate diskUploadListener and memoryUploadListener (unify into a single upload listener) in #8.
Memory buffering
Changed memory buffer behavior to buffer logs before uploading to S3. Previously, memory buffer mode would immediately send logs to S3 on each flush. Now it accumulates logs until the configured upload size is reached, making it consistent with disk buffer behavior.
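The size criterion described above can be sketched as a simple threshold comparison. This is a simplified, hypothetical version of the plugin's `checkUploadCriteriaMet`; the real function reads the Zstd buffer size from the event manager.

```go
package main

import "fmt"

// uploadCriteriaMet reports whether the accumulated compressed buffer has
// reached the configured upload size. bufferedBytes would come from the
// Zstd buffer; uploadSizeMb is the user-configured threshold in MiB.
func uploadCriteriaMet(bufferedBytes int, uploadSizeMb int) bool {
	return bufferedBytes >= uploadSizeMb<<20 // MiB -> bytes
}

func main() {
	fmt.Println(uploadCriteriaMet(1024, 16))   // false: 1 KiB < 16 MiB
	fmt.Println(uploadCriteriaMet(16<<20, 16)) // true: threshold reached
}
```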
Graceful exit to S3
Added graceful exit handling for memory buffer mode. Since memory buffer logs are not persisted to disk, the plugin now attempts to flush buffered data to S3 on shutdown. This is not needed for disk buffer mode since those files are recovered on restart (however, it may make sense to make this a configurable option in the future).
Checklist
breaking change.
Validation performed
Tested that logs are buffered and that the buffered file sent to S3 is readable by the log viewer. Tested that logs are sent to S3 when a kill signal is received, and that the resulting file is readable by the log viewer.