Trying to understand chunking, retry and buffer drop #4093
Replies: 5 comments 4 replies
-
The version is too old, please consider using the latest Fluentd version.
Since you set these options, Fluentd flushes chunks every 5 seconds, so the chunk size can be a few KBs.
This is something strange... It may be the old version's issue.
I think it works sequentially in one flush thread.
It means the data is lost. |
Beta Was this translation helpful? Give feedback.
-
So in my case, since |
Beta Was this translation helpful? Give feedback.
-
What's the relation between While total_limit_size says that if the buffer is full, any newly generated additional data is lost. And overflow_action says there is an option to drop_oldest_chunk. That would mean, oldest data is lost and new data is added to buffer. |
Beta Was this translation helpful? Give feedback.
-
Are you referring to these documents?
I don't think these explanations are contradictory. By default, it should behave as described in |
Beta Was this translation helpful? Give feedback.
-
If you think this can be explained more clearly, we are welcome to receive PR! |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
We are using version
1.10.2
withbuf_file
and have been having some issue lately where lot of messages are being lost.I am trying to understand how exactly the parameters work
This is our
td-agent.conf
:We have chunk_limit_size of 2MB but I observe files of a few KBs as well. I thought each file(chunk) should be 2MB until we start with new file. Isn't it ?
The
retry_wait
andretry_timeout
is applicable to each of the chunks individually ? With only 1 worker, it works sequentially that one chunk is retried till timeout is hit, then it picks the next chunk ?I am also wondering if we set the
retry_timeout
> (retry_wait
*retry_max_times
), how does it work ? I mean, if someone sets only 2 retries after 15 seconds, but timeout of say 10 minutes. How will that work ?I see such messages in logs
next_retry_seconds
is exactly after 15sec as configured in the conf file. But the next retry actually appears after 1 minute, why is that so ?And after 15minutes, when it prints
dropping all chunks
, what all is actually dropped ?Beta Was this translation helpful? Give feedback.
All reactions