Prevent Vector from ingesting old log lines upon restart after a maintenance #18041

atibdialpad · 2023-07-21T03:33:30Z

atibdialpad
Jul 21, 2023

I have a use case where Vector is tailing logs from a file (file source) on a linux machine. Now, there are times when I transition the machine to under maintenance at which point we stop vector (and many other processes). Once the maintenance is done, Vector is restarted and the machine is marked operational.
Sometimes during the maintenance, there are lot of errors that gets logged in some of the log files vector was tailing, so when Vector comes back up after the maintenance it ingests all those old logs (due to the checkpoint memory) and these old (not important) logs cause false alarms in our monitoring system. The idea is "errors are expected when in maintenance and we do not want to ingest and alert on them"

To solve this issue, I am thinking of doing the following :

When Vector restarts after the maintenance, let it start afresh by deleting the checkpointing file.
Use https://vector.dev/docs/reference/configuration/sources/file/#read_from so that when the files are re-discovered, only new logs are ingested.

I am yet to test this but does that sound okay @jszwedko ?

gurudeepdialpad · 2023-07-21T07:05:48Z

gurudeepdialpad
Jul 21, 2023

How will you identify a vector restart after a maintenance vs a restart after a crash?

0 replies

atibdialpad · 2023-07-21T07:43:18Z

atibdialpad
Jul 21, 2023
Author

How will you identify a vector restart after a maintenance vs a restart after a crash?

Great Question !!!

So, the trick is having the

When Vector restarts after the maintenance, let it start afresh by deleting the checkpointing file

part outside of vector and in the maintenance workflow code. To be very specific to our use case which you are very familiar with @gurudeepdialpad :-) , I will have the code to delete the checkpointing files when we make the machine ready to start Vector in the Ansible playbook which only executes on a FRESH start.

Nothing changes for crashes. When vector restarts after a crash it resumes from where it left which is the correct and intentional behaviour.

1 reply

gurudeepdialpad Jul 21, 2023

Nice. However, you still have the second part where you are setting up vector to start from the end of the file in the vector configs.
I wonder what vector will choose when there is a checkpointing file and then a non-maintenance restart where the configs might say start from the end of the file.

atibdialpad · 2023-07-21T11:39:36Z

atibdialpad
Jul 21, 2023
Author

I wonder what vector will choose when there is a checkpointing file and then a non-maintenance restart where the configs might say start from the end of the file.

checkpoint takes preference here.

Checkpoint exists and read_from=end ---> Vector resume from the last checkpoint
checkpoint doesn't exist and read_from=end --> Vector reads newly appended lines (like tailing)

The main catch is that without the checkpoint file it's as if vector is "discovering" a new log file (from the config) upon (re)start so it will follow the "read_from" config. With the checkpoint, vector is NOT discovering and irrespective of the read_from config it follows the checkpoint.

TL;DR
read_from is only applicable when discovering new log files.

1 reply

gurudeepdialpad Jul 21, 2023

Fantastic. Looks like we have everything covered.

jszwedko · 2023-07-21T19:28:27Z

jszwedko
Jul 21, 2023
Maintainer

That sounds like a reasonable approach to me @atibdialpad ! Thanks for sharing. Maybe it'll help other Vector users with similar scenarios.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Prevent Vector from ingesting old log lines upon restart after a maintenance #18041

Uh oh!

{{title}}

Uh oh!

Replies: 4 comments 2 replies

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Prevent Vector from ingesting old log lines upon restart after a maintenance #18041

Uh oh!

atibdialpad Jul 21, 2023

Replies: 4 comments · 2 replies

Uh oh!

gurudeepdialpad Jul 21, 2023

Uh oh!

Uh oh!

atibdialpad Jul 21, 2023 Author

Uh oh!

gurudeepdialpad Jul 21, 2023

Uh oh!

Uh oh!

atibdialpad Jul 21, 2023 Author

Uh oh!

gurudeepdialpad Jul 21, 2023

Uh oh!

jszwedko Jul 21, 2023 Maintainer

atibdialpad
Jul 21, 2023

Replies: 4 comments 2 replies

gurudeepdialpad
Jul 21, 2023

atibdialpad
Jul 21, 2023
Author

atibdialpad
Jul 21, 2023
Author

jszwedko
Jul 21, 2023
Maintainer