Unintended log deletions - "Used disk space exceeds LOG_SIZE_LIMIT (0 GB)" #15617
-
Version2.4.201 Installation MethodSecurity Onion ISO image Descriptionother (please provide detail below) Installation Typeother (please provide detail below) Locationon-prem with Internet access Hardware SpecsExceeds minimum requirements CPU32 RAM33.2 GB Storage for /264.0 GB Storage for /nsm67068.1 GB Network Traffic Collectionother (please provide detail below) Network Traffic SpeedsLess than 1Gbps StatusYes, all services on all nodes are running OK Salt StatusYes, there are salt failures (please provide detail below) LogsYes, there are additional clues in /opt/so/log/ (please provide detail below) DetailHello Security Onion, I am writing in as a follow-up to my previous thread re: an issue with unprompted log deletion on our organization's Security Onion deployment. To summarize again, we have two nodes in our Onion grid: a main Standalone server and a Fleet node in our DMZ which sends logs to the first. On the Standalone node the following ILM retention settings are configured: Warm: 30d so-elasticsearch-indices-delete: On I logged in this morning to find that the issue we reported earlier had reoccurred over the weekend. Once again nearly all of our historical logs had been deleted from Dashboards long before the intended ILM retention thresholds, as a result of which nearly everything dated before 15MAR26 was gone. At the suggestion of a poster in the previous thread, I read through /opt/so/log/elasticsearch/so-elasticsearch-indices-delete.log from that day and found several entries of interest, a sample of which I have provided below: [...] Over the next couple of minutes several dozen logs like the ones above follow, cycling through our various log indices and deleting them, until the file terminated for the day with the following: While this does seem to confirm that elasticsearch-indices-delete is indeed the source of the log deletion, we still have several questions about how and why this is happening given the retention settings we have configured.
Thank you again for any and all assistance you can provide. Please let me know if you have any additional questions and I will follow up as soon as I am able. Result of salt-call state.highstate: Guidelines
|
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
|
A couple of places to check. In your Elasticsearch log, If this part of the script failed to yield a number: Then the To find the cron, look in the crontab. It runs every 5 minutes by default If you are confident that your ILM settings will keep the cluster healthy, you should disable the |
Beta Was this translation helpful? Give feedback.
A couple of places to check. In your Elasticsearch log,
/opt/so/log/elasticsearch/securityonion.loglook for an entry for the standalone node disconnecting from the cluster. When the script runs on the cron, the variable forLOG_SIZE_LIMITis calculated in theso-elasticsearch-indices-delete-deletescript asLOG_SIZE_LIMIT_GB=$(/usr/sbin/so-elasticsearch-cluster-space-total 50)If this part of the script failed to yield a number: