Search node CPU at 100% in grid, SO basically non-functional #13644
-
Version2.4.80 Installation MethodSecurity Onion ISO image Descriptionother (please provide detail below) Installation TypeDistributed Locationon-prem with Internet access Hardware SpecsExceeds minimum requirements CPU24 RAM192 Storage for /40 TB Storage for /nsm40 TB Network Traffic Collectionspan port Network Traffic Speeds1Gbps to 10Gbps StatusYes, all services on all nodes are running OK Salt StatusNo, there are no failures LogsYes, there are additional clues in /opt/so/log/ (please provide detail below) DetailDue to issues with my search node server after a graceful shutdown for maintenance, I had to wipe and reload the search node in my Security Onion environment. Originally, I had the OS installed on a separate NVME drive in the server. However, the server BIOS no longer sees that device as a valid boot option, forcing me to re-install SO on the 40 TB RAID virtual drive in the server instead. Both the OS and the /nsm volume are on the same RAID virtual disk now. Since reloading the search node it has been in a faulted state in my SO grid showing 100% CPU usage and "Process Status: Unknown". The OS Uptime metric appears to not be updating as well. However when looking at processes via the CLI, the actual CPU usage is close to 0% across all cores. Running salt-call state.highstate does not return any errors. so-status on both the manager and sensor nodes come back with all services up and running. The securityonion.log file in /opt/so/log/elasticsearch on the manager node is almost 5 GB in size and it appears to be mostly (if not all) WARN messages like the one below:
If there is a way to get the sensor node connected and working correctly without having to redo my entire Security Onion deployment I would be grateful. If other information would be helpful for troubleshooting I will do my best to provide it. Thanks. Guidelines
|
Beta Was this translation helpful? Give feedback.
Replies: 4 comments 19 replies
-
Beta Was this translation helpful? Give feedback.
-
When running "salt * state.highstate", I encouter this error on the search node. It is the only error returned from running the highstate across all nodes:
|
Beta Was this translation helpful? Give feedback.
-
@tsmith-spscc The title of this discussion specifies
Have you run full diagnostics on your hardware? |
Beta Was this translation helpful? Give feedback.
-
@jgiuliano2024 If you need help with your deployment, please start a separate discussion and provide detailed information about your deployment. |
Beta Was this translation helpful? Give feedback.
@tsmith-spscc Based on a similar discussion at #13691, check to make sure that all of your nodes are synced to the same NTP server and have the same exact time on them.