-
Notifications
You must be signed in to change notification settings - Fork 0
Description
I've noticed that when I run the socket_reader_test several times, the emu-crt-bern-01 process will occasionally crash. The console output says "Segmentation fault (invalid memory reference)"
This seems to happen as the system is shutting down. (maybe the stop or scrap transition?)
I've seen the problem happen both on np04-srv-005 (EHN1 DAQ computer) and daq.fnal.gov (Fermilab teststand computer). And, I've seen it both with the fddaq-v5.4.0-rc1 build and a recent nightly build.
Below are instructions for reproducing the problem. (To help run the test several times, I created a "bundle" script [copied from daqsystemtest]. We may or may not want to keep this script around after this problem is debugged.)
These instructions make use of a recently nightly build. I'll mark this Issue as targeted to the v5.5.0 release, but if someone wants to advocate for trying to get a fix into 5.4.0, we can certainly talk about that. In any case, it would be great if someone would investigate the source of the crash and implement any code changes that seem to be needed.
DATE_PREFIX=`date '+%d%b'`
TIME_SUFFIX=`date '+%H%M'`
source /cvmfs/dunedaq.opensciencegrid.org/setup_dunedaq.sh
setup_dbt latest
dbt-create -n NFD_DEV_250820_A9 ${DATE_PREFIX}FDDevTest_${TIME_SUFFIX}
cd ${DATE_PREFIX}FDDevTest_${TIME_SUFFIX}/sourcecode
git clone https://github.com/DUNE-DAQ/daqsystemtest.git -b develop
git clone https://github.com/DUNE-DAQ/asiolibs.git -b kbiery/adding_bundle_script
cd ..
dbt-workarea-env
dbt-build -j 12
dbt-workarea-env
asiolibs_integtest_bundle.sh -N 10 --stop-on-fail