Skip to content

Investigate occasional crashes in the socket_reader_test #21

@bieryAtFnal

Description

@bieryAtFnal

I've noticed that when I run the socket_reader_test several times, the emu-crt-bern-01 process will occasionally crash. The console output says "Segmentation fault (invalid memory reference)"

This seems to happen as the system is shutting down. (maybe the stop or scrap transition?)

I've seen the problem happen both on np04-srv-005 (EHN1 DAQ computer) and daq.fnal.gov (Fermilab teststand computer). And, I've seen it both with the fddaq-v5.4.0-rc1 build and a recent nightly build.

Below are instructions for reproducing the problem. (To help run the test several times, I created a "bundle" script [copied from daqsystemtest]. We may or may not want to keep this script around after this problem is debugged.)

These instructions make use of a recently nightly build. I'll mark this Issue as targeted to the v5.5.0 release, but if someone wants to advocate for trying to get a fix into 5.4.0, we can certainly talk about that. In any case, it would be great if someone would investigate the source of the crash and implement any code changes that seem to be needed.

DATE_PREFIX=`date '+%d%b'`
TIME_SUFFIX=`date '+%H%M'`

source /cvmfs/dunedaq.opensciencegrid.org/setup_dunedaq.sh
setup_dbt latest
dbt-create -n NFD_DEV_250820_A9 ${DATE_PREFIX}FDDevTest_${TIME_SUFFIX}
cd ${DATE_PREFIX}FDDevTest_${TIME_SUFFIX}/sourcecode

git clone https://github.com/DUNE-DAQ/daqsystemtest.git -b develop
git clone https://github.com/DUNE-DAQ/asiolibs.git -b kbiery/adding_bundle_script
cd ..

dbt-workarea-env
dbt-build -j 12
dbt-workarea-env

asiolibs_integtest_bundle.sh -N 10 --stop-on-fail

Metadata

Metadata

Assignees

Labels

bugSomething isn't working

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions