Skip to content

resizing of LPF tag data structures not done #59

@KADichev

Description

@KADichev

Zero class now exposes a method void resizeTagreg( size_t size ), which is available via lpf_err_t lpf_resize_tag_register(lpf_t ctx, size_t max_tags) in include/lpf/tags.h

However, this function is never called, and this leads to immediate failure of all tests relying on zero engine


 mpirun  -n 1  /storage/home/kdichev/LPF-gitlab2/build/src/MPI/zero_test
Running main() from /scratch/kdichev/.spack/stage/spack-stage-googletest-1.14.0-th5nac5n2cvmf3nluwlgarz242h2bug6/spack-src/googletest/src/gtest_main.cc
[==========] Running 11 tests from 1 test suite.
[----------] Global test environment set-up.
[----------] 11 tests from ZeroTests
[ RUN      ] ZeroTests.init
[       OK ] ZeroTests.init (0 ms)
[ RUN      ] ZeroTests.resizeMemreg
[       OK ] ZeroTests.resizeMemreg (0 ms)
[ RUN      ] ZeroTests.resizeMesgq
[       OK ] ZeroTests.resizeMesgq (6 ms)
[ RUN      ] ZeroTests.regVars
[srv04:1248680:0:1248680] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x8)
BFD: DWARF error: section .debug_info is larger than its filesize! (0x536af6 vs 0x3d6ae8)
BFD: DWARF error: section .debug_info is larger than its filesize! (0x536af6 vs 0x3d6ae8)

/storage/home/kdichev/LPF-gitlab2/src/MPI/zero.cpp: [ lpf::mpi::Zero::tryIncrement() ]
      ...
      252 {
      253     switch (phase) {
      254         case Phase::INIT:
==>   255             rcvdMsgCount[tag] = 0;
      256             getMsgCount[tag] = 0;
      257             m_recvInitMsgCount[tag] = 0;
      258             m_getInitMsgCount[tag] = 0;

BFD: DWARF error: section .debug_info is larger than its filesize! (0x536af6 vs 0x3d6ae8)
BFD: DWARF error: section .debug_info is larger than its filesize! (0x536af6 vs 0x3d6ae8)
==== backtrace (tid:1248680) ====
 0 0x00000000000537fc lpf::mpi::Zero::tryIncrement()  /storage/home/kdichev/LPF-gitlab2/src/MPI/zero.cpp:255
 1 0x000000000004c8c8 lpf::mpi::Zero::regLocal()  /storage/home/kdichev/LPF-gitlab2/src/MPI/zero.cpp:647
 2 0x000000000003c3fc ZeroTests_regVars_Test::TestBody()  /storage/home/kdichev/LPF-gitlab2/src/MPI/zero.t.cpp:98
 3 0x000000000005c20c testing::internal::HandleExceptionsInMethodIfSupported<testing::Test, void>()  ???:0
 4 0x000000000003ed0c testing::Test::Run()  ???:0
 5 0x000000000003eeb4 testing::TestInfo::Run()  ???:0
 6 0x000000000003f168 testing::TestSuite::Run()  ???:0
 7 0x000000000005291c testing::internal::UnitTestImpl::RunAllTests()  ???:0
 8 0x000000000003f25c testing::UnitTest::Run()  ???:0
 9 0x0000000000000af0 main()  ???:0
10 0x00000000000273fc __libc_init_first()  ???:0
11 0x00000000000274cc __libc_start_main()  ???:0
12 0x000000000003c1b0 _start()  ???:0
=================================
[srv04:1248680] *** Process received signal ***
[srv04:1248680] Signal: Segmentation fault (11)
[srv04:1248680] Signal code:  (-6)
[srv04:1248680] Failing at address: 0x272a00130da8
[srv04:1248680] [ 0] linux-vdso.so.1(__kernel_rt_sigreturn+0x0)[0x40002a7207dc]
[srv04:1248680] [ 1] /storage/home/kdichev/LPF-gitlab2/build/src/MPI/zero_test(+0x537fc)[0xaaaacccc37fc]
[srv04:1248680] [ 2] /storage/home/kdichev/LPF-gitlab2/build/src/MPI/zero_test(+0x4c8c8)[0xaaaacccbc8c8]
[srv04:1248680] [ 3] /storage/home/kdichev/LPF-gitlab2/build/src/MPI/zero_test(_ZN22ZeroTests_regVars_Test8TestBodyEv+0x80)[0xaaaacccac3fc]
[srv04:1248680] [ 4] /spack/users/kdichev/.spack/var/spack/environments/arm/.spack-env/view/lib/libgtest.so.1.14.0(_ZN7testing8internal35HandleExceptionsInMethodIfSupportedINS_4TestEvEET0_PT_MS4_FS3_vEPKc+0x5c)[0x40002a8ec20c]
[srv04:1248680] [ 5] /spack/users/kdichev/.spack/var/spack/environments/arm/.spack-env/view/lib/libgtest.so.1.14.0(_ZN7testing4Test3RunEv+0xf8)[0x40002a8ced0c]
[srv04:1248680] [ 6] /spack/users/kdichev/.spack/var/spack/environments/arm/.spack-env/view/lib/libgtest.so.1.14.0(_ZN7testing8TestInfo3RunEv+0x184)[0x40002a8ceeb4]
[srv04:1248680] [ 7] /spack/users/kdichev/.spack/var/spack/environments/arm/.spack-env/view/lib/libgtest.so.1.14.0(_ZN7testing9TestSuite3RunEv+0x298)[0x40002a8cf168]
[srv04:1248680] [ 8] /spack/users/kdichev/.spack/var/spack/environments/arm/.spack-env/view/lib/libgtest.so.1.14.0(_ZN7testing8internal12UnitTestImpl11RunAllTestsEv+0x37c)[0x40002a8e291c]
[srv04:1248680] [ 9] /spack/users/kdichev/.spack/var/spack/environments/arm/.spack-env/view/lib/libgtest.so.1.14.0(_ZN7testing8UnitTest3RunEv+0x9c)[0x40002a8cf25c]
[srv04:1248680] [10] /spack/users/kdichev/.spack/var/spack/environments/arm/.spack-env/view/lib/libgtest_main.so.1.14.0(main+0x40)[0x40002a740af0]
[srv04:1248680] [11] /lib/aarch64-linux-gnu/libc.so.6(+0x273fc)[0x40002abe73fc]
[srv04:1248680] [12] /lib/aarch64-linux-gnu/libc.so.6(__libc_start_main+0x98)[0x40002abe74cc]
[srv04:1248680] [13] /storage/home/kdichev/LPF-gitlab2/build/src/MPI/zero_test(_start+0x30)[0xaaaacccac1b0]
[srv04:1248680] *** End of error message ***
--------------------------------------------------------------------------
Primary job  terminated normally, but 1 process returned
a non-zero exit code. Per user-direction, the job has been aborted.
--------------------------------------------------------------------------
--------------------------------------------------------------------------
mpirun noticed that process rank 0 with PID 1248680 on node srv04 exited on signal 11 (Segmentation fault).

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions