Update DcgmRpmCheck after maintenance failure #497

Merged: jgphpc merged 2 commits into eth-cscs:main from jgphpc:maintenance_20261401 on Jan 15, 2026

Conversation

@jgphpc (Collaborator) commented Jan 15, 2026

  • test failed: /capstor/scratch/cscs/robinson/14Jan2026daint/cscs-reframe-tests/reframe.out
  • reproduce with:
reframe -c checks/system/gssr/dcgm_hook.py -n DcgmRpmCheck -r -J A=csstaff

Copilot AI (Contributor) left a comment

Pull request overview

This PR fixes a test failure in DcgmRpmCheck by ensuring deterministic output ordering when checking DCGM library checksums. The test was failing due to inconsistent ordering from the shasum command's output.

Changes:

  • Added |sort to the shasum command in dcgm_rpm.sh to ensure consistent alphabetical ordering
  • Updated the reference file 3.3.6.ref to reflect the new sorted order of library checksums

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.

| File | Description |
| --- | --- |
| checks/system/gssr/src/rpm/dcgm_rpm.sh | Added sort pipe to ensure deterministic ordering of shasum output |
| checks/system/gssr/src/rpm/3.3.6.ref | Updated reference file with alphabetically sorted checksums to match new script behavior |
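Why the reference file has to change together with the script can be illustrated as follows. This is a hedged sketch, not the actual check: `demo.ref`, `current.out`, and the three placeholder checksum lines are invented, and it assumes the comparison is order-sensitive (here a byte-for-byte `cmp`), which the PR does not spell out.

```shell
#!/bin/sh
# Hypothetical demo: demo.ref and its contents are invented; this is not 3.3.6.ref.
set -eu

workdir=$(mktemp -d)
trap 'rm -rf "$workdir"' EXIT

# A reference written in arbitrary (pre-sort) order.
printf '%s\n' 'ccc  libdemo_c.so' 'aaa  libdemo_a.so' 'bbb  libdemo_b.so' \
    > "$workdir/demo.ref"

# What a script that now pipes through sort would print for the same data.
sort "$workdir/demo.ref" > "$workdir/current.out"

# Same lines, different order: an order-sensitive comparison fails.
if cmp -s "$workdir/demo.ref" "$workdir/current.out"; then
    stale_ref_status=match
else
    stale_ref_status=mismatch
fi

# Rewrite the reference in sorted order; the comparison then succeeds.
sort -o "$workdir/demo.ref" "$workdir/demo.ref"
if cmp -s "$workdir/demo.ref" "$workdir/current.out"; then
    updated_ref_status=match
else
    updated_ref_status=mismatch
fi

echo "stale reference:   $stale_ref_status"
echo "updated reference: $updated_ref_status"
```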

@jgphpc (Collaborator, Author) commented Jan 15, 2026

cscs-ci run alps-daint-uenv;MY_UENV=prgenv-gnu/25.11:v1

@gppezzi (Collaborator) left a comment

tested on clariden and daint

@jgphpc merged commit b4d2d29 into eth-cscs:main on Jan 15, 2026 (2 checks passed)
@jgphpc deleted the maintenance_20261401 branch on January 15, 2026, 17:10

3 participants