Skip to content

Conversation

@weihuang-jedi
Copy link
Contributor

@weihuang-jedi weihuang-jedi commented Sep 18, 2025

DESCRIPTION OF CHANGES:

Enable reg-test on cloud, and make reg-test can run by other users outside role.ufs_utils, and outside account fv3-cpu.

TESTS CONDUCTED:

If there are changes to the build or source code, the tests below must be conducted. Contact a repository manager if you need assistance.

  • Compile branch on all Tier 1 machines using Intel (Orion, Jet, Ursa, Hercules and WCOSS2).
  • Compile branch on Ursa using GNU.
  • Compile branch in 'Debug' mode on WCOSS2.
  • Compile with Doxygen on any machine with no errors.
  • Run unit tests locally on any Tier 1 machine. (Ran on Ursa)
  • Run relevant consistency tests locally on all Tier 1 machines.
  • [x ] Run on cloud (AWS).

Optional test.

  • Run full set of chgres_cube consistency tests on Ursa.

DEPENDENCIES:

None

DOCUMENTATION:

All new and updated source code must be documented with Doxygen.

  • Doxygen is updated.

If this PR is contributing new capabilities that need to be documented, please also include updates to the RST files in the docs/source directory as supporting material.

ISSUE:

#1104

CONTRIBUTORS (optional):

If others have contributed to this work aside from the PR author, list them here

@weihuang-jedi
Copy link
Contributor Author

@BrianCurtis-NOAA
I am trying to enable reg-test on cloud (AWS), and see issues for me to run reg-test on Ursa as well.
So I made some changes to make it being able to run for me (without access to role.ufs_utils directory, not in fv3-cpu account, as I am in epic project). I added a run_rt.sh, which people can put there own info (ufs_utils forked place like mine, branch to test, setup working directory of own, and use own account (not fv3-cpu)), but rt.sh should work as it is (here I won't be able to test myself).

I put this as draft, so you can take a look first, and we can iterate multiple time before you decide to check it in.

There are few failures on AWS, which I want to hear from you how I can fix them.

If i run a script:
#!/bin/bash

set -x

WORK_DIR=/contrib/Wei.Huang/dev/UFS_UTILS
for dir in regrid_sfc weight_gen ocnice_prep cpld_gridgen chgres_cube grid_gen global_cycle ice_blend snow2mdl
do
if [[ -f ${dir}/summary.log ]]
then
if grep -i "FAILED" ${dir}/summary.log
then
success=false
echo "${dir} consistency tests FAILED" >> ${WORK_DIR}/reg_test_results.txt
else
echo "${dir} consistency tests PASSED" >> ${WORK_DIR}/reg_test_results.txt
fi
else
echo "${dir} did not finish" >> ${WORK_DIR}/reg_test_results.txt
fi
done

I got:

Started on weihuang-awsepicweic7i2xlarge-00033-mgmt
regrid_sfc consistency tests PASSED
weight_gen consistency tests PASSED
ocnice_prep did not finish
cpld_gridgen did not finish
chgres_cube did not finish
grid_gen did not finish
global_cycle consistency tests PASSED
ice_blend consistency tests FAILED
snow2mdl consistency tests PASSED

so here can safely assume regrid_sfc weight_gen, global_cycle, and snow2mld are good.
ice_blend FAILED.
Need to check ocnice_prep, cpld_gridgen, chgres_cube, and grid_gen.

ocnice_prep has:
[Wei.Huang@awsepicweic7i2xlarge-33 ocnice_prep]$ tail RegressionTests_noaacloud.intel.log

Working dir = /contrib/Wei.Huang/dev/UFS_UTILS/reg-tests/OCNICE_PREP/rt_415857/025_ocean_100
Baseline dir = /contrib/ufs_utils/reg_tests/ocnice_prep/baseline_data/025_ocean_100

Checking test 025_ocean_100 results ....
Comparing ocean.mx100.nc........OK

Elapsed time: 00h:06m:39s. Have a nice day!

REGRESSION TEST WAS SUCCESSFUL

Did not figure out how to check chgres_cube yet.

Please let me know if you need more info.

Thanks,

Wei

So I think it is good.

cpld_gridgen has:
[Wei.Huang@awsepicweic7i2xlarge-33 cpld_gridgen]$ cat summary.log
025 failed
050 failed
100 failed
500 failed
900 failed

which is not good.
grid_gen has:

Wei.Huang@awsepicweic7i2xlarge-33 grid_gen]$ grep " TEST " consistency.log0*
consistency.log01:<<< C96 UNIFORM TEST FAILED. >>>
consistency.log02:<<< C96 VIIRS BNU TEST FAILED. >>>
consistency.log03:<<< GFDL REGIONAL TEST FAILED. >>>
consistency.log04:<<< ESG REGIONAL TEST FAILED. >>>
consistency.log05:<<< ESG REGIONAL PERCENT CATEGORY TEST FAILED. >>>
consistency.log06:<<< REGIONAL 12 THREAD GSL GWD TEST FAILED. <<<
consistency.log07:<<< REGIONAL 24 THREAD GSL GWD TEST FAILED. >>>

@weihuang-jedi
Copy link
Contributor Author

On Ursa, I have results:

Date: Thu, 18 Sep 2025 14:07:05 +0000
Subject: UFS_UTILS Consistency Tests PASSED on ursa
Started on ufe04
Commit hash:

regrid_sfc consistency tests PASSED
weight_gen consistency tests PASSED
ocnice_prep consistency tests PASSED
cpld_gridgen consistency tests PASSED
chgres_cube consistency tests PASSED
grid_gen consistency tests PASSED
global_cycle consistency tests PASSED
ice_blend consistency tests PASSED
snow2mdl consistency tests PASSED

@BrianCurtis-NOAA
Copy link
Collaborator

There are parts of this that #1090 has to make reg tests faster. I'll probably focus on getting that in before this, as that should make it easier to run on multiple machines from one file.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants