-
Notifications
You must be signed in to change notification settings - Fork 119
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[develop] Update modulefiles to use hdf/1.14.0 and netcdf/4.9.2-based software stacks on Tier 1 systems #889
[develop] Update modulefiles to use hdf/1.14.0 and netcdf/4.9.2-based software stacks on Tier 1 systems #889
Conversation
…weather-app into update_modulefiles2
Fundamental tests ran on Hera (intel, gnu), Orion, Jet, Gaea.
After rerunning, it completed successfully:
Still looking into random failures in Jet testing. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@natalie-perlin - Unfortunately, the Jenkins tests all failed at the Build
phase when the Jenkins tests were submitted on Friday. The failure is due to the Jenkins build still building the GSI
and rrfs_utils
. Without ncdiag
, neither the GSI
nor rrfs_utils
can build, which is what caused the failure. Please uncomment line 30 for now and ncdiag
will be removed as part of @christinaholtNOAA's PR #893. Thanks!
modulefiles/srw_common.lua
Outdated
|
||
load("ncdiag/1.1.1") | ||
--load("ncdiag/1.1.1") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
keep ncdiag/1.1.1
update miniconda3 location in new role-epic space
update miniconda3 location in new role-epic space
Thanks for updating the modulefiles, @natalie-perlin! Resubmitting the Jenkins tests now. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@natalie-perlin - The Gaea tests are failing to build the GSI due to lines 19-24 in build_gaea_intel.lua
having been removed. Please add these lines back so that the GSI and rrfs_utils can build on the machine. I'll queue up the Gaea tests for this PR once completed (they will run once the rest of the tests complete). Thanks!
Thanks, @natalie-perlin! I've added the Gaea test to the queue. Once the current tests complete, the Gaea test will be resubmitted. |
@natalie-perlin - The Orion automated WE2E coverage test,
This failure is occurring in the Jenkins tests and manually running the WE2E coverage tests on Orion. Please see |
Manual runs of the WE2E coverage tests have successfully passed on Hera Intel:
|
@MichaelLueken - |
@natalie-perlin - Thanks! I've queued up the Orion tests in Jenkins. The Gaea tests continue to fail for the
You can see the cmake output log - |
I've completed a quick run of the WE2E coverage tests on Orion and they all successfully pass now:
We'll see if the Jenkins tests also succeed. |
@natalie-perlin - The Orion Jenkins tests successfully passed as well! The only issue with this PR now is that the |
@MichaelLueken @natalie-perlin Why are we spending time and resources to fix a problem with GSI on a single platform, given that PR #893 is removing that capability entirely? |
@mkavulich Fair point. Making this PR dependent on PR #893 makes sense (especially since the only failure now is with the GSI/rrfs_utils on Gaea) and relaunching the Gaea test once #893 has been merged. @natalie-perlin No changes are required now. Once PR #893 has been merged, you will likely need to merge develop into your branch, but then I should be able to resubmit the Gaea tests without issue. |
Thanks @MichaelLueken, and sorry for the terse tone in my original message, this was posted before I had finished my coffee 😄 |
@MichaelLueken @mkavulich - |
@natalie-perlin - PR #893 has been successfully merged. As expected, there is now a conflict in Additionally, should lines 16-19 (possibly 20) also be removed from Thanks! |
@MichaelLueken - updated wflow_gaea.lua and merged with the updated develop branch |
@natalie-perlin - Thanks! The Gaea tests have been submitted. Once complete, I will merge this PR. |
The Jenkins WE2E coverage tests on Gaea have successfully passed! Merging now. |
DESCRIPTION OF CHANGES:
Type of change
TESTS CONDUCTED:
UPDATE: Fundamental tests pass on all EPIC-accessible Tier-1 platforms (Hera, Gaea Intel/Gnu, Jet, Orion).
To mitigate data transfer tasks failures, increased walltime request and maxtries=2.
DEPENDENCIES:
This PR follows UFS-WM advance to higher-version software modules, allowing fully coupled runs (S2SWA), ufs-community/ufs-weather-model#1745
DOCUMENTATION:
ISSUE:
This PR follows UFS-WM advance to higher-version software modules, allowing fully coupled runs (S2SWA), ufs-community/ufs-weather-model#1745
CHECKLIST
Subsequent updates to spack-stack expected soon that require more documentation updates
LABELS (optional):
A Code Manager needs to add the following labels to this PR:
CONTRIBUTORS (optional):