-
Notifications
You must be signed in to change notification settings - Fork 121
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[develop] Integrate Unified Workflow templater tool #793
[develop] Integrate Unified Workflow templater tool #793
Conversation
All files need YAML suffixes. Call to templater CLI needs full path.
It appears that the functional test is failing because data was removed from AWS. This date only goes out to 17 z files now. Thoughts on how to "fix" this failure? Switch to a different date that does have data? |
@chan-hoo @EdwardSnyder-NOAA Have I addressed your comments sufficiently, or are there still outstanding issues I can work to resolve? |
@christinaholtNOAA Similar to what was done for the task-based |
@christinaholtNOAA, I think you missed my comment above:
Traceback (most recent call last):
|
Yes, you have. Approving on my end. |
@chan-hoo I pushed a change this morning to hopefully address the failure you were seeing. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@christinaholtNOAA These changes look good to me! I was able to clone your branch on Tier-1 platforms (except for Gaea, as the SRW App won't compile or run following the system upgrade) and run the WE2E fundamental tests.
I did note that loading the regional_workflow
conda environment and attempting to run the fundamental tests would cause some of the tests to fail. Loading the workflow_tools
conda environment, all tests seem to pass successfully. I would recommend that the modulefiles/wflow_*.lua files be updated to replace regional_workflow
with workflow_tools
(similar to the change made in ush/load_modules_wflow.sh
).
Please note that the grid_RRFS_CONUS_25km_ics_NAM_lbcs_NAM_suite_GFS_v16
fundamental test is failing on Cheyenne GNU. Running the test on the current HEAD of develop shows that the test is failing there as well, so I won't hold this PR up for this reason (issue #817 has been opened to document this failure).
@MichaelLueken I will try to get to that change shortly. Thanks for the suggestion. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@christinaholtNOAA In addition to changing the regional_workflow
to workflow_tools
in the wflow_*.lua
modulefiles, please change modulefiles/tasks/jet/run_vx.local.lua
to use:
load("python_srw")
The grid_RRFS_CONUS_25km_ics_NAM_lbcs_NAM_suite_GFS_v16
fundamental test is failing on Jet because the necessary conda environment isn't available (the test is failing because it is unable to find f90nml).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@christinaholtNOAA, a test run for AQM was completed successfully on wcoss2.
@christinaholtNOAA Given @chan-hoo's approval, I will move forward running the Jenkins tests on this work. The modifications to the |
@christinaholtNOAA The automated Jenkins tests encountered failures: Cheyenne GNU - the Gaea - SRW won't build following the latest maintenance on the machine. Will be turning off Gaea testing in #799 until the UFS-WM has merged the new Gaea modulefile changes. Hera Intel - the I saw that the there were two failures in the Hera GNU test - On Jet, the
I will resubmit the Jet tests and hopefully this test will pass. I will also run the Jenkins tests manually on Orion, since the machine requires git/2.28.0 to be loaded, but since git/2.28.0 isn't available on Hercules and Hercules and Orion are the same machine, the git/2.28.0 module load is constantly removed from the role-epic-ps account's .bashrc file. I will let you know if there are any failures. |
@christinaholtNOAA The manual testing of the Jenkins coverage tests on Orion successfully passed. However, the same test that failed on Jet,
The test can be found - |
@MichaelLueken I saw that the test that was failing on Jet yesterday now seems to be passing. I also saw that running that manual test on Jet passed for me yesterday. Is there any update on the status of the other failures? It seems we may want to look more into random failures for the post if we keep seeing those creep in. |
@christinaholtNOAA The failures are on Gaea (the SRW App won't compile following the maintenance on the machine that changed the available libraries on the machine), Orion (the necessary git/2.28.0 module isn't being loaded in the role-epic-ps account, causing the Jenkins pipeline to fail to clone the CCPP), and an issue on Hera GNU (the |
@christinaholtNOAA The Hera GNU tests successfully completed during the rerun, so I will now move forward with merging this PR. Additionally, it is important to note that the |
DESCRIPTION OF CHANGES:
This is the first time a Unified Workflow tool is being integrated into a UFS App. We are starting with the templater tool. It's a direct drop-in replacement for fill_jinja_template.py.
This contribution includes the following additional changes:
Type of change
TESTS CONDUCTED:
DEPENDENCIES:
PR 237 in workflow-tools: ufs-community/uwtools#237 (This has been merged and the hash updated in Externals.cfg)
For the linter to pass, we'll also need PR #788 to be merged in.
DOCUMENTATION:
A small update was made.
ISSUE:
CHECKLIST
LABELS (optional):
A Code Manager needs to add the following labels to this PR:
CONTRIBUTORS (optional):
@fgabelmannjr @venitahagerty @WeirAE @elcarpenter @j-derrico