Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update "excl" to "exclhost" for exclusive jobs on WCOSS2 #783

Conversation

KateFriedman-NOAA
Copy link
Member

Description

This PR updates the excl PBS statement to exclhost after cgroups was implemented on WCOSS2. With cgroups implementation, memory limits are now being enforced and exclusive jobs needs to set place=exclhost to use all memory on the node(s). If you don't specify memory (and without exclhost) then you'll default to 1G/core.

The rocoto workflow_utils.py is also updated to use exclhost now for generated xmls on WCOSS2.

Type of change

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)

How Has This Been Tested?

NCO made and tested this update on WCOSS2 Dogwood after cgroups implementation. @lgannoaa also updated his ecflow test with these updates and confirmed his test runs again on Dogwood.

Refs #399

- The HOMEobsproc setting in config.base.nco.static is not used in
operations and thus not needed in this version of config.base.

Refs: NOAA-EMC#399
- Update on Dogwood implemented cgroups, which means memory limits are
now enforced.
- Exclusive jobs must now use "place=exclhost" insted of "place=excl".
- Associated exclusive ecf script PBS statements are updated to
exclhost.

Refs: NOAA-EMC#399
Need to set exclhost for exclusive jobs on WCOSS2 now after cgroups was
implemented. Matches updates to exclusive job ecf script PBS statements.

Refs: NOAA-EMC#399
Will consider removing at later date.

Refs: NOAA-EMC#399
@KateFriedman-NOAA KateFriedman-NOAA added production update Processing update in production maintenance Regular updates and maintenance work labels May 13, 2022
@KateFriedman-NOAA KateFriedman-NOAA added this to the WCOSS2 - GFSv16.2.0 milestone May 13, 2022
@KateFriedman-NOAA KateFriedman-NOAA self-assigned this May 13, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
maintenance Regular updates and maintenance work production update Processing update in production
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants