For pm-cpu/pm-gpu, update some module versions to current machine defaults #5533
Does not impact compiler.
I tested against baselines on pm-cpu with e3sm_integration, and have been running larger cases with these module versions in a scream repo (to test pm-gpu). Example of some of the changes:
There were some TPUT failures when I ran again vs. baselines, but it looks fine to me: these are just comparisons of two very fast cases. Would I need to bless the TPUT fails? Or increase the TPUT tolerance for the machine?
…5533) Minor version increases for several modules on pm-cpu/pm-gpu.
Does not impact compiler versions.
Motivation is to keep up-to-date with machine defaults.
Do not see any measurable performance changes.
cray-mpich/8.1.22 -> cray-mpich/8.1.24
cray-hdf5-parallel/1.12.2.1 -> cray-hdf5-parallel/1.12.2.3
cray-netcdf-hdf5parallel/4.9.0.1 -> cray-netcdf-hdf5parallel/4.9.0.3
cray-parallel-netcdf/1.12.3.1 -> cray-parallel-netcdf/1.12.3.3
cmake/3.22.0 -> cmake/3.24.3
Added specific version numbers for craype and cray-libsci to reduce surprises when the default version is changed (these were already using default).
Also added a couple of modules to remove just in case they are loaded.
Updating alvarez the same way, but it may be that the machine goes away.
Fixes #5525
[bfb]
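To make the "minor version increases" claim easy to check, here is a small Python sketch that takes the version pairs listed above and reports which dotted component changed first. The helper names (`first_changed_component`, `major_unchanged`) are hypothetical and are not part of E3SM or CIME; only the version strings come from this PR.

```python
# Version bumps exactly as listed in the PR description.
MODULE_UPDATES = {
    "cray-mpich": ("8.1.22", "8.1.24"),
    "cray-hdf5-parallel": ("1.12.2.1", "1.12.2.3"),
    "cray-netcdf-hdf5parallel": ("4.9.0.1", "4.9.0.3"),
    "cray-parallel-netcdf": ("1.12.3.1", "1.12.3.3"),
    "cmake": ("3.22.0", "3.24.3"),
}


def first_changed_component(old, new):
    """Return the 0-based index of the first dotted component that differs.

    Returns None when the two version strings are identical.
    Assumes purely numeric, dot-separated versions (true for the list above).
    """
    old_parts = [int(p) for p in old.split(".")]
    new_parts = [int(p) for p in new.split(".")]
    for index, (a, b) in enumerate(zip(old_parts, new_parts)):
        if a != b:
            return index
    if len(old_parts) != len(new_parts):
        # One version is a prefix of the other, e.g. 1.2 vs 1.2.1.
        return min(len(old_parts), len(new_parts))
    return None


def major_unchanged(old, new):
    """True when the leading (major) component is the same in both versions."""
    return first_changed_component(old, new) != 0


if __name__ == "__main__":
    for name, (old, new) in MODULE_UPDATES.items():
        index = first_changed_component(old, new)
        print(f"{name}/{old} -> {name}/{new}: first change at component {index}")
```

None of the listed updates changes the leading version component, which is consistent with the description of these as minor increases.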
Merged to next.

The test runs are way too short to actually see any performance differences between before/after. In my testing, some tests report larger than 10% TPUT differences, but those cases seem fine and I have not seen any issues with larger/longer cases.

Actually, I forgot that NERSC currently has an env variable set (

Looks like all the GNU tests passed on cdash. Manually checked nvidia developer: completed as expected.
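For readers unfamiliar with why very short runs trip a throughput check, the following Python sketch shows a relative-tolerance comparison of the kind discussed in this thread. It is not CIME's implementation: the function name `tput_regressed`, the 10% default, the assumption that only slowdowns count as failures, and the sample numbers are all assumptions for illustration.

```python
def tput_regressed(baseline, current, tolerance=0.10):
    """Return True when throughput dropped by more than `tolerance`.

    `baseline` and `current` are throughput values in the same units
    (for example, simulated years per day). The 10% default mirrors the
    figure mentioned in the thread and is an assumption here.
    """
    if baseline <= 0:
        raise ValueError("baseline throughput must be positive")
    relative_drop = (baseline - current) / baseline
    return relative_drop > tolerance


if __name__ == "__main__":
    # Hypothetical numbers: a very short case where run-to-run noise is large.
    print(tput_regressed(100.0, 85.0))   # 15% slower than baseline
    print(tput_regressed(100.0, 95.0))   # 5% slower than baseline
    print(tput_regressed(100.0, 120.0))  # faster than baseline
```

With a check like this, a case that finishes in seconds can exceed the tolerance from timing noise alone, which is why longer cases are a better guide to real performance changes.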