Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update Orion site config after Rocky9 transition (for develop) #1174

Merged

Conversation

climbfuji
Copy link
Collaborator

Summary

All in the title. This is nearly the same as #1171, but for develop instead of release/1.7.0.

Testing

Built the unified environment on Orion with Intel.

Applications affected

n/a

Systems affected

Orion

Dependencies

n/a

Issue(s) addressed

Working towards #981

Checklist

  • This PR addresses one issue/problem/enhancement, or has a very good reason for not doing so.
  • These changes have been tested on the affected systems and applications.
  • All dependency PRs/issues have been resolved and this PR can be merged.

@climbfuji climbfuji marked this pull request as ready for review July 1, 2024 16:30
@climbfuji climbfuji self-assigned this Jul 1, 2024
@climbfuji
Copy link
Collaborator Author

Tagging @ulmononian for EPIC, since @RatkoVasic-NOAA is on leave

@AlexanderRichert-NOAA
Copy link
Collaborator

I'm still not sure I understand mixing our own gcc 12 with the system one. Why not either take out the module load or use the executables from the module?

@climbfuji
Copy link
Collaborator Author

I'm still not sure I understand mixing our own gcc 12 with the system one. Why not either take out the module load or use the executables from the module?

I am only using what @RatkoVasic-NOAA prepared. Ratko built gcc@12.2.0 in the role-epic space, and openmpi@4.1.6 with that compiler.

The instructions for Orion are:

   module purge
   module use /work/noaa/epic/role-epic/spack-stack/orion/modulefiles

(and module load ecflow/5.8.4, but we should really remove all of these since they aren't needed).

If you follow that, you get:

------------------------------------------------------------------------------------------ /work/noaa/epic/role-epic/spack-stack/orion/modulefiles -------------------------------------------------------------------------------------------
   ecflow/5.8.4    gcc/12.2.0 (D)    miniconda/3.9.7    mvapich2/2.3.7    mysql/8.0.31    openmpi/4.1.6 (D)

The problem was that the sysadmin's gcc installation (with modulefile with the same name ...) was broken when Ratko started.

@AlexanderRichert-NOAA
Copy link
Collaborator

Does openmpi need to be more recent than the 4.1.4 available through the system, or can we just use gcc and openmpi from the system modules?

@climbfuji
Copy link
Collaborator Author

Same story, openmpi was not functional (just like gcc) when Ratko built spack-stack. Not sure if the sysadmins fixed all of these by now.

@climbfuji
Copy link
Collaborator Author

@AlexanderRichert-NOAA Feel free to take this PR and switch the compiler and OpenMPI; one issue though is that the 1.7.0 spack-stack installs now use the gcc/12.2.0 modulefile that Ratko created; I am not sure if things break if we remove/rename that file (which we will have to in order to still be able to access the ecflow modulefile but use the gcc/12.2.0 module from the sysadmins; UNLESS we do a module use --append on Orion - but that would make this the only system, not a good idea either).

Either way, I don't think I have a lot of time to spend on Orion this and next week before I go on leave for three weeks.

@AlexanderRichert-NOAA
Copy link
Collaborator

I'm testing out system gcc and openmpi with 1.5.1, I'll see how things look and judge from there.

@ulmononian
Copy link
Collaborator

@AlexanderRichert-NOAA @climbfuji from what i understand, the sysadmin gcc/openmpi was not functional for various release versions of the stack, so ratko built them in the role epic space. using these, he was able to successfully install 1.7.0 and had moved onto 1.6.0. im not sure if anything has been fixed on the sysadmin side for gcc/openmpi. for develop it may be ok to change compiler/mpi, but should make sure to keep consistency for the release versions and what was installed.

@AlexanderRichert-NOAA
Copy link
Collaborator

See climbfuji#8

@climbfuji
Copy link
Collaborator Author

@climbfuji climbfuji enabled auto-merge (squash) July 3, 2024 23:01
@climbfuji climbfuji merged commit 4bd652d into JCSDA:develop Jul 3, 2024
7 checks passed
@climbfuji climbfuji deleted the feature/orion_rocky9_spackstack_dev branch July 3, 2024 23:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
No open projects
Development

Successfully merging this pull request may close these issues.

3 participants