Unit tests failing on NOAA Acorn #3048
Comments
@AlexanderRichert-NOAA Please let us know which compiler, MPI stack, and version of ESMF you are using. A quick investigation shows that the code is failing at an ALLOCATE statement, probably a zero-sized allocation, since case12 intentionally uses a coarse grid that results in 0 DEs on some PETs. Often case12 will fail because the test environment does not support 216 PETs, but the error would be different, and this would not explain the issue with case24.
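To illustrate the "0 DEs on some PETs" situation, here is a minimal sketch in Python. It assumes a simple 1-D block decomposition, which is not necessarily what ESMF or MAPL does for case12, and the cell count of 180 is a hypothetical number chosen only to be smaller than the 216 PETs mentioned above. The function name `block_counts` is likewise made up for this example.

```python
def block_counts(n_cells, n_pets):
    """Cells owned by each PET under a simple 1-D block decomposition.

    Illustrative only: this is an assumed distribution scheme, not the
    actual ESMF/MAPL algorithm.
    """
    base, extra = divmod(n_cells, n_pets)
    # The first `extra` PETs take one additional cell each.
    return [base + 1 if pet < extra else base for pet in range(n_pets)]


# Hypothetical coarse grid: fewer cells (180) than PETs (216).
counts = block_counts(n_cells=180, n_pets=216)
empty_pets = [pet for pet, n in enumerate(counts) if n == 0]

print(f"{len(empty_pets)} of {len(counts)} PETs own no cells")
```

Any PET in `empty_pets` would request a zero-sized local array, which is the kind of allocation suspected here.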
@AlexanderRichert-NOAA Yeah, as @tclune says, case12 is one we don't regularly run because it uses 216 processes. As for Case 24, I have seen that have issues with Now, I do see you might be building with Intel 19:
if so... I'm not sure MAPL has been built with that in a looooong time by us. I am honestly impressed more tests didn't fail if that was the ifort version.
@tclune I'm using Intel Classic 19.1.3.304 (with Cray wrappers), Cray MPICH 8.1.9, and ESMF 8.6.1. If I can work through some issues of it not finding mpirun/mpiexec I can try with another compiler version...
Forgot to put this here, but for testing doing either Now case24 will still be part of this, but at least big momma case12 will be avoided 😄 ETA: Note that
Well, I was able to build Baselibs with Intel 19 as well as build MAPL2 with it (which surprised me!). However, while the build might work, it was NOT happy with our unit tests:
I'm not sure that compiler and the associated MPI stack like our system or network anymore. Indeed, for me it looks like Case 24 does run, but then goes nuts at Finalize:
I looked with @bena-nasa and we do call
This issue has been automatically marked as stale because it has not had activity in the last 60 days. If there are no updates within 7 days, it will be closed. You can add the ":hourglass: Long Term" label to prevent the stale action from closing this issue.
I'm trying to run MAPL unit tests via Spack installation on Acorn (WCOSS2 TDS). I'm happy to provide whatever details are helpful; for now I'll upload the CTest log. It's failing on tests 12 and 24, for both the 1g and 2g cases. I've tried it with 2.46.2, 2.47.2, and head of develop (e600653).
mapl_acorn_LastTest.log