-
Notifications
You must be signed in to change notification settings - Fork 122
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ubuntu 24.04 with libmpich-dev fails test_loadsave2 #304
Comments
Thanks for the report, pinging @tim-griesbach to check if there may be any unintialized value problem in loading. |
Since I do not have access to a machine running with Ubuntu (24.04) I can not reproduce the error locally. Nonetheless, I checked test_loadsave2 locally using valgrind but the program is valgrind clean on my machine. Hence, I tried to investigate the problem using the CI and I found out that the md5sum of Given my current observations the issue seems to be caused by a strange MPI behavior but I am not sure about the cause of MPICH's behavior. Due to the issue described in cburstedde/libsc#191, I can not use valgrind in the CI in combination with Ubuntu 24.04. |
What happens if we remove the gcc version numbers from the CI and use whatever is the default for ubuntu-22/24/latest? |
It seems to be fine again. In fact, test_loadsave is the one test that trips MPI I/O issues reliably. I have had this fail transiently quite a lot of times in the past for many years. Still pinging @tim-griesbach for double-checking that this does not have to do with the recent merge on saving a p4est in a more standard conforming way wrt. libc I/O. |
Yes, I double-checked the recent changes in saving a p4est. The changes do not change the md5sum of the created file and I also compared the file positions used for writing and reading and they also do not change with the more standard conforming code. Therefore, the two issues causing the failing test (cf. my report above) are not caused by the recent code changes. |
Closing as not-a-bug. |
The "test_loadsave2" test passes on Ubuntu 24.04 with OpenMPI, and on other OS (Ubuntu 22.04, macOS, etc) regardless of OpenMPI or MPICH.
However, discovered in #303 and confirmed on a laptop with Ubuntu 24.04 is that "test_loadsave2" fails with MPICH and GCC-12, GCC-13, or GCC-14
Zlib and MPICH were enabled/used.
Wondering if this is just a flaky test or a code update is needed?
The text was updated successfully, but these errors were encountered: