
Tpetra: CMake logic for detecting GPU-aware MPI only works for OpenMPI variants #12468

Closed
jhux2 opened this issue Nov 1, 2023 · 7 comments
Labels: pkg: Tpetra, type: bug (the primary issue is a bug in Trilinos code or tests)

Comments

jhux2 (Member) commented Nov 1, 2023

Bug Report

The Tpetra CMake logic for setting Tpetra_ASSUME_GPU_AWARE_MPI assumes the existence of the ompi_info executable, which is specific to OpenMPI.

Description

Frontier uses an MPICH variant, so Tpetra incorrectly sets Tpetra_ASSUME_GPU_AWARE_MPI to FALSE (if the option isn't explicitly set on the command line).
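
For reference, the detection amounts to roughly the following. This is a simplified, hypothetical sketch of an ompi_info-based probe, not the exact code in Tpetra's CMake files:

    # Hypothetical sketch of the OpenMPI-only probe (not the exact Tpetra code).
    # ompi_info only exists for OpenMPI, so on an MPICH-based stack the probe
    # finds nothing and the default silently becomes OFF.
    find_program(OMPI_INFO_EXEC ompi_info)
    set(Tpetra_ASSUME_GPU_AWARE_MPI_DEFAULT OFF)
    if (OMPI_INFO_EXEC)
      execute_process(COMMAND ${OMPI_INFO_EXEC} --parsable --all
                      OUTPUT_VARIABLE ompiInfoOutput ERROR_QUIET)
      # OpenMPI advertises CUDA support via the mpi_built_with_cuda_support parameter.
      if (ompiInfoOutput MATCHES "mpi_built_with_cuda_support:value:true")
        set(Tpetra_ASSUME_GPU_AWARE_MPI_DEFAULT ON)
      endif()
    endif()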

jhux2 added the type: bug and pkg: Tpetra labels on Nov 1, 2023
jhux2 (Member, Author) commented Nov 1, 2023

Are there currently GPU architectures where the MPI is not GPU aware?

Should the default be to assume GPU-aware MPI?

csiefer2 (Member) commented Nov 1, 2023

As discussed offline: assuming GPU-aware MPI by default is reasonable. Consider removing the OpenMPI-specific special sauce.
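
A minimal sketch of what that could look like (hypothetical; variable names are illustrative, and Kokkos_ENABLE_CUDA / Kokkos_ENABLE_HIP are assumed to be visible at this point in the configure):

    # Hypothetical sketch: drop the ompi_info probe and assume GPU-aware MPI
    # by default whenever a GPU backend is enabled.
    if (Kokkos_ENABLE_CUDA OR Kokkos_ENABLE_HIP)
      set(Tpetra_ASSUME_GPU_AWARE_MPI_DEFAULT ON)
    else()
      set(Tpetra_ASSUME_GPU_AWARE_MPI_DEFAULT OFF)
    endif()
    set(Tpetra_ASSUME_GPU_AWARE_MPI ${Tpetra_ASSUME_GPU_AWARE_MPI_DEFAULT}
        CACHE BOOL "Assume the MPI implementation is GPU aware")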

jhux2 (Member, Author) commented Nov 1, 2023

I assume we'd need to deprecate this?

csiefer2 (Member) commented Nov 2, 2023

@jhux2 I mean, this isn't something you can control, so you can't really deprecate it.

jhux2 (Member, Author) commented Nov 13, 2023

With the merge of #12517, Tpetra now defaults to assuming that MPI is GPU aware.

If an application is using an MPI that isn't GPU aware, the app should either configure Trilinos with

-DTpetra_ASSUME_GPU_AWARE_MPI:BOOL=FALSE

or at run time set the environment variable

export TPETRA_ASSUME_GPU_AWARE_MPI=0
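
If the configuration is driven by a CMake cache-initialization file passed with cmake -C (a common Trilinos configure pattern; the file name below is made up), the option can be set there instead of on the command line:

    # frontier-options.cmake (hypothetical file name); use as:
    #   cmake -C frontier-options.cmake <other options> <path-to-Trilinos-source>
    # Disable the GPU-aware-MPI assumption when the MPI stack is not GPU aware.
    set(Tpetra_ASSUME_GPU_AWARE_MPI FALSE CACHE BOOL "" FORCE)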

rppawlo (Contributor) commented Nov 13, 2023

Just a heads up: this change caused a lot of testing failures inside Sandia. The internal CUDA test machines don't seem to have CUDA-aware MPI installed. The failures are seg faults with no real information, so they are not easy to debug. You might want to send an email to the Trilinos lists mentioning this change.

csiefer2 (Member) commented Jun 7, 2024

@jhux2 Can we close this?

jhux2 closed this as completed on Jun 7, 2024
jhux2 added this to Tpetra on Aug 12, 2024
jhux2 moved this to Done in Tpetra on Aug 12, 2024