Commit
Offload mods for crusher development
o Modified CMake scripts:
  - Use BML_OMP_OFFLOAD for both NVIDIA and AMD, needed due to commit #8a7df493
  - Use FindCUDAToolkit module instead of deprecated FindCUDA
  - Update to CMake version 3.17, to support FindCUDAToolkit
  - Consolidated the logic for CUDA, HIP, and associated libraries for various types of device builds under control of BML_USE_DEVICE
  - Added BML_OFFLOAD_ARCH with options NVIDIA and AMD
o Change crusher and spock build scripts accordingly
o Modified offload regions to address bml_multiply_x2() Fortran test failure (hang)
  - Move temporary working arrays all_ix, all_jx, and all_x from stack to heap
  - This eliminated the hang, although it's not really clear why
  - Similar changes made to other add, multiply offload regions
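The stack-to-heap change to the temporary working arrays can be pictured with a minimal sketch. This is not the BML code: `row_sums` is a hypothetical kernel, and its single scratch buffer `all_x` only borrows the name of one of the arrays mentioned above. It assumes the original arrays were automatic (stack) arrays, replaces that with a `calloc` buffer, and maps the buffer to the device explicitly. Without OpenMP enabled, the pragma is ignored and the loop runs on the host.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical kernel (not from BML): sums each row of a dense n x m
 * matrix, staging every row in a per-row slice of a scratch buffer. */
static int row_sums(const double *a, double *out, int n, int m)
{
    assert(n > 0 && m > 0);

    /* Before (assumed): `double all_x[n * m];` as an automatic array.
     * After: the same scratch space allocated on the heap. */
    double *all_x = calloc((size_t)n * (size_t)m, sizeof *all_x);
    if (all_x == NULL)
        return -1;

    /* The heap buffer is mapped explicitly; map(alloc:) is enough because
     * every element is written on the device before it is read. */
#pragma omp target teams distribute parallel for \
    map(to: a[0:n*m]) map(alloc: all_x[0:n*m]) map(from: out[0:n])
    for (int i = 0; i < n; i++) {
        double *x = &all_x[i * m];   /* this row's slice of the scratch buffer */
        double s = 0.0;
        for (int j = 0; j < m; j++) {
            x[j] = a[i * m + j];
            s += x[j];
        }
        out[i] = s;
    }

    free(all_x);
    return 0;
}
```

The sketch shows only the shape of the change. It does not explain the hang, which the commit message itself leaves open.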
Showing 7 changed files with 118 additions and 107 deletions.
@@ -1,7 +1,7 @@
 #!/bin/bash

 module load craype-accel-amd-gfx908
-module load rocm/4.1.0
+module load rocm
 module load cmake
 export LD_LIBRARY_PATH="$CRAY_LD_LIBRARY_PATH:$LD_LIBRARY_PATH"
 module load openblas