Offload mods for crusher development #633

mewall · 2022-06-23T18:48:18Z

o Modified CMake scripts:
- Use BML_OMP_OFFLOAD for both NVIDIA and AMD,
needed due to commit #8a7df493
- Use FindCUDAToolkit module instead of depracated FindCUDA
- Update to CMake 3.17 version, to support FindCUDAToolkit
- Consolidated the logic for CUDA, HIP, and associated libraries
for various types of device builds under control of BML_USE_DEVICE
- Added BML_OFFLOAD_ARCH with options NVIDIA and AMD

o Change crusher build script to use new CMakeLists.txt and build.sh

o Modified offload regions to address bml_multiply_x2()
fortran test failure (hang)
- Move temporary working arrays all_ix, all_jx, and all_x from stack to heap
- This eliminated the hang, although it's not really clear why
- Similar changes made to other add, multiply offload regions

jeanlucf22

Does any other build script need to be adapted?

CMakeLists.txt

o Modified CMake scripts: - Use BML_OMP_OFFLOAD for both NVIDIA and AMD, needed due to commit #8a7df493 - Use FindCUDAToolkit module instead of depracated FindCUDA - Update to CMake 3.17 version, to support FindCUDAToolkit - Consolidated the logic for CUDA, HIP, and associated libraries for various types of device builds under control of BML_USE_DEVICE - Added BML_OFFLOAD_ARCH with options NVIDIA and AMD o Change crusher and spock build scripts accordingly o Modified offload regions to address bml_multiply_x2() fortran test failure (hang) - Move temporary working arrays all_ix, all_jx, and all_x from stack to heap - This eliminated the hang, although it's not really clear why - Similar changes made to other add, multiply offload regions

CMakeLists.txt

mewall · 2022-06-27T22:19:35Z

The FindCUDAToolkit docs say it was new in version 3.17 https://cmake.org/cmake/help/latest/module/FindCUDAToolkit.html

________________________________ From: Nicolas Bock ***@***.***> Sent: Saturday, June 25, 2022 12:30:02 PM To: lanl/bml Cc: Wall, Michael E; Author Subject: [EXTERNAL] Re: [lanl/bml] Offload mods for crusher development (PR #633) @nicolasbock requested changes on this pull request.

________________________________ In CMakeLists.txt<https://urldefense.com/v3/__https://github.com/lanl/bml/pull/633*discussion_r906711640__;Iw!!Bt8fGhp8LhKGRg!CqQXHiO_SjhVAuzlovwcVIwHeXa6KK61aZ4mD6SsIL4-WVlUv4vDA6SlramrBJOFOjQewjdOphVJCw1aifrDVAL6$>:

@@ -1,4 +1,4 @@

-cmake_minimum_required(VERSION 3.10) +cmake_minimum_required(VERSION 3.17) Why do we need to raise the minimum version here? — Reply to this email directly, view it on GitHub<https://urldefense.com/v3/__https://github.com/lanl/bml/pull/633*pullrequestreview-1019342770__;Iw!!Bt8fGhp8LhKGRg!CqQXHiO_SjhVAuzlovwcVIwHeXa6KK61aZ4mD6SsIL4-WVlUv4vDA6SlramrBJOFOjQewjdOphVJCw1aiUfEOaqV$>, or unsubscribe<https://urldefense.com/v3/__https://github.com/notifications/unsubscribe-auth/AA67VEOEUUQIIMXD6DIPT63VQ5F2VANCNFSM5ZVKQINQ__;!!Bt8fGhp8LhKGRg!CqQXHiO_SjhVAuzlovwcVIwHeXa6KK61aZ4mD6SsIL4-WVlUv4vDA6SlramrBJOFOjQewjdOphVJCw1aicfIg2YG$>. You are receiving this because you authored the thread.Message ID: ***@***.***>

nicolasbock · 2022-06-28T05:41:59Z

Thanks @mewall !

mewall requested review from nicolasbock, cnegre, suemni, jeanlucf22, tokshgithub and jmohdyusof as code owners June 23, 2022 18:48

jeanlucf22 reviewed Jun 23, 2022

View reviewed changes

CMakeLists.txt Show resolved Hide resolved

CMakeLists.txt Show resolved Hide resolved

CMakeLists.txt Show resolved Hide resolved

mewall force-pushed the offload_mods branch from d644a95 to 084af4b Compare June 23, 2022 19:07

suemni approved these changes Jun 23, 2022

View reviewed changes

mewall force-pushed the offload_mods branch 5 times, most recently from dd9b44d to 7572bec Compare June 23, 2022 20:43

jeanlucf22 approved these changes Jun 23, 2022

View reviewed changes

mewall force-pushed the offload_mods branch from 7572bec to 498a969 Compare June 24, 2022 14:09

nicolasbock requested changes Jun 25, 2022

View reviewed changes

CMakeLists.txt Show resolved Hide resolved

nicolasbock approved these changes Jun 28, 2022

View reviewed changes

nicolasbock merged commit b4d863a into master Jun 28, 2022

nicolasbock deleted the offload_mods branch June 28, 2022 05:42

mewall mentioned this pull request Jul 27, 2022

FindCUDA is deprecated in cmake #546

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Offload mods for crusher development #633

Offload mods for crusher development #633

mewall commented Jun 23, 2022

jeanlucf22 left a comment

mewall commented Jun 27, 2022 via email

nicolasbock commented Jun 28, 2022

Offload mods for crusher development #633

Offload mods for crusher development #633

Conversation

mewall commented Jun 23, 2022

jeanlucf22 left a comment

Choose a reason for hiding this comment

mewall commented Jun 27, 2022 via email

nicolasbock commented Jun 28, 2022