onnxruntime: add with_cuda option #22557
👋 On 1.15.1 with CUDA enabled we did not have this error. It might be a compiler mismatch of sorts; we use Clang 11, which was released in 2021. It would be nice if someone could verify this; maybe CCI should have a patch for it as well.
```diff
--- a/cmake/onnxruntime_providers.cmake
+++ b/cmake/onnxruntime_providers.cmake
@@ -479,7 +479,8 @@ if (onnxruntime_USE_CUDA)
     target_compile_options(onnxruntime_providers_cuda PRIVATE "$<$<COMPILE_LANGUAGE:CUDA>:SHELL:-Xcompiler /wd4127>")
   endif()
-  onnxruntime_add_include_to_target(onnxruntime_providers_cuda onnxruntime_common onnxruntime_framework onnx onnx_proto ${PROTOBUF_LIB} flatbuffers::flatbuffers)
+  onnxruntime_add_include_to_target(onnxruntime_providers_cuda onnxruntime_common onnxruntime_framework onnx_proto)
+  target_link_libraries(onnxruntime_providers_cuda PUBLIC onnx onnx_proto protobuf::protobuf flatbuffers::flatbuffers Eigen3::Eigen3)
   if (onnxruntime_ENABLE_TRAINING_OPS)
     onnxruntime_add_include_to_target(onnxruntime_providers_cuda onnxruntime_training)
     if (onnxruntime_ENABLE_TRAINING)
--- a/onnxruntime/contrib_ops/cuda/bert/attention.cc
+++ b/onnxruntime/contrib_ops/cuda/bert/attention.cc
@@ -164,7 +164,6 @@ Status Attention<T>::ComputeInternal(OpKernelContext* context) const {
       has_memory_efficient_attention(sm, sizeof(T) == 2);
 #else
   constexpr bool use_memory_efficient_attention = false;
-  ORT_UNUSED_VARIABLE(is_mask_1d_key_seq_len_start);
 #endif
   cublasHandle_t cublas = GetCublasHandle(context);
```

On Protobuf + Flatbuffers, I do not remember the exact errors, but I think their
```python
import shutil
from subprocess import check_output


def build_requirements(self):
    # Required by upstream https://github.com/microsoft/onnxruntime/blob/v1.14.1/cmake/CMakeLists.txt#L5
    self.tool_requires("cmake/[>=3.24 <4]")
    self.tool_requires("ninja/1.11.1")
    # CUDA: cannot make CUDA a Conan package; we have to rely on a pre-configured system toolkit
    if self.options.with_cuda and self.settings.os == "Linux":
        if not shutil.which("nvcc"):
            self.output.error(
                "Need CUDA toolkit! Here is how you can install it:\n"
                "  wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2004/x86_64/cuda-keyring_1.1-1_all.deb\n"
                "  dpkg -i cuda-keyring_1.1-1_all.deb\n"
                "  apt update\n"
                "  apt install cuda-toolkit-11-8"
            )
            raise RuntimeError("nvcc not found; see logs")
        output = check_output(["nvcc", "--version"]).decode()
        if "release 11" not in output:
            self.output.error(f"nvcc found but wrong version (11.8 recommended):\n\n{output}")
            raise RuntimeError("invalid nvcc version; see logs")
```

and, elsewhere in the recipe (presumably in `package()`), dropping the shared-providers stub for static builds:

```python
import os

from conan.tools.files import rm

if not self.options.shared:
    rm(
        conanfile=self,
        pattern="*onnxruntime_providers_shared*",
        folder=os.path.join(self.package_folder, "lib"),
    )
```
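For reference, this is roughly how the new option would be declared in the recipe; the defaults below are an assumption, not taken from the PR:

```python
# Sketch only: assumed declaration of the new option in the recipe.
options = {
    "shared": [True, False],
    "with_cuda": [True, False],
}
default_options = {
    "shared": False,
    "with_cuda": False,  # assumed default: CUDA stays opt-in
}
```

A consumer would then enable it with something like `conan create . -o "onnxruntime/*:with_cuda=True"` (Conan 2 syntax).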
I forgot to mention that I only tested onnxruntime 1.14.1; I guess I should try with these other versions and implement your changes if I get the same errors with my compiler. To answer some of your questions:
See https://onnxruntime.ai/docs/build/eps.html#loading-the-shared-providers; the static build + shared providers is a well-supported configuration, so I think we should not add that limitation.
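As an aside, provider loading is easy to observe from the Python API; this minimal sketch (the model path is a placeholder) shows how onnxruntime falls back to the CPU EP, with a warning, when the CUDA shared provider cannot be located next to the main library:

```python
import onnxruntime as ort

# The CUDA EP is loaded dynamically; if the onnxruntime_providers_cuda
# library cannot be found next to the main onnxruntime library, the
# session falls back to the CPU EP and logs a warning.
sess = ort.InferenceSession(
    "model.onnx",  # placeholder path
    providers=["CUDAExecutionProvider", "CPUExecutionProvider"],
)
print(sess.get_providers())  # shows which providers were actually loaded
```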
I updated the MR to be able to build 1.15.1 with CUDA. However, I cannot build 1.16.x because I only have a CUDA 11.4 configuration, which apparently is too old to work with VS2022 (required for 1.16.x). But maybe this first version is enough and further adjustments can come in another MR (the newer versions can still be built without CUDA as before)?
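If that constraint ever needs to be enforced recipe-side, a guard could look roughly like this sketch; the version cutoffs are assumptions based on the comment above, not verified limits:

```python
# Sketch only: hypothetical validate() guard encoding the constraint above.
from conan.errors import ConanInvalidConfiguration
from conan.tools.scm import Version


def validate(self):
    if self.options.with_cuda and Version(self.version) >= "1.16.0":
        # 1.16.x needs VS2022 (msvc 193); older CUDA toolkits (e.g. 11.4)
        # do not support that compiler.
        if self.settings.compiler == "msvc" and Version(str(self.settings.compiler.version)) < "193":
            raise ConanInvalidConfiguration(
                "onnxruntime >= 1.16.x with with_cuda requires VS2022 (msvc 193) "
                "and a CUDA toolkit that supports it"
            )
```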
Conan v1 pipeline ✔️ All green in build 6
Conan v2 pipeline ✔️ All green in build 6
* Add with_cuda option
* Require static registration from onnx disabled
* Add some transitive headers
* Test CUDA in the test_package
* Add 1.15.1 patch
* wip
* add comment
* wip
* wip
* wip
* remove unused patch
* better patching
* remove unused patch
* fix
* copy dlls via cmake in v1
* check if win

---------

Co-authored-by: czoido <mrgalleta@gmail.com>
Fix #22555
I started from this PR, which is currently stale: #20392, and added the following changes:

* onnxruntime_DISABLE_CONTRIB_OPS => CONTRIB_OPS seems mandatory to build the CUDA provider, so leave it as before
* replaced use_cuda by with_cuda
* test the with_cuda option in the test_package

Note that according to the onnxruntime documentation the main DLL shall be next to the provider modules: https://onnxruntime.ai/docs/build/eps.html#loading-the-shared-providers, so for this specific recipe we shall move the DLL into the bin dir.
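For illustration, a minimal sketch of how that relocation might look in the recipe's `package()` step; the file names and conditions are assumptions, not the PR's verbatim code:

```python
# Sketch only: move the main DLL next to the provider modules in bin/,
# per https://onnxruntime.ai/docs/build/eps.html#loading-the-shared-providers
import os

from conan.tools.files import rename


def package(self):
    if self.settings.os == "Windows" and self.options.shared:
        rename(
            self,
            os.path.join(self.package_folder, "lib", "onnxruntime.dll"),
            os.path.join(self.package_folder, "bin", "onnxruntime.dll"),
        )
```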
Changes not directly related to cuda:
This was tested on Linux gcc11 and Windows msvc192 with and without the option.
Assuming you have a `cuda` package from the pre-built libs provided by NVIDIA, these are the additional changes I had to do:
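The actual changes are not captured above; purely as a hypothetical illustration, consuming such a package from the recipe could look roughly like this (the package name and version are assumptions):

```python
# Hypothetical sketch only: the real changes referenced above are not shown here.
def requirements(self):
    if self.options.with_cuda:
        # Assumed name/version for a Conan package wrapping NVIDIA's pre-built CUDA libs.
        self.requires("cuda/11.8.0")
```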