[BUG]: gnn_fraud_detection_pipeline example broken around cuml #795

Closed
2 tasks done
pdmack opened this issue Mar 27, 2023 · 2 comments
Labels: bug (Something isn't working), Needs Triage (Need team to review and classify)

Comments

@pdmack
Contributor

pdmack commented Mar 27, 2023

Version

23.03

Which installation method(s) does this occur on?

Conda, Source

Describe the bug.

Relates to #758

RAPIDS has provided workaround guidance here on using cuML 22.10:
https://docs.rapids.ai/notices/rsn0026/

In short, add the constraints libcusparse<12 and libcusolver<=11.4.1.48 to the conda solve.
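
To check whether an existing environment already respects those two pins, something like the following might help. It is only a sketch: the PINS table and the helper names (parse_version, violates, check_pins) are made up for illustration, and the comparison is a plain numeric one, not conda's full version-matching rules.

```python
import re

# Pins quoted in the workaround above; the dict layout is an illustrative assumption.
PINS = {
    "libcusparse": ("<", "12"),
    "libcusolver": ("<=", "11.4.1.48"),
}


def parse_version(text):
    """Turn '11.4.1.48' into (11, 4, 1, 48), keeping only the numeric parts."""
    return tuple(int(part) for part in re.findall(r"\d+", text))


def violates(installed, op, bound):
    """Return True if the installed version does NOT satisfy `op bound`."""
    a, b = parse_version(installed), parse_version(bound)
    # Pad the shorter tuple with zeros so '12' compares like '12.0.0.0'.
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    if op == "<":
        return not a < b
    if op == "<=":
        return not a <= b
    raise ValueError(f"unsupported operator: {op}")


def check_pins(installed_versions):
    """Return the names of pinned packages whose installed version breaks a pin."""
    return [
        name
        for name, (op, bound) in PINS.items()
        if name in installed_versions
        and violates(installed_versions[name], op, bound)
    ]
```

Feed it a mapping of package name to version string (for example, values copied by hand from `conda list`); any name it returns is a package the solve would need to downgrade.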

Minimum reproducible example

Follow the instructions here:
https://github.com/nv-morpheus/Morpheus/tree/branch-23.03/examples/gnn_fraud_detection_pipeline#readme

Relevant log output

Traceback (most recent call last):
  File "/opt/miniconda3/envs/morpheus/lib/python3.8/site-packages/cupy/__init__.py", line 16, in <module>
    from cupy import _core  # NOQA
  File "/opt/miniconda3/envs/morpheus/lib/python3.8/site-packages/cupy/_core/__init__.py", line 1, in <module>
    from cupy._core import core  # NOQA
  File "cupy/_core/core.pyx", line 1, in init cupy._core.core
  File "/opt/miniconda3/envs/morpheus/lib/python3.8/site-packages/cupy/cuda/__init__.py", line 8, in <module>
    from cupy.cuda import compiler  # NOQA
  File "/opt/miniconda3/envs/morpheus/lib/python3.8/site-packages/cupy/cuda/compiler.py", line 11, in <module>
    from cupy.cuda import device
  File "cupy/cuda/device.pyx", line 9, in init cupy.cuda.device
ImportError: libnvJitLink.so.12: cannot open shared object file: No such file or directory

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "run.py", line 23, in <module>
    from morpheus.pipeline.linear_pipeline import LinearPipeline
  File "/home/pmackinnon/github/morpheus/morpheus/pipeline/__init__.py", line 27, in <module>
    from morpheus.pipeline.multi_message_stage import MultiMessageStage
  File "/home/pmackinnon/github/morpheus/morpheus/pipeline/multi_message_stage.py", line 22, in <module>
    from morpheus.messages import MultiMessage
  File "/home/pmackinnon/github/morpheus/morpheus/messages/__init__.py", line 21, in <module>
    from morpheus.messages.memory.tensor_memory import TensorMemory
  File "/home/pmackinnon/github/morpheus/morpheus/messages/memory/tensor_memory.py", line 19, in <module>
    import cupy as cp
  File "/opt/miniconda3/envs/morpheus/lib/python3.8/site-packages/cupy/__init__.py", line 37, in <module>
    raise ImportError(_msg) from e
ImportError: CuPy is not correctly installed.

If you are using wheel distribution (cupy-cudaXX), make sure that the version of CuPy you installed matches with the version of CUDA on your host.
Also, confirm that only one CuPy package is installed:
  $ pip freeze

If you are building CuPy from source, please check your environment, uninstall CuPy and reinstall it with:
  $ pip install cupy --no-cache-dir -vvvv

Check the Installation Guide for details:
  https://docs.cupy.dev/en/latest/install.html

original error: libnvJitLink.so.12: cannot open shared object file: No such file or directory
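
The root failure in the log is the dynamic loader not finding libnvJitLink.so.12. A quick way to confirm that independently of CuPy could look like this; it is a minimal sketch using only the standard-library ctypes module, and the helper name can_load is made up for illustration.

```python
import ctypes


def can_load(library_name):
    """Return True if the dynamic loader can open library_name, else False."""
    try:
        ctypes.CDLL(library_name)
    except OSError:
        # Raised when the shared object is not on the loader search path
        # (or cannot be opened for another reason).
        return False
    return True
```

Calling can_load("libnvJitLink.so.12") inside the morpheus environment should return False while the problem is present, and True once an environment that provides the library (or no longer needs it) is in place.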

Full env printout

No response

Other/Misc.

cuml 22.08.00 cuda11_py38_g1e2f8a9aa_0 rapidsai
cupy 9.5.0 py38h7818112_1 conda-forge
libcuml 22.08.00 cuda11_g1e2f8a9aa_0 rapidsai
libcumlprims 22.08.00 cuda11_g1770e60_0 nvidia

Code of Conduct

  • I agree to follow Morpheus' Code of Conduct
  • I have searched the open bugs and have found no duplicates for this bug report
pdmack added the bug, Needs Triage, and Priority 0 labels Mar 27, 2023
@dagardner-nv
Contributor

Related to issues: #764 and #741

dagardner-nv self-assigned this Apr 5, 2023
@dagardner-nv
Contributor

Fixed by PR #758

github-project-automation (bot) moved this from Todo to Done in Morpheus Boards Apr 5, 2023