Build light weight PyRuntime without llvm or onnx-mlir #3044

chentong319 · 2025-01-15T20:51:53Z

Motivation

Python driver is needed to run the compiled model. Currently, the driver is built with onnx-mlir and can only be run in the env where onnx-mlir is built, typically inside the onnx-mlir docker image. When the compilation can be done by calling the onnx-mlir docker image, we'd like to run the compiled .so with python driver in the local env, so that all the packages installed in the local env can be used, rather than installing them on top of docker.

In order to reach this goal, the PR tried to remove the unnecessary dependencies of pyruntime if the light-weight pyruntime is the target, how onnx-mlir is built and used remains as it was previously.
Details can be found in docs/build-pyruntime-lit.md.
I tried the build on a z16 machine: it takes less than 2 minutes.

Components in this PR

CMakefile and source code changes to cut the dependencies of pyruntime. An option ONNX_MLIR_ENABLE_PYRUNTIME_LIT is used to control Cmake, and consequently a compile definition ENABLE_PYRUNTIME_LIT issued to control the source code.
Wrap the built pyruntime driver into a python package
This python package use python docker package to call the compiler. This is equivalent to docker/onnx-mlir.py in functionality but with different abstraction.

Test
Run successfully with utils/BuildPyRuntimeLit.sh.

Future works:

Support of float16
Not all llvm utilities are replaced. Only the essential ones have been implemented.
Can third_party/onnx be removed?
Try to integrate the precompiled lib for different os-arch into the package. Enable user to use pip install with package name remotely
Try to integrate the utils/build-pyruntime-lit.sh into python package and invoke the build when pip install is executed.

Signed-off-by: Chen Tong <chentong@us.ibm.com>

chentong319 added 5 commits January 15, 2025 12:20

pass test

4ab6256

Signed-off-by: Chen Tong <chentong@us.ibm.com>

package

49034b5

Signed-off-by: Chen Tong <chentong@us.ibm.com>

clean makefile

45b1d9d

Signed-off-by: Chen Tong <chentong@us.ibm.com>

Merge remote-tracking branch 'upstream/main' into pyruntime-lit

0ee214f

document

beff2e3

Signed-off-by: Chen Tong <chentong@us.ibm.com>

chentong319 marked this pull request as draft January 15, 2025 21:00

chentong319 added 3 commits January 15, 2025 16:19

fix MLIR.cmake

28584dd

Signed-off-by: Chen Tong <chentong@us.ibm.com>

fix script

a92ccc2

Signed-off-by: Chen Tong <chentong@us.ibm.com>

Merge remote-tracking branch 'upstream/main' into pyruntime-lit

2037d8d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Build light weight PyRuntime without llvm or onnx-mlir #3044

Build light weight PyRuntime without llvm or onnx-mlir #3044

chentong319 commented Jan 15, 2025 •

edited

Loading

Build light weight PyRuntime without llvm or onnx-mlir #3044

Are you sure you want to change the base?

Build light weight PyRuntime without llvm or onnx-mlir #3044

Conversation

chentong319 commented Jan 15, 2025 • edited Loading

chentong319 commented Jan 15, 2025 •

edited

Loading