Support ONNX export on torch.float16 type #749
Conversation
Just adding two nits. By the way, is the torch ONNX export on fp16 stable?
I've tried it on a single model, and did not try to load it into an InferenceSession. I will add a test for it, thanks.
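For illustration, a minimal smoke-test sketch of loading such an fp16 export into an InferenceSession — the model path, input names, and shapes here are hypothetical and depend on the exported model:

```python
import numpy as np
import onnxruntime as ort

# Hypothetical path and input signature; adjust to the actual export.
session = ort.InferenceSession(
    "model_fp16.onnx",
    providers=["CUDAExecutionProvider", "CPUExecutionProvider"],
)

dummy_inputs = {
    "input_ids": np.random.randint(0, 100, size=(1, 8), dtype=np.int64),
    "attention_mask": np.ones((1, 8), dtype=np.int64),
}

outputs = session.run(None, dummy_inputs)
# A half-precision export is expected to produce float16 floating-point outputs.
print(outputs[0].dtype)
```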
The documentation is not available anymore as the PR was closed or merged.
Force-pushed from 1ff3ed8 to 2d210a6
Co-authored-by: regisss <15324346+regisss@users.noreply.github.com>
…rty/optimum into support-onnx-export-float16
LGTM!
optional_group.add_argument(
    "--fp16",
    action="store_true",
    help="Experimental option: use half precision during the export. PyTorch-only, requires `--device cuda`.",
)
Experimental because it doesn't work with all models?
I'd say experimental because I haven't thoroughly tested it with ONNX Runtime + CUDAExecutionProvider / TensorrtExecutionProvider, nor with native TensorRT (though the validation itself calls InferenceSession on the CUDA EP, which is a good sign that it's fine). But the export itself is thoroughly tested.
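If someone wants to poke at those execution providers, a small sketch of how the provider priority could be checked in ONNX Runtime — the model path is a placeholder and a TensorRT/CUDA-enabled onnxruntime-gpu build is assumed:

```python
import onnxruntime as ort

# Try TensorRT first, then fall back to CUDA, then CPU.
providers = [
    "TensorrtExecutionProvider",
    "CUDAExecutionProvider",
    "CPUExecutionProvider",
]
session = ort.InferenceSession("model_fp16.onnx", providers=providers)

# Shows which providers this build/model actually picked up.
print(session.get_providers())
```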
As per title.
Test still missing. Partly fixes https://discuss.huggingface.co/t/convert-gpt-j-to-fp-16-onnx/30294