Support ONNX export for causal LM sequence classifiers #27450

Merged — 1 commit merged into huggingface:main on Nov 16, 2023

Conversation

@dwyatte (Contributor) commented Nov 11, 2023

What does this PR do?

Partial fix for huggingface/optimum#1527, which occurs in optimum when exporting causal LMs with sequence classification support to ONNX.

ONNX's argmax operator does not support int64 inputs, but int64 should not be needed here since these are just boolean tensors.
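
For illustration, here is a minimal sketch of the kind of cast involved, assuming the usual pad-token-based sequence-length computation in the `*ForSequenceClassification` heads (the token ids below are made up, and the exact modeling code may differ):

```python
import torch

# Toy batch of token ids, right-padded with pad_token_id = 0 (made-up values).
pad_token_id = 0
input_ids = torch.tensor([[5, 6, 7, 0, 0],
                          [3, 4, 0, 0, 0]])

# Previously the boolean pad mask was cast to int64 (.long()) before argmax,
# which trips up the ONNX argmax operator during export:
# sequence_lengths = torch.eq(input_ids, pad_token_id).long().argmax(-1) - 1

# Casting to int32 (.int()) is enough, since the tensor only holds 0/1 values.
sequence_lengths = torch.eq(input_ids, pad_token_id).int().argmax(-1) - 1
print(sequence_lengths)  # tensor([2, 1]) -> index of the last non-pad token per row
```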

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline, Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

@ArthurZucker and @younesbelkada (CC @fxmarty)

@fxmarty (Contributor) left a comment

Thank you for the fix @dwyatte, LGTM! Feel free to open a PR in optimum as well to re-enable the export of those models for text-classification.
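
For reference, a rough sketch of re-running the export once optimum re-enables the task. The helper name, argument names, and model checkpoint below are assumptions to verify against the optimum documentation for your installed version:

```python
# Hypothetical export call; check main_export's exact signature in your optimum version.
from optimum.exporters.onnx import main_export

main_export(
    model_name_or_path="gpt2",     # example causal LM checkpoint (assumption)
    output="onnx-seq-cls",         # example output directory (assumption)
    task="text-classification",    # the task referred to in the comment above
)
```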

@ArthurZucker (Collaborator) left a comment

Nice follow-up to #24979. Not really sure why I did not use int at the time, but I ran the slow tests and this seems to be alright! Thanks 🤗

@ArthurZucker (Collaborator) commented

The failing test might just need a rebase onto main; otherwise I'll skip it on main and work on a fix.

@amyeroberts (Collaborator) left a comment

Thanks for fixing!

@dwyatte (Contributor, Author) commented Nov 14, 2023

> The failing test might just need a rebase onto main; otherwise I'll skip it on main and work on a fix.

@ArthurZucker I rebased in 8076250, but it looks like something is still up with CI. Perhaps different tests get selected based on the files changed, or between PRs and main.

@amyeroberts (Collaborator) commented

@dwyatte Yes, the test fetcher selects a subset of the tests to run based on the files that are touched. In this case, the failing tests are (I believe) unrelated to your PR. The tests involving safetensors have had a patch pushed on main. Could you rebase on main to include these in the test runners?

@dwyatte force-pushed the causal_classification_onnx branch from 8076250 to d8ab2c9 on November 14, 2023 at 21:06
@dwyatte (Contributor, Author) commented Nov 14, 2023

@amyeroberts I think there are still some other problems on main. Here's what's failing in the tests for this PR after the rebases in 8076250 and d8ab2c9:

FAILED tests/models/switch_transformers/test_modeling_switch_transformers.py::SwitchTransformersModelTest::test_assisted_decoding_sample - RuntimeError: The size of tensor a (3) must match the size of tensor b (4) at non-singleton dimension 3
FAILED tests/models/t5/test_modeling_t5.py::T5ModelTest::test_assisted_decoding_sample - RuntimeError: The size of tensor a (3) must match the size of tensor b (4) at non-singleton dimension 3
FAILED tests/models/speech_to_text/test_modeling_speech_to_text.py::Speech2TextModelTest::test_tf_from_pt_safetensors - AssertionError: False is not true

@VsonicV (Contributor) commented Nov 15, 2023

@dwyatte Exactly the same unrelated CI failures show up in #27351. In addition to the safetensors-related failures (which you already mentioned above), we also need to resolve the other CI failures caused by test_assisted_decoding_sample in tests_torch.

@ArthurZucker (Collaborator) commented

Sorry to you both for the delays; I'll skip these 3 tests as well. cc @gante, I'll look into the test_assisted_decoding_sample failures.

@VsonicV (Contributor) commented Nov 15, 2023

Hi @ArthurZucker, regarding the failures caused by test_assisted_decoding_sample: there has already been some discussion in #26892. These failures did not happen only for switch_transformers and t5, but also for blenderbot, pegasus, and umt5 in my previous CI checks. It seems to be a more general issue that may affect more than the models listed here. Thanks for the help in fixing them!

@ArthurZucker (Collaborator) commented Nov 15, 2023

Just merged #27508, which should skip it for all models.

@dwyatte force-pushed the causal_classification_onnx branch from d8ab2c9 to fef0c8b on November 15, 2023 at 14:34
@dwyatte (Contributor, Author) commented Nov 15, 2023

Thanks @ArthurZucker, that took care of the remaining failures. This is ready to merge.

@fxmarty merged commit 1394e08 into huggingface:main on Nov 16, 2023
3 checks passed
EduardoPach pushed a commit to EduardoPach/transformers that referenced this pull request Nov 19, 2023