change bnb tests #34713
Conversation
jiqing-feng commented Nov 13, 2024 (edited)
- The BNB CPU and XPU paths do not support autocast LoRA finetuning for now.
- XPU does not support gpt2 for now.
- Add llama tests (a hedged sketch of the kind of setup these tests exercise follows below).
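
Not part of the PR diff: a minimal sketch, assuming a hypothetical `require_autocast_finetune` helper and an assumed llama checkpoint id, of how a test could skip the unsupported autocast LoRA finetune path on cpu/xpu and load a model in 4-bit:

```python
# Hypothetical sketch (not the PR's actual code): gate the autocast LoRA
# finetune path off cpu/xpu, then load a llama checkpoint in 4-bit.
import unittest

import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig


def require_autocast_finetune(test_case):
    # Assumed helper: cpu and xpu do not support autocast LoRA finetuning yet,
    # so skip whenever no CUDA device is present.
    on_cpu_or_xpu = not torch.cuda.is_available()
    return unittest.skipIf(
        on_cpu_or_xpu, "autocast LoRA finetune is unsupported on cpu/xpu"
    )(test_case)


quantization_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.float16,
)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-3.2-1B",  # assumed checkpoint; the PR adds llama tests
    quantization_config=quantization_config,
    device_map="auto",  # should resolve to cuda/xpu/cpu depending on the host
)
```

With `device_map="auto"`, accelerate should place the quantized modules on whichever backend is present, which is what makes a single test body usable across the CUDA, XPU, and CPU paths this PR targets.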
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
Thanks for this great work in conjunction with BNB PR #1418, @jiqing-feng 🔥🤗 I'll do my best to provide feedback on this ASAP so we can iterate. That said, I need to balance it with other high-impact topics like quantization improvements and the custom_ops registration refactor (which underpins merging all this into …). Thanks so much to you and the Intel team ❤️ for your continued valuable work and support on this! It's highly appreciated, and I'm looking forward to a final sprint to materialize the fruits of our collaboration.
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
Will review this today and tomorrow.
I'm going to take over reviewing this and will work on getting access to hardware to run it on.
I was able to try this on an Intel machine with a GPU Max 1100, and everything looks good with this PR applied.
Before this PR (on the latest transformers release), we saw several failures and a crash:
```
tests/quantization/bnb/test_4bit.py::Pipeline4BitTest::test_pipeline Fatal Python error: Aborted

Thread 0x00007f538b656640 (most recent call first):
  File "/opt/conda/lib/python3.11/threading.py", line 331 in wait
  File "/opt/conda/lib/python3.11/threading.py", line 629 in wait
  File "/opt/conda/lib/python3.11/site-packages/tqdm/_monitor.py", line 60 in run
  File "/opt/conda/lib/python3.11/threading.py", line 1045 in _bootstrap_inner
  File "/opt/conda/lib/python3.11/threading.py", line 1002 in _bootstrap

Current thread 0x00007f5843c5d740 (most recent call first):
  File "/opt/conda/lib/python3.11/site-packages/transformers/models/bloom/modeling_bloom.py", line 68 in build_alibi_tensor
  File "/opt/conda/lib/python3.11/site-packages/transformers/models/bloom/modeling_bloom.py", line 577 in build_alibi_tensor
  File "/opt/conda/lib/python3.11/site-packages/transformers/models/bloom/modeling_bloom.py", line 671 in forward
  File "/opt/conda/lib/python3.11/site-packages/torch/nn/modules/module.py", line 1747 in _call_impl
  File "/opt/conda/lib/python3.11/site-packages/torch/nn/modules/module.py", line 1736 in _wrapped_call_impl
  File "/opt/conda/lib/python3.11/site-packages/transformers/models/bloom/modeling_bloom.py", line 973 in forward
  File "/opt/conda/lib/python3.11/site-packages/accelerate/hooks.py", line 170 in new_forward
  File "/opt/conda/lib/python3.11/site-packages/torch/nn/modules/module.py", line 1747 in _call_impl
  File "/opt/conda/lib/python3.11/site-packages/torch/nn/modules/module.py", line 1736 in _wrapped_call_impl
  File "/opt/conda/lib/python3.11/site-packages/transformers/generation/utils.py", line 3222 in _sample
  File "/opt/conda/lib/python3.11/site-packages/transformers/generation/utils.py", line 2231 in generate
  File "/opt/conda/lib/python3.11/site-packages/torch/utils/_contextlib.py", line 116 in decorate_context
  File "/opt/conda/lib/python3.11/site-packages/transformers/pipelines/text_generation.py", line 370 in _forward
  File "/opt/conda/lib/python3.11/site-packages/transformers/pipelines/base.py", line 1208 in forward
  File "/opt/conda/lib/python3.11/site-packages/transformers/pipelines/base.py", line 1308 in run_single
  File "/opt/conda/lib/python3.11/site-packages/transformers/pipelines/base.py", line 1301 in __call__
  File "/opt/conda/lib/python3.11/site-packages/transformers/pipelines/text_generation.py", line 272 in __call__
  File "/usr/src_host/transformers/tests/quantization/bnb/test_4bit.py", line 513 in test_pipeline
```
Hi @matthewdouglas. Thanks for your testing.
Thanks for your PR!
@jiqing-feng That's correct: I observed that with this PR applied, the tests run correctly. No changes needed. Thanks!
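
As a hedged usage note (not stated in the thread): transformers' quantization tests are typically marked slow and gated behind the `RUN_SLOW` environment variable, so reproducing the full run locally would look something like:

```
RUN_SLOW=1 pytest tests/quantization/bnb/test_4bit.py
```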