
ImportError using Llama 2 with BetterTransformer #1481

Closed · 2 of 4 tasks
jpgard opened this issue Oct 24, 2023 · 7 comments

Comments

jpgard commented Oct 24, 2023

System Info

I'm attempting to fine-tune a Llama 2 model. My training loop works fine without BetterTransformer, but when I try to import the module I get an ImportError: cannot import name '_expand_mask' from 'transformers.models.llama.modeling_llama'.

I already installed Transformers from git as described in this issue.

Version info:

  • transformers @ git+https://github.com/huggingface/transformers@32f799db0d625ec5cf82624ff2604c5a891ebf61
  • optimum==1.13.2
  • Python 3.8.16

Full stack trace is below. Any suggestions? Thanks!

Traceback (most recent call last):
  File "scripts/train.py", line 235, in <module>
    main(model_arguments=model_args,
  File "scripts/train.py", line 81, in main
    from optimum.bettertransformer import BetterTransformer
  File "/path/to/miniconda3/envs/myenv/lib/python3.8/site-packages/optimum/bettertransformer/__init__.py", line 14, in <module>
    from .models import BetterTransformerManager
  File "/path/to/miniconda3/envs/myenv/lib/python3.8/site-packages/optimum/bettertransformer/models/__init__.py", line 17, in <module>
    from .decoder_models import (
  File "/path/to/miniconda3/envs/myenv/lib/python3.8/site-packages/optimum/bettertransformer/models/decoder_models.py", line 53, in <module>
    from .attention import (
  File "/path/to/miniconda3/envs/myenv/lib/python3.8/site-packages/optimum/bettertransformer/models/attention.py", line 24, in <module>
    from transformers.models.llama.modeling_llama import _expand_mask as _llama_expand_mask
ImportError: cannot import name '_expand_mask' from 'transformers.models.llama.modeling_llama' (/path/to/miniconda3/envs/myenv/lib/python3.8/site-packages/transformers/models/llama/modeling_llama.py)
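A quick diagnostic sketch (my own check, not from optimum) that confirms the missing symbol on the installed transformers commit:

```python
# Diagnostic sketch: check whether the installed transformers still exposes the
# private helper that optimum 1.13.2 imports at module load time. On the commit
# pinned above this prints False, which is what triggers the ImportError.
import importlib

llama_module = importlib.import_module("transformers.models.llama.modeling_llama")
print(hasattr(llama_module, "_expand_mask"))
```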

This is probably a question for @ArthurZucker based on his answer to the post linked above!

Who can help?

@ArthurZucker

Information

  • [ ] The official example scripts
  • [x] My own modified scripts

Tasks

  • [ ] An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • [x] My own task or dataset (give details below)

Reproduction

Installing the versions above and importing BetterTransformer should reproduce the issue, though I haven't verified this in a clean environment.
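A minimal sketch of the failing import, assuming the pinned versions above:

```python
# Minimal reproduction sketch, assuming optimum==1.13.2 and the transformers
# commit pinned above. The import alone triggers the error, because optimum's
# bettertransformer attention module imports _expand_mask at import time.
from optimum.bettertransformer import BetterTransformer  # raises ImportError
```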

Expected behavior

I expect to import BetterTransformer without errors.

ArthurZucker commented Oct 24, 2023

cc @fxmarty: this function is private and was removed in huggingface/transformers#26792!
Let's transfer this issue to the Optimum repository.

ArthurZucker transferred this issue from huggingface/transformers on Oct 24, 2023

jpgard commented Oct 24, 2023

@ArthurZucker thanks for the reply (and for all your and the HF team's work on the code!!). Does this mean the solution is to reinstall from git? (I'm not sure how long PRs take to percolate into the version pip installs from git, or whether it's instant.)


patrickvonplaten commented Oct 25, 2023

Worst case, we can also leave it in transformers and deprecate it there if the removal is too breaking. Otherwise we force users to install optimum from "main", no? Happy to open a PR in transformers to keep it.

patrickvonplaten commented

We'll deprecate it in Transformers: huggingface/transformers#27074 (comment)


fxmarty commented Oct 26, 2023

It is private; I'll fix it.

patrickvonplaten commented

For convenience's sake, I have now deprecated the functions in huggingface/transformers#27074 (comment)
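Roughly, a hedged sketch of what such a deprecation shim looks like (this is not the actual transformers code; _prepare_4d_attention_mask is the replacement helper in recent transformers):

```python
# Hedged sketch of a deprecation shim like the one discussed above. NOT the
# actual transformers implementation: it just keeps the old private name
# importable while steering callers toward the replacement helper.
import warnings

from transformers.modeling_attn_mask_utils import _prepare_4d_attention_mask


def _expand_mask(mask, dtype, tgt_len=None):
    warnings.warn(
        "_expand_mask is a private helper and is deprecated; use "
        "transformers.modeling_attn_mask_utils._prepare_4d_attention_mask instead.",
        FutureWarning,
    )
    return _prepare_4d_attention_mask(mask, dtype, tgt_len=tgt_len)
```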


fxmarty commented Dec 13, 2023

Please use transformers>=4.36 with torch>=2.1.1 directly to benefit from PyTorch SDPA optimizations, which are now enabled by default. BetterTransformer for Llama is deprecated: https://huggingface.co/docs/transformers/perf_infer_gpu_one#flashattention-and-memory-efficient-attention-through-pytorchs-scaleddotproductattention
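A minimal usage sketch under those versions (the checkpoint name is only an example):

```python
# Sketch assuming transformers>=4.36 and torch>=2.1.1: Llama then uses PyTorch's
# scaled_dot_product_attention by default, so no BetterTransformer conversion
# step is needed. The checkpoint is gated and requires Hugging Face access.
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",
    torch_dtype=torch.float16,
    attn_implementation="sdpa",  # explicit here; also the default on these versions
)
```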

fxmarty closed this as completed on Dec 13, 2023