
Error when loading pretrained OpenLlama model #29506

Closed
1 of 4 tasks
kevin-guimard-ext opened this issue Mar 7, 2024 · 4 comments · Fixed by #29893
Labels
Good Second Issue Issues that are more difficult to do than "Good First" issues - give it a try if you want!

Comments

kevin-guimard-ext commented Mar 7, 2024

System Info

  • transformers version: 4.38.2
  • Platform: Windows-10-10.0.19044-SP0
  • Python version: 3.10.13
  • Huggingface_hub version: 0.21.3
  • Safetensors version: 0.4.2
  • Accelerate version: 0.27.2
  • Accelerate config: not found
  • PyTorch version (GPU?): 2.2.1+cu118 (True)
  • Tensorflow version (GPU?): 2.10.1 (True)
  • Flax version (CPU?/GPU?/TPU?): not installed (NA)
  • Jax version: not installed
  • JaxLib version: not installed
  • Using GPU in script?: yes
  • Using distributed or parallel set-up in script?: no

Who can help?

@ArthurZucker @younesbel

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

When running this snippet:

from transformers import OpenLlamaForCausalLM
model = OpenLlamaForCausalLM.from_pretrained("openlm-research/open_llama_7b")

I get the following error:

You are using a model of type llama to instantiate a model of type open-llama. This is not supported for all configurations of models and can yield errors.
...
AttributeError: 'OpenLlamaAttention' object has no attribute 'rope_theta'

Expected behavior

The code snippet was taken from the documentation (https://huggingface.co/docs/transformers/model_doc/open-llama), so it is expected to work.

@ArthurZucker
Collaborator

I can reproduce this. The model is deprecated; feel free to open a PR with a fix. Otherwise I would recommend using Llama, as we no longer maintain OpenLlama.

@kevin-guimard-ext
Author

OK, I'll switch to Llama then. I wanted to try OpenLlama because I heard that Llama requires a subscription, but I may be wrong.

@ArthurZucker
Collaborator

No, OpenLlama is basically Llama but with some "optimisations". You should be able to load https://huggingface.co/NousResearch/Llama-2-7b-hf 🤫
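A minimal sketch of loading the suggested checkpoint instead of OpenLlama, assuming the `transformers` and `torch` packages are installed and there is enough memory for the ~13 GB of weights. `AutoModelForCausalLM` reads the checkpoint's config (`model_type: llama`) and dispatches to `LlamaForCausalLM`, so no `open-llama`-specific class is involved:

```python
# Sketch: load the NousResearch mirror of Llama-2-7b instead of OpenLlama.
# Assumes transformers and torch are installed and the weights fit in memory.
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "NousResearch/Llama-2-7b-hf"  # mirror mentioned above, not gated

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
# AutoModelForCausalLM resolves model_type "llama" to LlamaForCausalLM.
model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

inputs = tokenizer("Hello, my name is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

The `MODEL_ID` here is the mirror Arthur links above; any other checkpoint whose config declares `model_type: llama` would load the same way.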

@kevin-guimard-ext
Author

It's a pity that the "Run the model" section hasn't been written for this model, with a snippet showing how to use it. Do the weights have to be downloaded from the Meta repo?

@ArthurZucker ArthurZucker added the Good Second Issue Issues that are more difficult to do than "Good First" issues - give it a try if you want! label Mar 25, 2024