
AttributeError: Can't get attribute 'SiLUActivation' on <module 'transformers.activations' #28177

Closed
Lokesh-Jatangi opened this issue Dec 21, 2023 · 5 comments · Fixed by #28509

Comments

@Lokesh-Jatangi

Lokesh-Jatangi commented Dec 21, 2023

System Info


  • transformers version: 4.36.2
  • Platform: Linux-5.10.0-26-cloud-amd64-x86_64-with-glibc2.31
  • Python version: 3.10.13
  • Huggingface_hub version: 0.20.1
  • Safetensors version: 0.4.0
  • Accelerate version: 0.24.1
  • Accelerate config: not found
  • PyTorch version (GPU?): 2.1.1+cu118 (True)
  • Tensorflow version (GPU?): not installed (NA)
  • Flax version (CPU?/GPU?/TPU?): not installed (NA)
  • Jax version: not installed
  • JaxLib version: not installed
  • Using GPU in script?: Yes
  • Using distributed or parallel set-up in script?: No

I am using a custom script that loads a LLaMA checkpoint through torch:
model_orig = torch.load(checkpoint_path)

While unpickling the checkpoint, torch cannot find the SiLUActivation class in activations.py. PR #27136 removed the SiLUActivation class, noting that it was redundant.

P.S.: with transformers version 4.35.0, loading a checkpoint containing a SiLU activation layer through torch was successful.

Find the trace below:

line 65, in load_model_from_checkpoint
    model_orig = torch.load(checkpoint_path)
  File "/opt/conda/envs/adapt/lib/python3.10/site-packages/torch/serialization.py", line 1014, in load
    return _load(opened_zipfile,
  File "/opt/conda/envs/adapt/lib/python3.10/site-packages/torch/serialization.py", line 1422, in _load
    result = unpickler.load()
  File "/opt/conda/envs/adapt/lib/python3.10/site-packages/torch/serialization.py", line 1415, in find_class
    return super().find_class(mod_name, name)
AttributeError: Can't get attribute 'SiLUActivation' on <module 'transformers.activations' from '/opt/conda/envs/adapt/lib/python3.10/site-packages/transformers/activations.py'>

I would be happy to add the SiLUActivation class back to the activations.py file and submit a PR here. Please let me know if I can proceed.

Who can help?

@amyeroberts

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

Any model that uses the SiLU activation function and is loaded through torch.load() will face this issue.
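
A minimal sketch of the failure mode (the model name and file paths are illustrative; step 1 must run on transformers <= 4.35.x, step 2 on >= 4.36.0):

# Step 1, on transformers <= 4.35.x: pickle a whole model with torch.save.
# torch.save(model) stores module classes by reference, including
# transformers.activations.SiLUActivation inside the MLP blocks.
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")
torch.save(model, "llama_full.pt")

# Step 2, on transformers >= 4.36.0: unpickling fails because the class was removed.
model_orig = torch.load("llama_full.pt")
# AttributeError: Can't get attribute 'SiLUActivation' on <module 'transformers.activations' ...>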

Expected behavior

After reverting the change, torch should be able to identify the SiLUActivation class again.

@amyeroberts
Collaborator

Hi @Lokesh-Jatangi, thanks for raising this issue!

Is there a reason you're using torch.load here? The officially supported way to load checkpoints is through the from_pretrained method.
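
For contrast, a minimal sketch of the supported path (the model name is illustrative):

from transformers import AutoModelForCausalLM

# from_pretrained rebuilds the architecture from the config and then loads the
# weights, so it does not depend on pickled class paths inside the checkpoint.
model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")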

@Lokesh-Jatangi
Author

The checkpoint stores a pruned model whose structure and weights differ from the original architecture, hence I could not use the from_pretrained method.

@coderchem

@Lokesh-Jatangi Have you solved your problem?

@amyeroberts
Collaborator

@Lokesh-Jatangi We can't guarantee backwards compatibility for a checkpoint which isn't a transformers architecture and isn't loaded through the officially supported API. In order to maintain the repo, there will be objects that we move, rename, and delete, so pickling in this way may cause issues.

I'd suggest loading the model on the most recent compatible version of transformers, updating the model to use torch's SiLU activation implementation, and then resaving the model. This should resolve the issue and allow you to load the model in more recent transformers versions again.
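
A sketch of that migration (checkpoint paths are hypothetical, and this must run on a transformers version that still ships SiLUActivation, i.e. <= 4.35.x):

import torch
from torch import nn
from transformers.activations import SiLUActivation  # still present on <= 4.35.x

model = torch.load("pruned_llama.pt")  # hypothetical pruned-model checkpoint

# Swap every pickled SiLUActivation for torch's own nn.SiLU, whose class path
# is stable across transformers releases.
for module in model.modules():
    for child_name, child in module.named_children():
        if isinstance(child, SiLUActivation):
            setattr(module, child_name, nn.SiLU())

torch.save(model, "pruned_llama_silu.pt")  # now loadable on newer transformers versions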

@33answer33

@Lokesh-Jatangi Have you solved your problem?

I added this to site-packages/transformers/activations.py, and it works:

import torch.nn.functional as F
from torch import Tensor, nn

class SiLUActivation(nn.Module):
    def forward(self, input: Tensor) -> Tensor:
        return F.silu(input)
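
A variant of the same workaround that avoids editing site-packages: restore the attribute from the loading script before calling torch.load (a sketch; the checkpoint path is hypothetical):

import torch
import torch.nn.functional as F
from torch import Tensor, nn
import transformers.activations

# Re-create the removed class with the same behaviour...
class SiLUActivation(nn.Module):
    def forward(self, input: Tensor) -> Tensor:
        return F.silu(input)

# ...and expose it under the module path the unpickler resolves.
transformers.activations.SiLUActivation = SiLUActivation

model_orig = torch.load("pruned_llama.pt")  # hypothetical checkpoint path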
