Fix safetensors failing tests #27231
Conversation
@@ -2313,5 +2326,8 @@ def __init__(self, config: ProphetNetConfig):
         super().__init__(config)
         self.decoder = ProphetNetDecoder(config)
+
+        # This is a link so that tied weights work across classes
+        self.word_embeddings = self.decoder.word_embeddings
As far as I know, this is necessary because safetensors refuses to save duplicated tied weights, and therefore keeps a single copy, under a single identifier, for the whole tied group. It becomes an issue when we have a few: in this case we have 4 tied weights, with some being loaded in some models (like encoders) and others being loaded in other models (like decoders). If the encoder-decoder parent class saves a checkpoint, it can choose to keep the single copy under a name that is only visible in the encoder; when loading the decoder on its own, that weight would then be discarded even though it is the only remaining reference to the tied weights.
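For illustration, here is a minimal sketch (toy model, not code from this PR) of the behavior described above: safetensors keeps a single copy of tensors that share memory, so only one name from a tied group ends up in the file.

```python
import torch.nn as nn
from safetensors.torch import save_model, load_file

class Toy(nn.Module):
    def __init__(self):
        super().__init__()
        self.encoder_emb = nn.Embedding(10, 4)
        self.decoder_emb = nn.Embedding(10, 4)
        # Tie the two embeddings so they share one underlying tensor
        self.decoder_emb.weight = self.encoder_emb.weight

model = Toy()
# save_model deduplicates tensors that share memory, so only one of
# the two embedding names is written to disk.
save_model(model, "toy.safetensors")
print(sorted(load_file("toy.safetensors")))
# If a downstream class only knows the dropped name, the surviving
# entry looks irrelevant and the weight appears to be missing;
# hence the explicit link in the diff above.
```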
Changes in this file are incorrect
good now
will be overridden by #27240
The documentation is not available anymore as the PR was closed or merged.
@@ -3715,6 +3722,7 @@ def __init__(self, config):
         self.lm_head = nn.Linear(config.hidden_size, config.vocab_size, bias=False)

         # Initialize weights and apply final processing
+        self.shared = self.lm_head
I can't see where this is used within SeamlessM4TForSpeechToSpeech (unlike the change above in SeamlessM4TForSpeechToText).
I'll remove this in favor of #27240
@@ -304,6 +304,25 @@ def test_forward_signature(self):
         expected_arg_names = ["pixel_values"]
         self.assertListEqual(arg_names[:1], expected_arg_names)

+    def test_load_save_without_tied_weights(self):
Kosmos-2 is the only model that requires overriding this test from the common tests: maybe I am doing something wrong when adding it.
Is this meant to be temporary? I can take a look at this one later, after the PR is merged.
It's because Kosmos has a config.text_config rather than just a config.
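A hedged sketch of what the override has to account for (the helper name below is illustrative, not the actual test code): on Kosmos-2 the settings the common test touches live one level deeper, on the nested text config.

```python
def resolve_text_config(config):
    # Kosmos-2 nests its text settings under config.text_config,
    # so attributes the common test manipulates sit there rather
    # than on the top-level config. Fall back to the config itself
    # for models without a nested text config.
    return getattr(config, "text_config", config)
```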
The tests all passed, and overall this looks good, although I am not familiar with this part. I am a bit worried about tests like …
Thanks for the review!
Thanks for digging into this and fixing!
From the explanation I think I understand the issue with loading weights for ProphetNet; happy if tests pass for these models.
+1 on @ydshieh's comment about still keeping regression tests that check saving & loading with the torch bin format.
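Something like the following hypothetical round-trip check could cover that (the function and its structure are illustrative, not the test added in this PR):

```python
import tempfile

import torch
from transformers import AutoModelForSeq2SeqLM

def check_bin_round_trip(model_id):
    """Save with the legacy torch .bin format and reload, checking
    that all parameters (including tied ones) survive the trip."""
    model = AutoModelForSeq2SeqLM.from_pretrained(model_id)
    with tempfile.TemporaryDirectory() as tmp:
        # Force the legacy torch .bin format instead of safetensors
        model.save_pretrained(tmp, safe_serialization=False)
        reloaded = AutoModelForSeq2SeqLM.from_pretrained(tmp)
    for (name, p), (_, q) in zip(
        model.named_parameters(), reloaded.named_parameters()
    ):
        assert torch.equal(p, q), f"mismatch in {name}"
```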
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.
* Fix Kosmos2
* Fix ProphetNet
* Fix MarianMT
* Fix M4T
* XLM ProphetNet
* ProphetNet fix
* XLM ProphetNet
* Final M4T fixes
* Tied weights keys
* Revert M4T changes
* Apply suggestions from code review

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
cc @ydshieh