
[generate] fix breaking change for patch #29976

Merged: 12 commits into main on Apr 2, 2024
Conversation

@ArthurZucker (Collaborator) commented Apr 1, 2024

What does this PR do?

A bug was introduced by #29467, largely unrelated to cache positions.
This fixes #29968.

cc @gante and @zucchini-nlp. The test suite is missing this particular test for all generation strategies.


@gante (Member) left a comment
Thank you for the fix 👍

@ArthurZucker ArthurZucker marked this pull request as ready for review April 1, 2024 09:51
Comment on lines 721 to 732:

    input_embeds = model.get_input_embeddings()(input_ids)
    beam_kwargs.update({"inputs_embeds": input_embeds})
    output_generate2 = self._beam_sample_generate(
        model=model,
        input_ids=None,
        attention_mask=attention_mask,
        max_length=max_length,
        beam_kwargs=beam_kwargs,
        logits_warper_kwargs=logits_warper_kwargs,
    )

    torch.testing.assert_close(output_generate[:, input_embeds.shape[1] :], output_generate2)
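The final comparison reflects a property of generating from embeddings: when only `inputs_embeds` is passed, the returned sequence contains just the newly generated tokens, so the prompt-length prefix is stripped from the `input_ids` run before comparing. A toy illustration with hypothetical token values:

```python
# Toy sketch (hypothetical values) of the slicing in the test above:
# generating from input_ids returns prompt + new tokens, while generating
# from inputs_embeds alone returns only the new tokens.
prompt_len = 3
output_from_ids = [[11, 12, 13, 21, 22]]  # 3 prompt tokens + 2 generated
output_from_embeds = [[21, 22]]           # generated tokens only
trimmed = [row[prompt_len:] for row in output_from_ids]
assert trimmed == output_from_embeds
```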
Member:

This can't be tested in the mixin -- the vast majority of models don't support passing inputs_embeds to generate; they would need some changes in prepare_inputs_for_generation.

@ArthurZucker (Collaborator, Author):

Alright, I'll check the signature.
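Checking whether a model's input-preparation method accepts `inputs_embeds` can be done by inspecting its signature. A hedged sketch of that kind of check — the helper and the stand-in methods below are hypothetical, not the actual test-suite code:

```python
import inspect

def accepts_inputs_embeds(fn):
    """Return True if fn's signature declares an explicit inputs_embeds parameter.

    Hypothetical helper for illustration; the real test-suite logic may differ.
    """
    return "inputs_embeds" in inspect.signature(fn).parameters

# Stand-in methods (illustrative only):
def prepare_with_embeds(input_ids, inputs_embeds=None, **kwargs):
    pass

def prepare_without_embeds(input_ids, attention_mask=None):
    pass

assert accepts_inputs_embeds(prepare_with_embeds)
assert not accepts_inputs_embeds(prepare_without_embeds)
```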

@ArthurZucker ArthurZucker merged commit 83b26dd into main Apr 2, 2024
19 of 21 checks passed
@ArthurZucker ArthurZucker deleted the fix-regression-generate branch April 2, 2024 07:51
@ArthurZucker (Collaborator, Author):

The failing test is unrelated.

ArthurZucker added a commit that referenced this pull request Apr 2, 2024
* fix bug and add tests

* nit

* otherway to get the cur len instead of attention mask

* more places where this might have been broken

* nit

* oups

* inputs_embeds vs input_embeds

* test generated outptus

* style

* nit

* fix

* skip failing biogpt
Successfully merging this pull request may close these issues.

Generating text with Llama 2 doesn't work when num_beams > 1 and only inputs_embeds is provided