Paligemma: fix generation with Gemma2 #36044
Conversation
we can just use kwargs, no?
I think making it explicit that the kwargs will be used only by the LM was better.
Fine with me! Thanks a lot!
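For illustration, a minimal sketch of the two signatures being weighed in this exchange; the names below are placeholders, not the actual PaliGemma code:

```python
# Option A: a generic catch-all ("we can just use kwargs").
# Callers can pass any argument the LM backbone understands.
def forward(self, input_ids, pixel_values, **kwargs):
    ...

# Option B: a dedicated name that documents the destination:
# these arguments go to the language model and nowhere else.
def forward(self, input_ids, pixel_values, **lm_kwargs):
    ...
```

Option B trades a slightly longer signature for making the kwargs' consumer explicit, which is the point raised above.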
Let's say that an integration test is most welcome as well!
Yeah, it was quite low priority for the patch so I decided to skip it for now :)
For transparency, this commit needs to be modified for the patch, applying only the changes for PaliGemma 2.
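For reference, a hedged sketch of what such an integration test might look like; the checkpoint id is an assumption, the blank image is a stand-in input, and no exact output string is asserted since none has been verified here:

```python
import torch
from PIL import Image
from transformers import AutoProcessor, PaliGemmaForConditionalGeneration

def test_paligemma2_generate():
    # Assumed PaliGemma 2 checkpoint (Gemma-2 text backbone).
    model_id = "google/paligemma2-3b-pt-224"
    processor = AutoProcessor.from_pretrained(model_id)
    model = PaliGemmaForConditionalGeneration.from_pretrained(
        model_id, torch_dtype=torch.bfloat16
    )
    # A blank image keeps the test self-contained.
    image = Image.new("RGB", (224, 224), color="white")
    inputs = processor(text="caption en", images=image, return_tensors="pt")
    # The bug in #36029 surfaced during generate() with a Gemma-2
    # backbone, so completing generation at all exercises the fix.
    output = model.generate(**inputs, max_new_tokens=20, do_sample=False)
    text = processor.batch_decode(output, skip_special_tokens=True)[0]
    assert isinstance(text, str) and len(text) > 0
```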
* fix paligemma
* nit
* use `kwargs` in models that can load any LM
* update changes to only affect Paligemma
What does this PR do?
Fixes #36029 and adds tests for the model. IMO we need tests with a different LM backbone, because Gemma-2 is special.
This is a quick fix, but I think we should make this kind of fix on the LM work out-of-the-box, for example by accepting it as `kwargs`. Most LMs accept `loss_kwargs`, so we can make all multimodal models also accept kwargs that are simply passed further to the LM. WDYT?
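To make the proposal concrete, a minimal sketch of the pattern, assuming a hypothetical wrapper class and a hypothetical `merge_image_features` helper; this is not the real modeling code:

```python
import torch.nn as nn

class MultimodalWrapper(nn.Module):
    """Illustrative wrapper around an interchangeable LM backbone."""

    def __init__(self, vision_tower, language_model):
        super().__init__()
        self.vision_tower = vision_tower
        self.language_model = language_model

    def forward(self, input_ids, pixel_values, attention_mask=None, **kwargs):
        image_features = self.vision_tower(pixel_values)
        # Hypothetical helper that splices image embeddings into the
        # text embeddings at the image-token positions.
        inputs_embeds = self.merge_image_features(input_ids, image_features)
        # Anything the caller passed (e.g. loss kwargs such as
        # num_items_in_batch, or Gemma-2-specific arguments) reaches
        # the LM untouched, so the wrapper never has to enumerate
        # every backbone's signature.
        return self.language_model(
            inputs_embeds=inputs_embeds,
            attention_mask=attention_mask,
            **kwargs,
        )
```

The upside is that a new backbone argument never requires touching the wrapper; the cost is the less explicit signature discussed in the review above.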