Qwen2VL: skip base `input_ids`-`inputs_embeds` equivalence check #34535

gante · 2024-10-31T14:36:56Z

What does this PR do?

Computing inputs_embeds in Qwen2VL is more complex than simply embedding input_ids (see Qwen2VLForConditionalGeneration), so the basic check doesn't apply :)

Fixes:

py.test tests/models/qwen2_vl/test_modeling_qwen2_vl.py::Qwen2VLModelTest::test_generate_from_inputs_embeds_1_beam_search --flake-finder --flake-runs 100

gante · 2024-10-31T14:39:09Z

tests/generation/test_utils.py

@@ -1610,7 +1610,7 @@ def test_generate_from_inputs_embeds(self, _, num_beams):
                inputs_dict.pop("pixel_values_images", None)
            #   2.C - No easy fix, let's skip the check that compares the outputs from `input_ids` and `inputs_embeds`
            has_complex_embeds_computation = any(
-                model_name in model_class.__name__.lower() for model_name in ["moshi"]
+                model_name in model_class.__name__.lower() for model_name in ["moshi", "qwen2vl"]


This flag skips the check that passing inputs_embeds = model.get_input_embeddings()(input_ids) should be equivalent to passing input_ids to generate.

This is not true in Qwen2VL, whose computation of inputs_embeds is more complex

OK for me. Just curious: would this be addressed in the future to make it work

Not unless we create a model function to embed inputs, similar to def get_input_embeddings(self).

(I think we should do it eventually, we're starting to have a few models with non-standard inputs embeddings creation!)

got it :-) 👍

HuggingFaceDocBuilderDev · 2024-10-31T15:04:29Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

…gingface#34535) it has complex inputs_embeds computation

it has complex inputs_embeds computation

294c706

gante requested a review from ydshieh October 31, 2024 14:37

gante commented Oct 31, 2024

View reviewed changes

ydshieh approved these changes Oct 31, 2024

View reviewed changes

gante merged commit 4ca004e into huggingface:main Oct 31, 2024
26 checks passed

gante deleted the qwen2vl_flaky_inputs_embeds branch October 31, 2024 15:42

BernardZach pushed a commit to BernardZach/transformers that referenced this pull request Dec 5, 2024

Qwen2VL: skip base input_ids-inputs_embeds equivalence check (hug…

1e7f0bd

…gingface#34535) it has complex inputs_embeds computation

eustlb mentioned this pull request Dec 19, 2024

tokenizer decode decode with timestamp fails for extended vocabulary #35330

Open

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Qwen2VL: skip base `input_ids`-`inputs_embeds` equivalence check #34535

Qwen2VL: skip base `input_ids`-`inputs_embeds` equivalence check #34535

gante commented Oct 31, 2024 •

edited

Loading

gante Oct 31, 2024 •

edited

Loading

ydshieh Oct 31, 2024

gante Oct 31, 2024

ydshieh Oct 31, 2024

HuggingFaceDocBuilderDev commented Oct 31, 2024

Qwen2VL: skip base input_ids-inputs_embeds equivalence check #34535

Qwen2VL: skip base input_ids-inputs_embeds equivalence check #34535

Conversation

gante commented Oct 31, 2024 • edited Loading

What does this PR do?

gante Oct 31, 2024 • edited Loading

Choose a reason for hiding this comment

ydshieh Oct 31, 2024

Choose a reason for hiding this comment

gante Oct 31, 2024

Choose a reason for hiding this comment

ydshieh Oct 31, 2024

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Oct 31, 2024

Qwen2VL: skip base `input_ids`-`inputs_embeds` equivalence check #34535

Qwen2VL: skip base `input_ids`-`inputs_embeds` equivalence check #34535

gante commented Oct 31, 2024 •

edited

Loading

gante Oct 31, 2024 •

edited

Loading