Fix torch.fx symbolic tracing for LLama #30047

Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Thanks for fixing!
Should work! Let's fix other models as well
@@ -987,7 +987,9 @@ def forward(
     if position_ids is None:
         position_ids = cache_position.unsqueeze(0)

-    causal_mask = self._update_causal_mask(attention_mask, inputs_embeds, cache_position)
+    causal_mask = self._update_causal_mask(
+        attention_mask, inputs_embeds, cache_position, past_seen_tokens + inputs_embeds.shape[1]
+    )
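For context on why an explicit length argument helps here: during `torch.fx` symbolic tracing, tensors are replaced by proxies that record operations instead of holding values, so any code path that needs a concrete Python integer out of a tensor fails at trace time. A minimal pure-Python sketch of that recording behavior (the `Graph` and `Proxy` classes below are simplified stand-ins for illustration, not the real `torch.fx` API):

```python
class Graph:
    """Records operations instead of executing them."""
    def __init__(self):
        self.nodes = []

    def record(self, op, args):
        self.nodes.append((op, args))
        return Proxy(self, f"node_{len(self.nodes) - 1}")

class Proxy:
    """Stand-in for a tensor during tracing: every op yields a new proxy."""
    def __init__(self, graph, name):
        self.graph = graph
        self.name = name

    def __getitem__(self, idx):
        # Indexing a proxy is fine: it just records a node in the graph.
        return self.graph.record("getitem", (self.name, idx))

    def __int__(self):
        # There is no runtime value while tracing, so conversion must fail.
        raise TypeError("cannot convert a traced proxy to a concrete int")

graph = Graph()
cache_position = Proxy(graph, "cache_position")

last = cache_position[-1]  # fine: recorded as a graph node
try:
    int(last)              # breaks tracing: no concrete value exists yet
except TypeError as err:
    print(err)
```

This mirrors why the patch threads `past_seen_tokens + inputs_embeds.shape[1]` through as an argument: the addition stays a traceable expression rather than forcing a concrete value out of a proxy mid-trace.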
I guess passing cache_position[-1] doesn't work? The only issue is that past_seen_tokens is gonna be deprecated in favor of cache positions, but that works since with static cache we use the max position embeddings.
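The point about the static cache can be sketched as follows: with a pre-allocated (static) cache the causal mask always spans the full cache size, so it no longer depends on `past_seen_tokens` at all; only the dynamic-cache path does. The helper name below is hypothetical, chosen for illustration, and is not the transformers API:

```python
def causal_mask_length(past_seen_tokens, new_tokens, static_cache_size=None):
    """Hypothetical helper: pick the key/value length the causal mask covers.

    With a static (pre-allocated) cache the mask spans the whole cache
    (e.g. max_position_embeddings), so deprecating `past_seen_tokens` is
    harmless there; with a dynamic cache the length grows with the tokens
    seen so far.
    """
    if static_cache_size is not None:
        return static_cache_size  # fixed size, independent of history
    return past_seen_tokens + new_tokens

# dynamic cache: mask length tracks tokens processed so far
assert causal_mask_length(10, 3) == 13
# static cache: mask length pinned to the pre-allocated size
assert causal_mask_length(10, 3, static_cache_size=4096) == 4096
```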
Last thing, can you run the slow tests with RUN_SLOW=1 pytest tests/models/llama on CUDA?
====================================== short test summary info =======================================
SKIPPED [1] ../miniconda3/envs/py39/lib/python3.9/unittest/case.py:117: TODO @gante fix this for Llama
SKIPPED [1] tests/test_pipeline_mixin.py:327: LlamaModelTest::test_pipeline_audio_classification is skipped: `audio-classification` is not in `self.pipeline_model_mapping` for `LlamaModelTest`.
SKIPPED [1] tests/test_pipeline_mixin.py:331: LlamaModelTest::test_pipeline_automatic_speech_recognition is skipped: `automatic-speech-recognition` is not in `self.pipeline_model_mapping` for `LlamaModelTest`.
SKIPPED [1] tests/test_pipeline_mixin.py:335: LlamaModelTest::test_pipeline_conversational is skipped: `conversational` is not in `self.pipeline_model_mapping` for `LlamaModelTest`.
SKIPPED [1] tests/test_pipeline_mixin.py:339: LlamaModelTest::test_pipeline_depth_estimation is skipped: `depth-estimation` is not in `self.pipeline_model_mapping` for `LlamaModelTest`.
SKIPPED [1] tests/test_pipeline_mixin.py:346: test requires PyTesseract
SKIPPED [1] tests/test_pipeline_mixin.py:357: LlamaModelTest::test_pipeline_fill_mask is skipped: `fill-mask` is not in `self.pipeline_model_mapping` for `LlamaModelTest`.
SKIPPED [1] tests/test_pipeline_mixin.py:361: LlamaModelTest::test_pipeline_image_classification is skipped: `image-classification` is not in `self.pipeline_model_mapping` for `LlamaModelTest`.
SKIPPED [1] tests/test_pipeline_mixin.py:379: LlamaModelTest::test_pipeline_image_feature_extraction is skipped: `image-feature-extraction` is not in `self.pipeline_model_mapping` for `LlamaModelTest`.
SKIPPED [1] tests/test_pipeline_mixin.py:367: LlamaModelTest::test_pipeline_image_segmentation is skipped: `image-segmentation` is not in `self.pipeline_model_mapping` for `LlamaModelTest`.
SKIPPED [1] tests/test_pipeline_mixin.py:374: LlamaModelTest::test_pipeline_image_to_text is skipped: `image-to-text` is not in `self.pipeline_model_mapping` for `LlamaModelTest`.
SKIPPED [1] tests/test_pipeline_mixin.py:386: `run_pipeline_test` is currently not implemented.
SKIPPED [1] tests/test_pipeline_mixin.py:393: LlamaModelTest::test_pipeline_object_detection is skipped: `object-detection` is not in `self.pipeline_model_mapping` for `LlamaModelTest`.
SKIPPED [1] tests/test_pipeline_mixin.py:404: LlamaModelTest::test_pipeline_summarization is skipped: `summarization` is not in `self.pipeline_model_mapping` for `LlamaModelTest`.
SKIPPED [1] tests/test_pipeline_mixin.py:408: LlamaModelTest::test_pipeline_table_question_answering is skipped: `table-question-answering` is not in `self.pipeline_model_mapping` for `LlamaModelTest`.
SKIPPED [1] tests/test_pipeline_mixin.py:412: LlamaModelTest::test_pipeline_text2text_generation is skipped: `text2text-generation` is not in `self.pipeline_model_mapping` for `LlamaModelTest`.
SKIPPED [1] tests/test_pipeline_mixin.py:425: LlamaModelTest::test_pipeline_text_to_audio is skipped: `text-to-audio` is not in `self.pipeline_model_mapping` for `LlamaModelTest`.
SKIPPED [1] tests/test_pipeline_mixin.py:430: LlamaModelTest::test_pipeline_token_classification is skipped: `token-classification` is not in `self.pipeline_model_mapping` for `LlamaModelTest`.
SKIPPED [1] tests/test_pipeline_mixin.py:434: LlamaModelTest::test_pipeline_translation is skipped: `translation` is not in `self.pipeline_model_mapping` for `LlamaModelTest`.
SKIPPED [1] tests/test_pipeline_mixin.py:438: LlamaModelTest::test_pipeline_video_classification is skipped: `video-classification` is not in `self.pipeline_model_mapping` for `LlamaModelTest`.
SKIPPED [1] tests/test_pipeline_mixin.py:445: LlamaModelTest::test_pipeline_visual_question_answering is skipped: `visual-question-answering` is not in `self.pipeline_model_mapping` for `LlamaModelTest`.
SKIPPED [1] tests/test_pipeline_mixin.py:455: LlamaModelTest::test_pipeline_zero_shot_audio_classification is skipped: `zero-shot-audio-classification` is not in `self.pipeline_model_mapping` for `LlamaModelTest`.
SKIPPED [1] tests/test_pipeline_mixin.py:460: LlamaModelTest::test_pipeline_zero_shot_image_classification is skipped: `zero-shot-image-classification` is not in `self.pipeline_model_mapping` for `LlamaModelTest`.
SKIPPED [1] tests/test_pipeline_mixin.py:465: LlamaModelTest::test_pipeline_zero_shot_object_detection is skipped: `zero-shot-object-detection` is not in `self.pipeline_model_mapping` for `LlamaModelTest`.
SKIPPED [1] tests/models/llama/test_modeling_llama.py:371: Llama buffers include complex numbers, which breaks this test
SKIPPED [1] tests/models/llama/test_modeling_llama.py:657: Model is curently gated
SKIPPED [1] tests/models/llama/test_modeling_llama.py:615: Logits are not exactly the same, once we fix the instabalities somehow, will update!
SKIPPED [1] tests/models/llama/test_modeling_llama.py:628: Logits are not exactly the same, once we fix the instabalities somehow, will update!
SKIPPED [1] tests/models/llama/test_modeling_llama.py:641: Logits are not exactly the same, once we fix the instabalities somehow, will update! Also it is gonna be a `too_slow` test
SKIPPED [1] tests/models/llama/test_modeling_llama.py:602: Logits are not exactly the same, once we fix the instabalities somehow, will update!
================= 1 failed, 120 passed, 30 skipped, 23 warnings in 143.75s (0:02:23) =================
LGTM, thanks for taking the time to fix it!
What does this PR do?
As per title.