
Chat template: save and load correctly for processors #33462

Merged: 8 commits into huggingface:main, Sep 18, 2024

Conversation

zucchini-nlp (Member)

What does this PR do?

Fixes #33459. The chat template was not being saved after the last PR, which added LLaVa-OneVision; there I fixed a small bug that caused the chat template to be saved in two places: in its own file and in the processor config.

In this PR we never serialize the chat template into the dict, so it is not saved there, and when saving we check the processor's own attributes instead. Yet I am not 100% sure it is OK to leave the template out of the dict. Or should we pop the template when serializing to JSON, in to_json_string maybe? WDYT?
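
For illustration, here is a minimal sketch of the saving scheme described above. It is a toy under assumed names (the class is invented; the file names chat_template.json and processor_config.json match the ones discussed in this thread), not the actual transformers implementation:

import json
import os


class ProcessorSketch:
    # Toy stand-in for a processor; only the save logic matters here.

    def __init__(self, chat_template=None, **kwargs):
        self.chat_template = chat_template
        self.extra_attributes = kwargs

    def to_dict(self):
        # The chat template is deliberately never serialized into the
        # processor dict, so it cannot end up in processor_config.json.
        return dict(self.extra_attributes)

    def save_pretrained(self, save_directory):
        os.makedirs(save_directory, exist_ok=True)
        # When saving, check the instance attribute directly and write
        # the template to its own file.
        if self.chat_template is not None:
            with open(os.path.join(save_directory, "chat_template.json"), "w") as f:
                json.dump({"chat_template": self.chat_template}, f)
        with open(os.path.join(save_directory, "processor_config.json"), "w") as f:
            json.dump(self.to_dict(), f)

The alternative floated at the end of the comment would instead keep chat_template in the dict and pop it inside to_json_string right before writing.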

@zucchini-nlp zucchini-nlp changed the title Chat template Chat template: save and load correctly for processors Sep 13, 2024
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

amyeroberts (Collaborator) left a comment

Thanks for fixing!

Comment on lines 54 to 58
for key, value in self.prepare_processor_dict().items():
    # chat templates are popped from dict
    if key != "chat_template":
        self.assertEqual(obj[key], value)
        self.assertEqual(getattr(processor, key, None), value)

amyeroberts (Collaborator)

Let's also assert the chat template is popped from the dict.

Suggested change
-for key, value in self.prepare_processor_dict().items():
-    # chat templates are popped from dict
-    if key != "chat_template":
-        self.assertEqual(obj[key], value)
-        self.assertEqual(getattr(processor, key, None), value)
+for key, value in self.prepare_processor_dict().items():
+    # chat templates are popped from dict
+    self.assertFalse(key == "chat_template")
+    self.assertEqual(obj[key], value)
+    self.assertEqual(getattr(processor, key, None), value)

zucchini-nlp (Member, Author)

Oops, no, the test fails because the processor_dict for llava has a chat_template key, and we use it in other tests, for example to init and save the processor. This test is the same as the general one, except that chat templates cannot pass the self.assertEqual(obj[key], value) check.

So we just want to test all the other processor kwargs except the chat template, which is tested separately. By other kwargs, I mean the ones that will be added soon.

amyeroberts (Collaborator)

> with the exception that chat templates cannot pass the self.assertEqual(obj[key], value) check

I see, I missed what was happening originally in the test. Isn't self.prepare_processor_dict().items() a bit redundant then, given that we force self.prepare_processor_dict() to have only one key, "chat_template", so all of this logic is skipped?

zucchini-nlp (Member, Author)

Yes, the same way almost all processors skip this test. For VLMs this test will become usable once we enforce the new processing logic for input expansion with image tokens. Until then, we can override it to prevent failing tests, instead of unittest.skip.

amyeroberts (Collaborator)

Hmmm, I don't think overriding to make it look like the tests are passing is a great idea. Skipping is far better, as it's easier to spot and track.

Part of the issue here is that this new behaviour still isn't being tested then, as we want to make sure that chat_template isn't in the processor_dict when saving out.
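
To illustrate the trade-off (the class and test names below are assumptions, not the real test suite): a skipped test shows up in the report as skipped, with its reason, whereas an override that silently does nothing looks like a pass:

import unittest


class LlavaProcessorTestSketch(unittest.TestCase):
    # Skipping keeps the gap visible in test reports instead of
    # masking it behind a green check.
    @unittest.skip(
        "prepare_processor_dict() currently only returns chat_template, "
        "which is tested separately; unskip once new kwargs are added"
    )
    def test_processor_to_json_string(self):
        pass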

zucchini-nlp (Member, Author)

Ah, we can add one more assert in test_chat_template_is_saved to check the content of processor_dict.

OK, I'll skip it then, with a comment explaining why and noting that we need to stop skipping it at some point.
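
Something along these lines for the extra check; a sketch only, with a hypothetical helper name (chat_template.json and processor_config.json are the file names discussed in this thread):

import json
import os
import tempfile


def check_chat_template_round_trip(processor):
    # Sketch of the extra assertions discussed above.
    with tempfile.TemporaryDirectory() as tmpdir:
        processor.save_pretrained(tmpdir)
        # the template should be written to its own file...
        assert os.path.exists(os.path.join(tmpdir, "chat_template.json"))
        # ...and must not appear in the serialized processor dict
        with open(os.path.join(tmpdir, "processor_config.json")) as f:
            processor_dict = json.load(f)
        assert "chat_template" not in processor_dict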

amyeroberts (Collaborator)

Perfect :)

zucchini-nlp and others added 3 commits September 16, 2024 13:36
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
@andimarafioti andimarafioti mentioned this pull request Sep 17, 2024
@amyeroberts amyeroberts self-requested a review September 17, 2024 10:53

amyeroberts (Collaborator) left a comment

Thanks for updating!

@zucchini-nlp zucchini-nlp merged commit db72894 into huggingface:main Sep 18, 2024
20 checks passed
BernardZach pushed a commit to BernardZach/transformers that referenced this pull request Dec 5, 2024
Chat template: save and load correctly for processors (#33462)

* fix

* add tests

* fix tests

* Update tests/models/llava/test_processor_llava.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* fix

* fix tests

* update tests

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Successfully merging this pull request may close these issues:

chat_template.json is not saved when using LlavaProcessor.save_pretrained() (#33459)