
Unused or unrecognized kwargs: padding, max_length, truncation. #30106

Closed
Excuses123 opened this issue Apr 8, 2024 · 7 comments · Fixed by #30193
@Excuses123

System Info

  • transformers version: 4.39.2
  • Platform: Linux-3.10.0-1160.99.1.el7.x86_64-x86_64-with-glibc2.31
  • Python version: 3.10.13
  • Huggingface_hub version: 0.22.2
  • Safetensors version: 0.4.2
  • Accelerate version: not installed
  • Accelerate config: not found
  • PyTorch version (GPU?): 2.0.1+cu117 (True)
  • Tensorflow version (GPU?): not installed (NA)
  • Flax version (CPU?/GPU?/TPU?): not installed (NA)
  • Jax version: not installed
  • JaxLib version: not installed
  • Using GPU in script?:
  • Using distributed or parallel set-up in script?:

Who can help?

No response

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

I am fine-tuning the CLIP model. I pass both text and images to the processor and specify a few parameters for the text: padding, max_length, and truncation. Everything worked fine before upgrading to version 4.39, but after the upgrade I get a warning that affects the training process. How should I resolve this?
[screenshot of the warning: "Unused or unrecognized kwargs: padding, max_length, truncation."]

Expected behavior

Do I need to separate the text and images for processing and use processor.tokenizer and processor.image_processor separately?
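For reference, that split does avoid the warning on 4.39. A minimal sketch of the workaround, reusing the checkpoint and image from the reproducer below (merging the two outputs into one dict is an assumption about how you feed the model, not an official API):

import requests
from PIL import Image
from transformers import AutoProcessor

processor = AutoProcessor.from_pretrained("openai/clip-vit-large-patch14-336")

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)

# Tokenizer-only kwargs go to the tokenizer...
text_inputs = processor.tokenizer(
    "a photo of a cat",
    max_length=64,
    padding="max_length",
    truncation=True,
    return_tensors="pt",
)
# ...and the image processor is called without them, so there is
# nothing for it to warn about.
image_inputs = processor.image_processor(image, return_tensors="pt")

# Merge into one batch (input_ids, attention_mask, pixel_values).
inputs = {**text_inputs, **image_inputs}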

@kuri54 commented Apr 8, 2024

The same problem occurs here. Downgrading to 4.38.2 resolves it.

@hda-xian commented Apr 8, 2024

I'm seeing the same problem.

@amyeroberts (Collaborator)

Hi @Excuses123, thanks for raising this issue!

Could you share a minimal reproducer?

@hda-xian commented Apr 8, 2024 via email

@Excuses123 (Author)

> Hi @Excuses123, thanks for raising this issue!
>
> Could you share a minimal reproducer?

Hello, here is a minimal reproducible example:

import requests
from PIL import Image
from transformers import AutoProcessor

processor = AutoProcessor.from_pretrained("openai/clip-vit-large-patch14-336")

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)

# Passing tokenizer kwargs (max_length, padding, truncation) through the
# processor triggers the "Unused or unrecognized kwargs" warning on 4.39.
processor(
    text="a photo of a cat",
    images=image,
    max_length=64,
    padding="max_length",
    truncation=True,
    return_tensors="pt",
)

@amyeroberts (Collaborator)

Thanks for sharing @Excuses123. The warnings started appearing because of new input validation on the image processor. This is good, as it shows that tokenizer kwargs were being passed to the image processor when they shouldn't be! I've opened a PR to address this.
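To illustrate what such a fix has to do (a hypothetical sketch, not the code in the linked PR): the processor forwards one **kwargs to both components, so the kwargs need to be routed by what each component actually accepts, for example by inspecting its signature.

import inspect

def kwargs_accepted_by(component, kwargs):
    # Hypothetical helper: keep only kwargs that appear by name in
    # component.__call__'s signature, so tokenizer-only options such as
    # `truncation` never reach the image processor.
    params = inspect.signature(component.__call__).parameters
    return {k: v for k, v in kwargs.items() if k in params}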

@hda-xian commented

As a temporary local workaround, edit /your path**/transformers/models/clip/processing_clip.py. Inside def __call__(self, text=None, images=None, return_tensors=None, **kwargs), change line 104 from

image_features = self.image_processor(images, return_tensors=return_tensors, **kwargs)

to

image_features = self.image_processor(images, return_tensors=return_tensors)  # drop the other kwargs

so the tokenizer kwargs are no longer forwarded to the image processor. A runtime alternative is sketched below.
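If you'd rather not edit installed files, the same one-line change can be applied at runtime. This is a sketch assuming the 4.39.x __call__ signature quoted above; it reimplements the call path rather than reproducing the upstream fix:

from transformers import CLIPProcessor

def _patched_call(self, text=None, images=None, return_tensors=None, **kwargs):
    # Same routing as the one-line edit above: text kwargs go to the
    # tokenizer only; the image processor gets no extra kwargs.
    encoding = None
    if text is not None:
        encoding = self.tokenizer(text, return_tensors=return_tensors, **kwargs)
    if images is not None:
        image_features = self.image_processor(images, return_tensors=return_tensors)
        if encoding is None:
            return image_features
        encoding["pixel_values"] = image_features["pixel_values"]
    return encoding

CLIPProcessor.__call__ = _patched_call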
