Fix video batching to videollava #32139

merveenoyan · 2024-07-22T13:32:54Z

cc @zucchini-nlp works with no problems

HuggingFaceDocBuilderDev · 2024-07-22T13:51:31Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

zucchini-nlp

Thanks ❤️

Can we activate the tests? They are currently being skipped for PIL inputs.

merveenoyan · 2024-07-22T15:35:54Z

@zucchini-nlp one thing confused me: the numpy tests pass even though you don't pass numpify=True when creating the dummy inputs (they also pass the assertion that the inputs are numpy arrays). I couldn't find a change to the numpy test that makes it work for PIL.

zucchini-nlp · 2024-07-23T05:27:02Z

Yes, I made a bad decision when constructing the tests, tailoring them only for torch and numpy. This one has better dummy video preparation; maybe you can copy from it:

transformers/tests/models/llava_next_video/test_image_processing_llava_next_video.py

Lines 99 to 124 in 3aefb4e

    
    def prepare_video_inputs(self, equal_resolution=False, numpify=False, torchify=False):
        images = prepare_image_inputs(
            batch_size=self.batch_size,
            num_channels=self.num_channels,
            min_resolution=self.min_resolution,
            max_resolution=self.max_resolution,
            equal_resolution=equal_resolution,
            numpify=numpify,
            torchify=torchify,
        )

        # let's simply copy the frames to fake a long video-clip
        if numpify or torchify:
            videos = []
            for image in images:
                if numpify:
                    video = image[None, ...].repeat(8, 0)
                else:
                    video = image[None, ...].repeat(8, 1, 1, 1)
                videos.append(video)
        else:
            videos = []
            for pil_image in images:
                videos.append([pil_image] * 8)

        return videos
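
To make the two layouts in the quoted helper concrete, here is a minimal standalone sketch. It is not code from the PR: the function names `fake_video_from_array` and `fake_video_from_frame` and the image size are hypothetical, and a numpy array stands in for a PIL image in the list branch so the example needs only numpy.

```python
import numpy as np

NUM_FRAMES = 8  # same clip length the quoted helper hard-codes


def fake_video_from_array(image: np.ndarray, num_frames: int = NUM_FRAMES) -> np.ndarray:
    """Stack one (C, H, W) image into a (num_frames, C, H, W) clip."""
    # image[None, ...] adds a leading frame axis of size 1;
    # ndarray.repeat(num_frames, 0) then copies the image along that axis.
    return image[None, ...].repeat(num_frames, 0)


def fake_video_from_frame(frame, num_frames: int = NUM_FRAMES) -> list:
    """Build a list-of-frames clip, the layout the helper uses for PIL inputs."""
    return [frame] * num_frames


# Hypothetical dummy image: 3 channels, 20 x 30 pixels.
image = np.random.randint(0, 256, size=(3, 20, 30), dtype=np.uint8)

array_video = fake_video_from_array(image)
list_video = fake_video_from_frame(image)

print(array_video.shape)  # (8, 3, 20, 30)
print(len(list_video))    # 8
```

The array branch yields a single 4-D array per video, while the PIL-style branch yields a plain Python list of frames, so a test has to assert on a different container type for each input kind.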

zucchini-nlp

Great, thanks!

amyeroberts

Thanks for fixing!

FWIW - I think the current "numpify" and "torchify" logic is pretty bad and repeated everywhere. We should look into making it clearer and consolidating it in the future!
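
One possible shape for such a consolidation, offered only as a sketch and not as anything proposed or merged in this PR: replace the two boolean flags with a single `return_type` argument. The function name `prepare_dummy_videos`, its parameters, and the `"np"` / `"pt"` / `"list"` values are all hypothetical.

```python
from typing import Literal

import numpy as np


def prepare_dummy_videos(
    batch_size: int = 2,
    num_frames: int = 8,
    num_channels: int = 3,
    height: int = 20,
    width: int = 30,
    return_type: Literal["np", "pt", "list"] = "np",
    seed: int = 0,
):
    """Return `batch_size` dummy clips in the requested container type."""
    rng = np.random.default_rng(seed)
    clips = [
        rng.integers(0, 256, size=(num_frames, num_channels, height, width), dtype=np.uint8)
        for _ in range(batch_size)
    ]
    if return_type == "np":
        # One (num_frames, C, H, W) array per video.
        return clips
    if return_type == "list":
        # One list of (C, H, W) frames per video, mirroring the PIL-style layout.
        return [list(clip) for clip in clips]
    if return_type == "pt":
        # Imported lazily so the other branches work without torch installed.
        import torch

        return [torch.from_numpy(clip) for clip in clips]
    raise ValueError(f"Unknown return_type: {return_type!r}")


np_videos = prepare_dummy_videos(return_type="np")
list_videos = prepare_dummy_videos(return_type="list")
```

A single argument rules out the ambiguous case where both flags are set, and an unknown value fails loudly instead of silently falling through to the PIL branch.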

fix video batching

f884f54

zucchini-nlp approved these changes Jul 22, 2024

add pil test

aa320a7

Merve Noyan added 2 commits July 22, 2024 18:36

add pil assertion

7e82cd4

nit comment

f79e362

Merve Noyan added 4 commits July 23, 2024 10:57

replace test util

fa1b00d

nit

a46cd02

fix test

45e73b4

fix

6d463b7

merveenoyan requested a review from zucchini-nlp July 23, 2024 09:27

zucchini-nlp approved these changes Jul 23, 2024

zucchini-nlp requested a review from amyeroberts July 23, 2024 09:49

amyeroberts approved these changes Jul 23, 2024

merveenoyan merged commit 9ced33c into main Jul 23, 2024
22 checks passed

merveenoyan deleted the fix-videollava-batching branch July 23, 2024 10:23
