[Whisper] Make tests faster #24105
Conversation
```python
pt_model.config.use_cache = False

# load Flax class
fx_model = fx_model_class(config, input_shape=init_shape, dtype=jnp.float32)
```
We have to override this method to ensure that we init the Flax weights with the downsampled sequence length correctly (e.g. pass `input_shape=init_shape`).
I am a bit confused here. If you look at `FlaxWhisperModelTest`, there is no such override to pass `input_shape`. However, `FlaxWhisperModelTester` uses the low number as in this PR. Why don't we need to pass `init_shape` in `FlaxWhisperModelTest`?
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.
Sorry, I am asking not because I see a test being slow, but because I saw some more Whisper test failures on the daily CI. But yes, in general, it's best to use low numbers. I will take a look.
Note that the Whisper tests have already been flagged as being slow (#23736), so this should help combat that issue!
It's not because a test is slow that we should use large values without a really valid reason :-). The goal is always to make tests use low values, unless it's absolutely necessary. I still have questions on why we don't need to pass `init_shape` though.
OK, in the flax test file, I see ... probably that's the reason.
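Presumably the mechanism being pointed at is that the Flax model derives a default `input_shape` from the config when none is passed at init, so a tester that builds a small config gets a matching small init shape for free. A sketch of that pattern, stated as an assumption rather than a quote from the modeling code:

```python
import jax.numpy as jnp


class FlaxWhisperStyleModel:
    """Illustrative stand-in for the Flax pretrained-model init pattern."""

    def __init__(self, config, input_shape=None, dtype=jnp.float32):
        if input_shape is None:
            # the conv frontend halves the frame count, so the raw input
            # length is twice the encoder context window
            input_shape = (1, config.num_mel_bins, 2 * config.max_source_positions)
        self.input_shape = input_shape
        self.dtype = dtype
```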
Thank you @sanchit-gandhi, LGTM. I convinced myself regarding the `input_shape` thing, but any comment is welcome.
Yep agreed - the seq len was unnecessarily high here :) You're spot on regarding the init shape: we have to change this based on the sequence length, since Flax Whisper initialises the positional embeddings based on the context window. So if we change the seq len (= context window), we need to init the weights with the new shape.
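To make that coupling concrete, here is a minimal sketch (illustrative assumptions, not the PR's test code: the seq length of 60 and batch size of 1 are just example values) of initialising Flax Whisper with a reduced context window:

```python
import jax.numpy as jnp
from transformers import WhisperConfig, FlaxWhisperModel

seq_len = 60  # reduced input length in mel frames (default is 1500)

config = WhisperConfig(
    # Whisper's conv frontend downsamples the features by a factor of 2,
    # so the encoder context window (and therefore the size of the
    # positional embedding table) is seq_len // 2
    max_source_positions=seq_len // 2,
)

# input_shape is (batch, num_mel_bins, num_frames); passing it at init
# time makes the randomly initialised positional embeddings match the
# short context window instead of the full-length default
init_shape = (1, config.num_mel_bins, seq_len)
model = FlaxWhisperModel(config, input_shape=init_shape, dtype=jnp.float32)
```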
Thanks!
What does this PR do?
Reduces the input seq length of the Whisper tests from 1500 -> 60 frames. This in turn should speed up the tests quite considerably.
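As a rough illustration of the kind of change this implies (hypothetical tester attributes following the usual transformers ModelTester pattern, not copied from the diff):

```python
class WhisperModelTester:
    """Hypothetical sketch of the seq-length reduction described above."""

    def __init__(self, batch_size=2, seq_length=60):  # previously 1500
        self.batch_size = batch_size
        self.seq_length = seq_length
        # the encoder context window must track the downsampled input
        # length, since the conv frontend halves the number of frames
        self.max_source_positions = seq_length // 2
```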