Whisper: move to tensor cpu before converting to np array at decode time #31954

gante · 2024-07-14T14:00:48Z

What does this PR do?

Follow up to #27818

pytest --doctest-modules src/transformers/models/whisper/generation_whisper.py -vv started failing on main due to the PR above.

In a nutshell, if Whisper was running on GPU, the generated tensors would also be on GPU. The new decoding code called token_ids.numpy(), which failed if the token_ids tensor was on GPU. This PR moves it to the CPU before the numpy conversion :)
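As a rough illustration of the failure mode described above (not the code from this PR), the sketch below moves a tensor to the CPU before converting it to NumPy. The helper name `to_numpy_token_ids` and the token id values are hypothetical, and the example assumes `torch` and `numpy` are installed; it falls back to the CPU when no GPU is available.

```python
import numpy as np
import torch


def to_numpy_token_ids(token_ids):
    """Hypothetical helper: return token ids as a NumPy array in host memory."""
    if isinstance(token_ids, torch.Tensor):
        # Calling .numpy() directly on a CUDA tensor raises an error;
        # .cpu() copies it to host memory and returns the same tensor
        # unchanged if it already lives on the CPU.
        return token_ids.cpu().numpy()
    # Lists and existing arrays are passed through np.asarray.
    return np.asarray(token_ids)


# Illustrative ids only; placed on the GPU when one is available.
device = "cuda" if torch.cuda.is_available() else "cpu"
generated = torch.tensor([[50258, 50259, 50359]], device=device)

decoded_input = to_numpy_token_ids(generated)
```

Because `.cpu()` is cheap for tensors that are already on the CPU, calling it unconditionally keeps the conversion path the same for both devices.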

cc @sanchit-gandhi

HuggingFaceDocBuilderDev · 2024-07-14T14:20:49Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

amyeroberts

Thanks for fixing!

Just a question about the properties of token_ids

amyeroberts · 2024-07-14T14:30:20Z

src/transformers/models/whisper/tokenization_whisper.py

-            token_ids = token_ids.numpy()
+        if hasattr(token_ids, "numpy"):
+            if "torch" in str(type(token_ids)):
+                token_ids = token_ids.cpu().numpy()


Following from this - will token_ids ever have a grad? In which case, this will also fail on the cpu call

gante · Jul 14, 2024

token_ids, the output of generate, will not have gradients :) generate is decorated with @no_grad
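To make the point about gradients concrete, here is a small sketch, assuming `torch` is installed, that contrasts a tensor tracked by autograd with one produced under `torch.no_grad()`. The variable names are illustrative and this is not code from `generate` itself. Integer tensors such as token ids also cannot require gradients in PyTorch, so the example uses a float tensor to show the behaviour.

```python
import torch

# A float tensor that autograd tracks.
weights = torch.ones(3, requires_grad=True)

# Outside no_grad, the result is attached to the autograd graph,
# and .numpy() refuses to convert it (detach() would be needed first).
tracked = weights * 2
try:
    tracked.numpy()
    numpy_failed = False
except RuntimeError:
    numpy_failed = True

# Inside no_grad, results do not require gradients,
# so the cpu-then-numpy conversion works without detach().
with torch.no_grad():
    untracked = weights * 2

untracked_array = untracked.cpu().numpy()
```

This mirrors the reply above: output computed under `no_grad` does not carry gradients, so only the device needs handling before the NumPy conversion.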

move to cpu before converting to np

7a678ae

gante requested a review from amyeroberts July 14, 2024 14:00

tensorflow

b0d61fe

amyeroberts approved these changes Jul 14, 2024


gante merged commit a5c642f into huggingface:main Jul 14, 2024
19 checks passed

gante deleted the fix_whisper_doctest branch July 14, 2024 15:39
