fix `F.interpolate()` for large batch sizes #1006

NouamaneTazi · 2022-10-26T23:09:37Z

Fixes #984.
It seems that F.interpolate(hidden_states, scale_factor=2.0, mode="nearest") breaks for batch sizes > 64 when hidden_states uses channels last format. See pytorch/pytorch#81665 and #984

This PR proposes to force a contiguous format for hidden states when bsz > 64. Credits to @pcuenca for the find.

The following now works (after applying the memory efficient PR + this PR)

pipe = StableDiffusionPipeline.from_pretrained(
    "CompVis/stable-diffusion-v1-4", 
    use_auth_token=True,
    revision="fp16",
    torch_dtype=torch.float16,
).to("cuda")

batch_size = 32

with torch.inference_mode():
    image = pipe([prompt] * batch_size, num_inference_steps=5).images[0]

cc @pcuenca @patrickvonplaten @patil-suraj

HuggingFaceDocBuilderDev · 2022-10-26T23:13:15Z

The documentation is not available anymore as the PR was closed or merged.

NouamaneTazi · 2022-10-26T23:21:27Z

Wondering if I should apply the same fix for the following lines as well 🤔

        if self.upsample is not None:
            input_tensor = self.upsample(input_tensor)
            hidden_states = self.upsample(hidden_states)

patil-suraj

Looks good to me!Maybe let's apply it before those two lines that you listed so it wilkl apply to all upsample ops.

NouamaneTazi · 2022-10-27T14:11:51Z

Done @patil-suraj

* fix `upsample_nearest_nhwc` for large bsz * fix `upsample_nearest_nhwc` for large bsz

fix upsample_nearest_nhwc for large bsz

cf53584

NouamaneTazi changed the title ~~fix F.interpolate() for large bsz~~ fix F.interpolate() for large batch sizes Oct 26, 2022

NouamaneTazi requested review from patil-suraj, patrickvonplaten and pcuenca October 26, 2022 23:21

patil-suraj approved these changes Oct 27, 2022

View reviewed changes

fix upsample_nearest_nhwc for large bsz

f9fa986

NouamaneTazi requested a review from patil-suraj October 27, 2022 21:39

patil-suraj approved these changes Oct 28, 2022

View reviewed changes

patil-suraj merged commit ab079f2 into huggingface:main Oct 28, 2022

yoonseokjin pushed a commit to yoonseokjin/diffusers that referenced this pull request Dec 25, 2023

fix F.interpolate() for large batch sizes (huggingface#1006)

0ddc0d3

* fix `upsample_nearest_nhwc` for large bsz * fix `upsample_nearest_nhwc` for large bsz

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix `F.interpolate()` for large batch sizes #1006

fix `F.interpolate()` for large batch sizes #1006

NouamaneTazi commented Oct 26, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Oct 26, 2022 •

edited

Loading

NouamaneTazi commented Oct 26, 2022

patil-suraj left a comment

NouamaneTazi commented Oct 27, 2022

fix F.interpolate() for large batch sizes #1006

fix F.interpolate() for large batch sizes #1006

Conversation

NouamaneTazi commented Oct 26, 2022 • edited Loading

HuggingFaceDocBuilderDev commented Oct 26, 2022 • edited Loading

NouamaneTazi commented Oct 26, 2022

patil-suraj left a comment

Choose a reason for hiding this comment

NouamaneTazi commented Oct 27, 2022

fix `F.interpolate()` for large batch sizes #1006

fix `F.interpolate()` for large batch sizes #1006

NouamaneTazi commented Oct 26, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Oct 26, 2022 •

edited

Loading