add DPM scheduler with EDM formulation #7120
Conversation
Generally looks good. Let's make sure to add "Copied from ..." statements wherever applicable.
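For reference, a minimal sketch of what such a statement looks like, reusing the `_threshold_sample` example that appears later in this diff (the stub body here is just a placeholder, not the PR's code):

```python
# Copied from diffusers.schedulers.scheduling_ddpm.DDPMScheduler._threshold_sample
def _threshold_sample(self, sample: torch.FloatTensor) -> torch.FloatTensor:
    ...
```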
Looks good in principle!
```python
            else:
                raise NotImplementedError(f"{solver_type} does is not implemented for {self.__class__}")

        if algorithm_type not in ["dpmsolver++", "sde-dpmsolver++"] and final_sigmas_type == "zero":
```
Should `deis` be accepted here as well?
Not sure; for now I followed the logic in the existing DPM scheduler.
I think we just map it to `dpmsolver++` here, so it should be fine; although I absolutely have no clue why "deis" is accepted as an algorithm type (I looked back at the original PR that added it too, and that did not help).
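For reference, a rough sketch of the mapping I mean, paraphrased from memory of the existing DPMSolverMultistepScheduler `__init__` (an assumption, not the exact code in this PR):

```python
# Paraphrased sketch: how the existing multistep scheduler handles "deis".
# Unknown algorithm types raise, but "deis" is silently remapped to "dpmsolver++".
if algorithm_type not in ["dpmsolver", "dpmsolver++", "sde-dpmsolver", "sde-dpmsolver++"]:
    if algorithm_type == "deis":
        self.register_to_config(algorithm_type="dpmsolver++")
    else:
        raise NotImplementedError(f"{algorithm_type} is not implemented for {self.__class__}")
```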
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Looks good! I left a comment about the `_sigma_to_alpha_sigma_t` function - I think maybe we don't need it here and can simplify the math a little bit.
```python
        return sigmas

    # Copied from diffusers.schedulers.scheduling_ddpm.DDPMScheduler._threshold_sample
    def _threshold_sample(self, sample: torch.FloatTensor) -> torch.FloatTensor:
```
Does this work? Asking because we precondition the inputs, so `sample` has a different scale now.
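For context, a minimal sketch of the EDM-style input preconditioning being referred to (following the Karras et al. formulation; the function name and default `sigma_data` here are illustrative, not necessarily the PR's exact API):

```python
import torch

def precondition_inputs(sample: torch.Tensor, sigma: torch.Tensor, sigma_data: float = 0.5) -> torch.Tensor:
    # EDM c_in scaling: the sample fed to the UNet is divided by sqrt(sigma^2 + sigma_data^2),
    # so it no longer has the raw latent scale that dynamic thresholding was written for.
    c_in = 1.0 / (sigma**2 + sigma_data**2) ** 0.5
    return sample * c_in
```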
```python
        return t

    def _sigma_to_alpha_sigma_t(self, sigma):
        alpha_t = torch.tensor(1)  # Inputs are pre-scaled before going into unet, so alpha_t = 1
```
Ohh nice! That's why this formula still works here:

```python
x_t = (sigma_t / sigma_s) * sample - (alpha_t * (torch.exp(-h) - 1.0)) * model_output
```

Or maybe we do not need this function at all; just change the math directly inside the steps, as:

```python
x_t = (sigma_t / sigma_s) * sample - (torch.exp(-h) - 1.0) * model_output
```
```python
        alpha_t, sigma_t = self._sigma_to_alpha_sigma_t(sigma_t)
        alpha_s, sigma_s = self._sigma_to_alpha_sigma_t(sigma_s)
        lambda_t = torch.log(alpha_t) - torch.log(sigma_t)
        lambda_s = torch.log(alpha_s) - torch.log(sigma_s)

        h = lambda_t - lambda_s
```
Suggested change:

```diff
-        alpha_t, sigma_t = self._sigma_to_alpha_sigma_t(sigma_t)
-        alpha_s, sigma_s = self._sigma_to_alpha_sigma_t(sigma_s)
-        lambda_t = torch.log(alpha_t) - torch.log(sigma_t)
-        lambda_s = torch.log(alpha_s) - torch.log(sigma_s)
-        h = lambda_t - lambda_s
+        h = torch.log(sigma_s) - torch.log(sigma_t)
```

I'm not 100% sure the math is correct here, but the idea is that we can calculate `h` from sigma directly.
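A quick sanity check of that idea, as a sketch: with alpha fixed to 1, `lambda = log(alpha) - log(sigma) = -log(sigma)`, so `h = lambda_t - lambda_s = log(sigma_s) - log(sigma_t)`.

```python
import torch

sigma_s, sigma_t = torch.tensor(2.0), torch.tensor(0.5)
alpha = torch.tensor(1.0)  # inputs are pre-scaled, so alpha_t = alpha_s = 1

# Current route through lambda.
lambda_t = torch.log(alpha) - torch.log(sigma_t)
lambda_s = torch.log(alpha) - torch.log(sigma_s)
h_via_lambda = lambda_t - lambda_s

# Proposed shortcut: compute h from the sigmas directly.
h_direct = torch.log(sigma_s) - torch.log(sigma_t)

assert torch.allclose(h_via_lambda, h_direct)
```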
```python
        h = lambda_t - lambda_s
        if self.config.algorithm_type == "dpmsolver++":
            x_t = (sigma_t / sigma_s) * sample - (alpha_t * (torch.exp(-h) - 1.0)) * model_output
```
Suggested change:

```diff
-            x_t = (sigma_t / sigma_s) * sample - (alpha_t * (torch.exp(-h) - 1.0)) * model_output
+            x_t = (sigma_t / sigma_s) * sample - (torch.exp(-h) - 1.0) * model_output
```
```python
        assert noise is not None
        x_t = (
            (sigma_t / sigma_s * torch.exp(-h)) * sample
            + (alpha_t * (1 - torch.exp(-2.0 * h))) * model_output
```
Suggested change:

```diff
-            + (alpha_t * (1 - torch.exp(-2.0 * h))) * model_output
+            + (1 - torch.exp(-2.0 * h)) * model_output
```
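Putting the two suggestions together, the first-order updates would read roughly as below once `alpha_t == 1` is folded out (a sketch only; the trailing noise term is written from memory of the existing scheduler, so double-check it against this PR):

```python
if self.config.algorithm_type == "dpmsolver++":
    x_t = (sigma_t / sigma_s) * sample - (torch.exp(-h) - 1.0) * model_output
elif self.config.algorithm_type == "sde-dpmsolver++":
    assert noise is not None
    x_t = (
        (sigma_t / sigma_s * torch.exp(-h)) * sample
        + (1 - torch.exp(-2.0 * h)) * model_output
        + sigma_t * torch.sqrt(1.0 - torch.exp(-2 * h)) * noise  # assumed unchanged from the existing scheduler
    )
```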
```python
            self.sigmas[self.step_index - 1],
        )

        alpha_t, sigma_t = self._sigma_to_alpha_sigma_t(sigma_t)
```
same comments as #7120 (comment)
What does this PR do?