Add EDMEulerScheduler #7109

patil-suraj · 2024-02-27T04:06:38Z

What does this PR do?

This PR is based on #4481 by @dg845!

This PR adds an Euler scheduler that uses the EDM formulation. It differs from EulerDiscreteScheduler as follows:

EulerDiscreteScheduler was essentially designed to let DDPM models use Euler-style sampling algorithms with discrete timesteps.
EDMEulerScheduler follows the EDM formulation as closely as possible and is intended solely for models that use the EDM formulation, like SVD. It does not support epsilon scaling, as that is already covered by EulerDiscreteScheduler. Models that still use the DDPM formulation (the alphas and betas schedules) should keep using EulerDiscreteScheduler.

Why add a new scheduler class?
While it would be possible to support this in EulerDiscreteScheduler by introducing an argument, maybe called scaling_type, that could take the values epsilon (DDPM), v-scaling, or edm, there are two issues:

The naming is inconsistent: with EDM, the model is conditioned on continuous noise scales rather than discrete timesteps.
Some functionality in EulerDiscreteScheduler is not required for EDM, such as computing betas, rescaling for zero terminal SNR, interpolating sigmas, etc.

Hence, with the current scheduler API, supporting the full EDM formulation in EulerDiscreteScheduler would make the code confusing to follow, with lots of if/else branches.

API

from diffusers import EDMEulerScheduler

scheduler = EDMEulerScheduler(sigma_min=0.002, sigma_max=80.0, sigma_data=0.5)

scheduler.sigmas  # computed directly as per the Karras et al. (EDM) paper
scheduler.timesteps  # the `c_noise` values fed to the model instead of discrete timesteps

# precondition the input (kept under a separate name so the unscaled sample stays available)
scaled_sample = scheduler.precondition_inputs(sample, sigma)

# get c_noise
c_noise = scheduler.precondition_noise(sigma)

# precondition the output, using the original (unscaled) sample
output = scheduler.precondition_outputs(sample, model_output, sigma)

These pre-conditioning methods can be used during training to easily scale the input/output based on sigma values, which the user is free to sample in any way they want during training.
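
As a rough illustration of what these pre-conditioning methods compute, the sketch below writes out the EDM coefficients from the Karras et al. paper in plain Python. The function names `edm_coefficients` and `denoise` are hypothetical and not part of the scheduler API, scalars stand in for tensors, and the sign of `c_out` matches the `epsilon` branch of the scheduler code under review.

```python
import math


def edm_coefficients(sigma: float, sigma_data: float = 0.5):
    """Return (c_skip, c_out, c_in, c_noise) for one noise level (hypothetical helper)."""
    denom = sigma**2 + sigma_data**2
    c_skip = sigma_data**2 / denom                  # weight of the noisy input in the output
    c_out = sigma * sigma_data / math.sqrt(denom)   # weight of the raw model output
    c_in = 1.0 / math.sqrt(denom)                   # scale applied to the model input
    c_noise = 0.25 * math.log(sigma)                # noise conditioning fed to the model
    return c_skip, c_out, c_in, c_noise


def denoise(sample, sigma, model, sigma_data: float = 0.5):
    """Combine the three pre-conditioning steps around a model call (assumed structure)."""
    c_skip, c_out, c_in, c_noise = edm_coefficients(sigma, sigma_data)
    model_output = model(c_in * sample, c_noise)
    # The skip connection uses the original, unscaled sample.
    return c_skip * sample + c_out * model_output
```

With a trained network in place of `model`, `denoise` would return the predicted clean sample; during training the same coefficients can be applied to whatever sigma values the user samples.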

We could also consider adding a method to help with sigma sampling as per here
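
One common choice for sigma sampling, taken from the EDM paper rather than from this PR, is to draw `ln(sigma)` from a normal distribution and weight the loss by a sigma-dependent factor. The sketch below assumes the paper's defaults (`p_mean=-1.2`, `p_std=1.2`); the helper names are hypothetical and no such method exists in the scheduler.

```python
import math
import random


def sample_sigma(rng: random.Random, p_mean: float = -1.2, p_std: float = 1.2) -> float:
    """Draw one training noise level with ln(sigma) ~ N(p_mean, p_std^2) (assumed defaults)."""
    return math.exp(rng.gauss(p_mean, p_std))


def edm_loss_weight(sigma: float, sigma_data: float = 0.5) -> float:
    """EDM loss weighting: (sigma^2 + sigma_data^2) / (sigma * sigma_data)^2."""
    return (sigma**2 + sigma_data**2) / (sigma * sigma_data) ** 2


rng = random.Random(0)  # fixed seed only to make the sketch reproducible
training_sigmas = [sample_sigma(rng) for _ in range(4)]
weights = [edm_loss_weight(s) for s in training_sigmas]
```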

The rest of the API follows the existing scheduler API.
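
For reference, the noise levels exposed as `scheduler.sigmas` in the API snippet follow the schedule from the Karras et al. paper. A minimal sketch of that formula, assuming the paper's `rho = 7` and using a hypothetical function name, could look like this:

```python
def karras_sigmas(num_steps: int, sigma_min: float = 0.002,
                  sigma_max: float = 80.0, rho: float = 7.0) -> list:
    """Noise levels from sigma_max down to sigma_min, interpolated in sigma^(1/rho) space."""
    if num_steps < 2:
        raise ValueError("num_steps must be at least 2")
    min_inv_rho = sigma_min ** (1.0 / rho)
    max_inv_rho = sigma_max ** (1.0 / rho)
    return [
        (max_inv_rho + (i / (num_steps - 1)) * (min_inv_rho - max_inv_rho)) ** rho
        for i in range(num_steps)
    ]


sigmas = karras_sigmas(10)  # starts near sigma_max and decreases towards sigma_min
```

Interpolating in `sigma ** (1 / rho)` space concentrates steps at low noise levels, which is the motivation the paper gives for this schedule.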

I would love to have your thoughts here @yiyixuxu @sayakpaul @dhruvrnaik @dg845

HuggingFaceDocBuilderDev · 2024-02-27T04:13:07Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

sayakpaul · 2024-02-27T05:17:19Z

src/diffusers/schedulers/scheduling_edm_euler.py

+            s_churn (`float`):
+            s_tmin  (`float`):
+            s_tmax  (`float`):


Would be nice to have docstrings.

sayakpaul · 2024-02-27T05:18:21Z

src/diffusers/schedulers/scheduling_edm_euler.py

+
+        return EDMEulerSchedulerOutput(prev_sample=prev_sample, pred_original_sample=pred_original_sample)
+
+    def add_noise(


sayakpaul · 2024-02-27T05:21:07Z

src/diffusers/schedulers/scheduling_edm_euler.py

+        if self.config.prediction_type == "epsilon":
+            c_out = sigma * sigma_data / (sigma**2 + sigma_data**2) ** 0.5
+        elif self.config.prediction_type == "v_prediction":
+            c_out = -sigma * sigma_data / (sigma**2 + sigma_data**2) ** 0.5
+        else:
+            raise ValueError(f"Prediction type {self.config.prediction_type} is not supported.")


I guess users will have to do this bit manually during training when experimenting with different prediction types?

sayakpaul

Design looks very nice to me!

yiyixuxu

looking good to me:)

question:
for v_prediction: does it work the same as in what's currently implemented in EulerDiscreteScheduler?

patil-suraj · 2024-02-27T08:11:35Z

for v_prediction: does it work the same as in what's currently implemented in EulerDiscreteScheduler?

@yiyixuxu Yes, that's right. EulerDiscreteScheduler would work the same way if sigma_min and sigma_max are provided and timestep_type == "continuous".

Actually, I'm not sure if we should put it here; it's used for SVD and follows EDM as well.

sayakpaul · 2024-02-27T08:18:01Z

I think it's okay to put it there, as it helps the community experiment with different prediction types and give us feedback if needed. We can iterate on top of that.

* Add EDMEulerScheduler
* address review comments
* fix import
* fix test
* add tests
* add co-author

Co-authored-by: Daniel Gu (@dg845) dgu8957@gmail.com

Add EDMEulerScheduler

e671a00

patil-suraj requested review from sayakpaul, yiyixuxu and DN6 February 27, 2024 04:48

sayakpaul reviewed Feb 27, 2024

sayakpaul reviewed Feb 27, 2024

yiyixuxu approved these changes Feb 27, 2024

yiyixuxu reviewed Feb 27, 2024

yiyixuxu reviewed Feb 27, 2024

address review comments

ce3cfd1

patil-suraj marked this pull request as ready for review February 27, 2024 08:37

patil-suraj added 4 commits February 27, 2024 14:53

fix import

1804dda

fix test

34de8af

add tests

a06abab

add co-author

ea2a6c0

patil-suraj merged commit f57e7bd into main Feb 27, 2024
15 checks passed

patil-suraj deleted the edm-euler branch February 27, 2024 12:21

Beinsezii pushed a commit to Beinsezii/diffusers that referenced this pull request Feb 28, 2024

Add EDMEulerScheduler (huggingface#7109)

20a98f6
