
[Pipeline] Wuerstchen v3 aka Stable Cascade pipeline #6487

Merged
123 commits merged into huggingface:main on Mar 6, 2024

Conversation

@kashif (Contributor) commented Jan 8, 2024

What does this PR do?

Add Wuerstchen v3 pipeline

@kashif (Contributor, Author) commented Jan 8, 2024

cc @dome272

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@patrickvonplaten (Contributor) left a comment

There is still quite a bit of clean-up to do until we can merge this, I think.

@DN6 merged commit 40aa47b into huggingface:main on Mar 6, 2024
15 checks passed
@vladmandic (Contributor)

All of the recent class renames broke the actual model, as https://huggingface.co/stabilityai/stable-cascade/blob/main/model_index.json refers to class names that no longer exist.
For example, StableCascadeUnet vs StableCascadeUNet.

@aa956 commented Mar 6, 2024

All of the recent class renames broke the actual model, as https://huggingface.co/stabilityai/stable-cascade/blob/main/model_index.json refers to class names that no longer exist. For example, StableCascadeUnet vs StableCascadeUNet.

If I understand correctly, this was expected and will be fixed by merging this PR on the model repo?

@kashif (Contributor, Author) commented Mar 6, 2024

Correct, @aa956 and @vladmandic.

@vladmandic (Contributor)

Feels like a sledgehammer solution to a self-inflicted problem: users will need to manually delete their existing copy of Stable Cascade and redownload it.

@sayakpaul (Member)

FWIW, it was from a PR branch and not even from the main branch. So, this should not be too surprising.

@vladmandic (Contributor)

FWIW, it was from a PR branch and not even from the main branch. So, this should not be too surprising.

Fair point, I forgot that we needed a PR branch to start with.

@vladmandic (Contributor) commented Mar 6, 2024

Also, what are the plans to support loading the different model variants? For each stage there is a full and a lite version, with fp32 and bf16 weights for each (plus an unofficial fp16). Right now, even with the PR, there is only one frozen variant available on Hugging Face, as that is the only one in the actual HF folder-style format.

Perhaps the diffusers team could republish the model so all variants are actually available as variants and can be loaded using from_pretrained by specifying variant; that is what variant was designed for, no?

The current PRs to the model itself (44 for the decoder and 2 for the prior) only make it worse, since they force the fp32 version, which is double the size and unusable by the majority of users.
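
For context, the variant mechanism under discussion loads alternative weight files from the same folder layout. A minimal sketch, assuming bf16 weights were published as a proper variant of the prior (which was not yet the case at the time of this comment):

import torch
from diffusers import StableCascadePriorPipeline

# "variant" picks an alternative weight file (e.g. *.bf16.safetensors)
# inside the same repo layout; the model config stays identical.
prior = StableCascadePriorPipeline.from_pretrained(
    "stabilityai/stable-cascade-prior",
    variant="bf16",
    torch_dtype=torch.bfloat16,
)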

@FurkanGozukara commented Mar 7, 2024

Yes, we need to be able to load from a single point (e.g. a single safetensors file). This is really, really needed.

@vladmandic (Contributor)

@DN6 @sayakpaul what are the plans to make Stable Cascade actually available at https://huggingface.co/stabilityai/stable-cascade?
The PRs to modify the model are still open and it doesn't look like StabilityAI has much interest in them. Even if they were merged, my previous comments still stand: if the diffusers format is preferred, we need to make it actually available.

@sayakpaul (Member)

The merge decision lies with StabilityAI. Once merged, that will become available. But until then we have to use the revision mechanism. I know that is not ideal, but it is the best we have got so far.

You can also check out the open PRs in the repository where we are adding support for Stable Cascade single file checkpoint loading.

Cc: @apolinario
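
For context, the revision mechanism lets from_pretrained read straight from an open Hub PR branch. A minimal sketch, assuming the decoder-repo PR mentioned in this thread is exposed as refs/pr/44:

from diffusers import StableCascadeDecoderPipeline

# Hub PRs are git refs, so an unmerged repo PR can be loaded directly
# without waiting for the repo owner to merge it.
decoder = StableCascadeDecoderPipeline.from_pretrained(
    "stabilityai/stable-cascade",
    revision="refs/pr/44",
)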

@vladmandic (Contributor)

The merge decision lies with StabilityAI.

Not necessarily: StabilityAI uploaded only a single variant of the model in diffusers format.
And even if the PR is accepted, it's still going to be a single variant.
If diffusers prefers the diffusers folder-style format, then why not republish the model with all available variants, as that format expects?
It's not like the Hugging Face team has never republished an existing model.

Shifting blame does not help users who want to use the model.

@sayakpaul (Member)

You want us to host the model on our org? Sorry, I didn't mean to blame anyone, and I apologise if it sounded like that.

Also, which variant in the diffusers format is currently missing?

@vladmandic (Contributor)

As you know, both stage_b and stage_c exist in 4 variants each: full fp32, full bf16, lite fp32, lite bf16.
But if I look in stabilityai/stable-cascade/decoder or stabilityai/stable-cascade-prior/prior as originally published, I only see one variant.

PR 44 in the decoder repo and PR 2 in the prior repo add the variants, but place the lite variants in a different folder. How can they be used with from_pretrained?

And yes, ideally StabilityAI should accept those two PRs (and even better, they should have published the models with all variants in the first place). As it is, the model is now a month past publication and still not usable (and the PR is a week old). So yes, if StabilityAI does not accept the PRs, then perhaps Hugging Face should republish the model under the Hugging Face org.

@sayakpaul (Member)

PR 44 in the decoder repo and PR 2 in the prior repo add the variants, but place the lite variants in a different folder. How can they be used with from_pretrained?

These will be separate pipelines for convenience. Refer to scripts/convert_stable_cascade_lite.py in https://github.com/huggingface/diffusers/pull/7271/files.

And yes, ideally StabilityAI should accept those two PRs (and even better, they should have published the models with all variants in the first place). As it is, the model is now a month past publication and still not usable (and the PR is a week old). So yes, if StabilityAI does not accept the PRs, then perhaps Hugging Face should republish the model under the Hugging Face org.

Duly noted. Thanks for explaining!

@yiyixuxu (Collaborator)

Hi @vladmandic

How can they be used with from_pretrained?

For using the lite version, we included an example here: https://github.com/huggingface/diffusers/pull/7257/files#diff-abde182084148a55a5a5edf9df250650923a1c1686e3f138e9f40d587e74ec2c

@vladmandic (Contributor) commented Mar 12, 2024

A bit messy, as it requires manually loading the UNet from a subfolder and then creating a pipeline.
Why not have lite simply as a variant as well?
BTW, I think the example is wrong: from diffusers import StableCascadeUNet doesn't exist; the class is in diffusers.models.

And does StableCascadeCombinedPipeline support lite?
I'm getting corrupt outputs, almost like the VQGAN scaling is off (no issues with full).

@yiyixuxu (Collaborator) commented Mar 12, 2024

@vladmandic

I'm getting corrupt outputs, almost like the VQGAN scaling is off (no issues with full).

We have a bug in the pipeline; this PR should fix it: #7287

Why not have lite simply as a variant as well?

Ohh, I like this idea. I think it will be easier to use indeed. That would require us to duplicate the checkpoints for the other components, though. cc @DN6 and @sayakpaul here, let me know what you guys think!

@yiyixuxu (Collaborator)

@vladmandic

And it should work with the combined pipeline:

from diffusers import DiffusionPipeline
from diffusers.models import StableCascadeUNet

# load the lite UNets from their dedicated subfolders
prior_unet = StableCascadeUNet.from_pretrained("stabilityai/stable-cascade-prior", subfolder="prior_lite")
decoder_unet = StableCascadeUNet.from_pretrained("stabilityai/stable-cascade", subfolder="decoder_lite")

pipe_combined = DiffusionPipeline.from_pretrained("stabilityai/stable-cascade", prior_prior=prior_unet, decoder=decoder_unet, ...)

@sayakpaul (Member)

Ohh, I like this idea. I think it will be easier to use indeed. That would require us to duplicate the checkpoints for the other components, though. cc @DN6 and @sayakpaul here, let me know what you guys think!

Sounds good to me! In line with https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0/tree/main/vae_1_0.
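
For context, the vae_1_0 folder linked above follows the same pattern: an alternative component checkpoint lives in its own subfolder, gets loaded explicitly, and is passed into the pipeline. A minimal sketch:

from diffusers import AutoencoderKL, DiffusionPipeline

# load the alternative VAE from its dedicated subfolder...
vae = AutoencoderKL.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0", subfolder="vae_1_0"
)
# ...and swap it in when assembling the pipeline
pipe = DiffusionPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0", vae=vae
)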

@yiyixuxu (Collaborator)

@vladmandic
I realize the lite version has a different config, so it won't work as a variant out of the box.
We may still want to explore this in the future, though, if more models are distributed this way.

@bghira (Contributor) commented Mar 13, 2024

Having a key for variant-specific values in the config might work?

Something like:

{
    ... other config values ...
    "variant_config": {
        "lite": {
            ... lite values ...
        }
    }
}

@sayakpaul (Member)

The "variant" keyword is reserved for loading checkpoints belonging to certain numerical precisions IIUC. So, I would like to keep it that way given that a model variant (in the sense that it's architecturally different) can also be loaded with subfolder as shown by @yiyixuxu in #6487 (comment).

Would love to know why and how that's not what we're looking for here.

@bghira (Contributor) commented Mar 13, 2024

Well, that's true. The subfolder can have a config.json in there, and it should use that to load that model type.

@yiyixuxu (Collaborator)

@sayakpaul
variant is just used to load different versions of the model; it does not have to be limited to numerical precision, and there is no reason to limit it to that.

I'm not saying we should use variant here, but that should not be a factor.

@sayakpaul (Member)

variant is just used to load different versions of the model; it does not have to be limited to numerical precision, and there is no reason to limit it to that.

This can only be done when:

  1. The number of parameters of the underlying model is exactly the same.
  2. The config of the underlying model is exactly the same.

Model versions mean different things and can vary with respect to numerical precision, architecture, etc. I think the sole assumption behind "variant" is that we're operating with a model that respects the two points I listed above.

If those two are respected, then sure -- we definitely shouldn't be restrictive about it. But I don't think that is the case for "lite" here.
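
Concretely (an illustrative sketch, not from the thread): bf16 weights satisfy both conditions, so variant applies; the lite UNet has a different config and parameter count, so it is loaded as its own model from a separate subfolder. The bf16 variant here is assumed to be published in the repo:

import torch
from diffusers.models import StableCascadeUNet

# same config and parameter count, different precision: a valid "variant"
unet_bf16 = StableCascadeUNet.from_pretrained(
    "stabilityai/stable-cascade-prior", subfolder="prior",
    variant="bf16", torch_dtype=torch.bfloat16,
)

# different architecture and config: its own subfolder, not a variant
unet_lite = StableCascadeUNet.from_pretrained(
    "stabilityai/stable-cascade-prior", subfolder="prior_lite"
)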

@bghira (Contributor) commented Mar 13, 2024

I might propose that model_index.json could have a list of subfolders to point to, under a new keyword; something like collection.

model = DiffusionModel.from_pretrained('username/reponame', collection='lite')

model_index.json:

{
    "collections": {
        "lite": {
            "prior": "prior_lite",
            ...
        }
    }
}

@vladmandic (Contributor)

@sayakpaul I agree with your points. I was looking for a quick and easy solution, and overloading variant is not it.
But we should have a way to specify a "flavor" (full or lite) somehow. Using subfolder to load the UNet and then passing it to the pipeline later is not really elegant; it's definitely not a single step, and in the application layer it leads to many if/then branches.
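
A sketch of the application-layer branching being described, under the current subfolder approach (the flavor argument itself is hypothetical):

from diffusers import DiffusionPipeline
from diffusers.models import StableCascadeUNet

def load_prior_pipeline(flavor: str) -> DiffusionPipeline:
    # today, every "flavor" needs its own branch plus a manual UNet load
    if flavor == "lite":
        prior_unet = StableCascadeUNet.from_pretrained(
            "stabilityai/stable-cascade-prior", subfolder="prior_lite"
        )
        return DiffusionPipeline.from_pretrained(
            "stabilityai/stable-cascade-prior", prior=prior_unet
        )
    return DiffusionPipeline.from_pretrained("stabilityai/stable-cascade-prior")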
