[Core] Harmonize single file ckpt model loading #6971

sayakpaul · 2024-02-14T03:57:41Z

What does this PR do?

Currently, we make use of set_module_tensor_to_device in the single file loading utilities to speed up model loading time. However, in src/models/modeling_utils.py, we already have a utility called load_model_dict_into_meta() that abstracts the boilerplate code.

diffusers/src/diffusers/models/modeling_utils.py

Line 133 in 0ca7b68

def load_model_dict_into_meta(

This PR updates the single file utilities to use load_model_dict_into_meta() as well.

I have run the single-file SLOW tests, but @DN6, feel free to do so on your end, too, just to be sure.

HuggingFaceDocBuilderDev · 2024-02-14T04:08:51Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

DN6 · 2024-02-14T05:05:59Z

src/diffusers/loaders/single_file_utils.py

+            for pat in vae._keys_to_ignore_on_load_unexpected:
+                unexpected_keys = [k for k in unexpected_keys if re.search(pat, k) is None]
+
+        if len(unexpected_keys) > 0:


Small nit. Since unexpected_keys is always of type list we can just use

if unexpected_keys: ...

Oh, I followed what's done in modeling_utils.py to be consistent.

DN6 · 2024-02-14T05:09:20Z

src/diffusers/loaders/single_file.py

@@ -48,6 +48,7 @@ def build_sub_model_components(
    load_safety_checker=False,
    model_type=None,
    image_size=None,
+    torch_dtype=None,


Nice catch!

DN6

LGTM 👍🏽

sayakpaul added 5 commits February 14, 2024 07:41

use load_model_into_meta in single file utils

4a012ba

propagate to autoencoder and controlnet.

badf58c

correct class name access behaviour.

7fe5c9a

remove torch_dtype from load_model_into_meta; seems unncessary

6735af6

remove incorrect kwarg

455b10a

sayakpaul requested a review from DN6 February 14, 2024 03:57

sayakpaul added the refactor label Feb 14, 2024

style to avoid extra unnecessary line breaks

d6de774

DN6 reviewed Feb 14, 2024

View reviewed changes

DN6 approved these changes Feb 14, 2024

View reviewed changes

sayakpaul merged commit 4343ce2 into main Feb 14, 2024
15 checks passed

sayakpaul deleted the single-file-ckpt-model-loading branch February 14, 2024 05:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Core] Harmonize single file ckpt model loading #6971

[Core] Harmonize single file ckpt model loading #6971

sayakpaul commented Feb 14, 2024

HuggingFaceDocBuilderDev commented Feb 14, 2024

DN6 Feb 14, 2024

sayakpaul Feb 14, 2024

DN6 Feb 14, 2024

DN6 left a comment

[Core] Harmonize single file ckpt model loading #6971

[Core] Harmonize single file ckpt model loading #6971

Conversation

sayakpaul commented Feb 14, 2024

What does this PR do?

HuggingFaceDocBuilderDev commented Feb 14, 2024

DN6 Feb 14, 2024

Choose a reason for hiding this comment

sayakpaul Feb 14, 2024

Choose a reason for hiding this comment

DN6 Feb 14, 2024

Choose a reason for hiding this comment

DN6 left a comment

Choose a reason for hiding this comment