
FIX [PEFT / Core] Copy the state dict when passing it to load_lora_weights #7058

Merged

Conversation

@younesbelkada (Contributor) commented Feb 22, 2024

What does this PR do?

As per title

Fixes: #7054

There is no reason not to copy the state dict of the LoRA layers when one passes a dict into load_lora_weights; copying avoids silently modifying the passed state_dict in place. Also added a test with a state dict pushed under hf-internal-testing. A minimal sketch of the idea is shown below.

cc @yiyixuxu @sayakpaul @pacman100
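
For illustration only, a minimal sketch of the idea behind the fix (simplified; the real load_lora_weights does far more, and the exact copy call here is an assumption rather than a quote of the diff):

```python
import copy


class LoraLoaderMixin:
    def load_lora_weights(self, pretrained_model_name_or_path_or_dict, adapter_name=None, **kwargs):
        # If the user handed us a dict, work on a shallow copy: the conversion
        # helpers pop() keys from it, and without the copy those pops would
        # silently empty the caller's state_dict.
        if isinstance(pretrained_model_name_or_path_or_dict, dict):
            pretrained_model_name_or_path_or_dict = copy.copy(pretrained_model_name_or_path_or_dict)
        # ... the usual resolution / conversion / loading logic would follow here ...
```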

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@BenjaminBossan (Member) left a comment

In general, this LGTM, thanks for working on this.

I tried to track down the source of the mutation. If I'm not missing something, the culprits seem to be _maybe_map_sgm_blocks_to_diffusers and _convert_kohya_lora_to_diffusers here because they pop from the state_dict. I wonder if it wouldn't be better to create the shallow copy in these two functions. The advantage would be that if we call them from somewhere else, the state_dict is still not mutated. As is, we are only safe if we go via load_lora_weights. Right now, AFAICT, that's the only function that calls them, but this could change in the future.
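
A rough sketch of that alternative (the helper below is a simplified, hypothetical stand-in for _maybe_map_sgm_blocks_to_diffusers / _convert_kohya_lora_to_diffusers, not their real signatures):

```python
def _convert_lora_state_dict(state_dict):
    # Shallow-copy at the top of the conversion helper itself, so every caller
    # is protected from mutation, not just load_lora_weights.
    state_dict = dict(state_dict)

    converted = {}
    for key in list(state_dict.keys()):
        value = state_dict.pop(key)  # pops now only shrink the local copy
        converted[key] = value
    return converted
```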

@sayakpaul (Member) left a comment

In general, this looks good to me. But I concur with @BenjaminBossan’s notes. Could we approach the PR along those lines? @yiyixuxu what are your thoughts?

@@ -1727,6 +1729,20 @@ def test_load_unload_load_kohya_lora(self):
self.assertTrue(np.allclose(lora_images, lora_images_again, atol=1e-3))
release_memory(pipe)

def test_empty_state_dict(self):
Member commented:

Thank you but can we maybe add this as a fast test with smaller ckpts?

Contributor (PR author) commented:

I think having two slow tests is fine.

lcm_lora = load_file(cached_file)

pipe.load_lora_weights(lcm_lora, adapter_name="lcm")
self.assertTrue(lcm_lora != {})
Member commented:

WDYT of making the test more rigid by comparing the length of the state dict instead?

@BenjaminBossan what are your thoughts?

Member commented:

So basically remember the size before passing it and then ensure that it's the same after? I don't see why not.

Member commented:

Since load_file already gives you a dict, we could store the original state dict length with len(lcm_lora) and use the value for assertion.
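
In test form, that suggestion would look roughly like this (a sketch reusing the fixtures of the surrounding slow test, i.e. pipe and cached_file are assumed to exist):

```python
lcm_lora = load_file(cached_file)
num_keys_before = len(lcm_lora)

pipe.load_lora_weights(lcm_lora, adapter_name="lcm")

# Stricter than `lcm_lora != {}`: the dict must keep every key it had.
self.assertEqual(len(lcm_lora), num_keys_before)
```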

Collaborator commented:

IMO we can just make another test_load_unload_load test, but with a state dict, where we call pipe.load_lora_weights(lcm_lora, adapter_name="lcm") twice and make sure it gives the same results; that way we make sure we can re-use the state dict we passed (sketched below).
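
Sketched out, such a test could look like the following (again reusing the surrounding test fixtures pipe and cached_file plus the existing image-comparison pattern; the prompt and inference settings here are placeholders):

```python
lcm_lora = load_file(cached_file)

pipe.load_lora_weights(lcm_lora, adapter_name="lcm")
images = pipe(prompt, num_inference_steps=4, generator=torch.manual_seed(0)).images

pipe.unload_lora_weights()

# Loading again from the very same dict must still work and reproduce the
# outputs; this fails if the first call emptied lcm_lora in place.
pipe.load_lora_weights(lcm_lora, adapter_name="lcm")
images_again = pipe(prompt, num_inference_steps=4, generator=torch.manual_seed(0)).images

self.assertTrue(np.allclose(images, images_again, atol=1e-3))
```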

Contributor (PR author) commented:

Added the test!

@yiyixuxu (Collaborator) left a comment

thank you!


@younesbelkada (Contributor, PR author) commented:

Thanks everyone for the review! Merging for now. If we see any other similar issue in the future, I'd be happy to refactor this a bit as suggested by Benjamin.

@younesbelkada merged commit 8a69273 into huggingface:main on Feb 27, 2024 (13 checks passed).
@younesbelkada deleted the fix-peft-state-dict-issue branch on February 27, 2024 at 01:42.
Linked issue closed by this PR: LoraLoaderMixin.load_lora_weights() empties state_dict passed as input param. (#7054)