Auto mask uniform background base on PR #589 mask loss #1114

gesen2egee · 2024-02-11T11:45:34Z

(1) Change the mechanism for caching_latent_to_disk by directly storing the mask in the NPZ file. Actually, due to trim_and_resize, it was impossible to match the required size, made cache_latent_to_disk unusable.

(2) Modify the way of color augmentation to preserve the original alpha channel unchanged, which should allow for more accurate handling of the mask.

(3) Add --mask_simple_background. Enable auto-masking of latent loss based on the dominant edge color if it occupies more than 30% of the image edges. This helps in focusing the model on the main content by ignoring simple or uniform background colors such as solid white or black.

I think this will help improve the quality of datasets with a high proportion of white backgrounds, transparent backgrounds, and simple backgrounds in anime images.

This is a simpler feature implementation. It might be possible to add automatic masking for faces (for clothing training), characters, etc. However, because automatically processed masks are harder to inspect and would introduce a significant amount of additional requirements, it might be better to use the script I wrote or the webui's rembg to pre-generate masks for manual inspection.

implement mask loading from mask folder

# Conflicts: # fine_tune.py # sdxl_train.py # train_db.py # train_network.py # train_textual_inversion.py # train_textual_inversion_XTI.py

# Conflicts: # fine_tune.py # sdxl_train.py

kohya-ss · 2024-02-12T12:50:52Z

Thank you for this PR.

As I wrote before in #589 , I've implemented ControlNetDataset which has conditioning data recently. I believe the ControlNetDataset can have not only canny or pose control images but also mask images.

I think it might be an idea to add a capability to handle ControlNetDataset for each training script. However, it will take some time to implement.

So, please let me carefully consider how to handle this PR. I hope you will understand.

I also think --mask_simple_background is simple but interesting idea. It seems to be promising.

zapp and others added 16 commits October 9, 2023 13:08

implement masked loss for LoRA, Textual Inversion, Dreambooth & others

7de0550

implement mask loading from mask folder

add back mask rescale by mean

a34222a

fix mask loading when cache_latents=False or cache_latents_to_disk=True

b0bff65

Merge branch 'main' into masked-loss-rebase

610f8e0

Merge remote-tracking branch 'origin/dev' into masked-loss-rebase

45e64b1

# Conflicts: # fine_tune.py # sdxl_train.py # train_db.py # train_network.py # train_textual_inversion.py # train_textual_inversion_XTI.py

fix bugs

b2adcbb

Merge branch 'dev' into masked-loss-rebase

5680057

# Conflicts: # fine_tune.py # sdxl_train.py

Change transparent background to white background

94f55ed

Merge branch 'pr/589' into auto_mask

5f6c5ff

use mask loss PR kohya-ss#589 and save mask to npz

041bd93

don't change alpha channels in color_aug

9e1026d

Update train_util.py

7a79d6c

Merge branch 'RGBA-background' into auto_mask

f56674b

mask_simple_background

27e411f

change function name

6e07cae

Update train_util.py

ffee40b

gesen2egee closed this Feb 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Auto mask uniform background base on PR #589 mask loss #1114

Auto mask uniform background base on PR #589 mask loss #1114

gesen2egee commented Feb 11, 2024 •

edited

Loading

kohya-ss commented Feb 12, 2024

Auto mask uniform background base on PR #589 mask loss #1114

Auto mask uniform background base on PR #589 mask loss #1114

Conversation

gesen2egee commented Feb 11, 2024 • edited Loading

kohya-ss commented Feb 12, 2024

gesen2egee commented Feb 11, 2024 •

edited

Loading