-
Notifications
You must be signed in to change notification settings - Fork 5.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Tracker] use micro-conditioning for the SDXL trainers #6586
Comments
@jiqing-feng, would you like to take Textual inversion SDXL? @linoytsaban would you like to take Advanced SDXL trainer? |
Hey @sayakpaul what's the min requirement for GPU while testing for the training? |
DreamBooth SDXL LoRA should run in Colab. Refer to the associated readme for more details. |
sure! |
Sorry for the delay, I have been quite busy recently; I will do it once I have time. |
Can it run locally? |
You need to repurpose the code accordingly for that. |
Hello @sayakpaul, I would love to work on ControlNet SDXL. Thank you. |
Please go ahead. |
Hi @sayakpaul . I think the micro conditioning is already in the example, see conditioning_image_size and conditioning_crop_size, and they are all in the add_time_ids. BTW, I found a minor error about the resize function, and I already fixed it in #7095 |
You are right. Thanks for the quick fix. |
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread. Please note that issues that do not follow the contributing guidelines are likely to be ignored. |
SDXL makes use of micro-conditioning, and it does have quite a bit of an effect on the end results. For more details, refer to the paper here.
Currently, not all of our SDXL trainers don't make use of micro-conditioning. So, it'd be nice to have micro-conditioning support as in https://github.com/huggingface/diffusers/blob/main/examples/text_to_image/train_text_to_image_lora_sdxl.py.
Below is a list of the training scripts where we'd like to have this change incorporated:
Feel free to open PRs targeting only ONE example at a time and tag me. Please also share an example training command while submitting the PRs. The command doesn't have to run the training for a large number of steps. Anything in the range of [4, 10] should suffice.
The text was updated successfully, but these errors were encountered: