
Preliminary confirmation: Image gen now runs on 4 GB cards #486

Closed
lstein opened this issue Sep 10, 2022 · 5 comments

Comments

@lstein
Collaborator

lstein commented Sep 10, 2022

I just merged in a patch from @mh-dm which appears to dramatically reduce the memory requirements for loading the model. Instead of using 4.3 GB in my benchmarks, the patched version requires just 2.17 GB. Along with the image generation optimization, this makes me hopeful that the system will now run on 4 GB cards.

If anyone would like to test this, please check out the "development" branch.
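A roughly 2x cut in load memory is what you would expect from storing weights in half precision instead of float32. Whether that is what the mh-dm patch actually does is an assumption, and the parameter count below is likewise an assumption (roughly Stable Diffusion v1 scale), but the arithmetic lines up with the numbers quoted above:

```python
# Back-of-the-envelope sketch only. The parameter count is an ASSUMPTION,
# not taken from the repository, and fp32 -> fp16 is just one common way
# to halve the memory needed to hold the raw weights.

GB = 1e9  # decimal gigabytes, matching the figures quoted above

def model_memory_gb(num_params: int, bytes_per_param: int) -> float:
    """Memory needed just to hold the raw weights, in GB."""
    return num_params * bytes_per_param / GB

params = 1_070_000_000  # assumed parameter count, ~1.07 billion

print(f"fp32: {model_memory_gb(params, 4):.2f} GB")  # 4.28 GB
print(f"fp16: {model_memory_gb(params, 2):.2f} GB")  # 2.14 GB
```

With 4 bytes per weight this gives about 4.28 GB and with 2 bytes about 2.14 GB, consistent with the 4.3 GB and 2.17 GB benchmarks reported above.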

@willcohen

Confirmed! The development branch works on a Windows machine with a 4 GB card! Prompts using dream.py generate images successfully!

[Screenshot: Screen Shot 2022-09-10 at 7 21 00 PM]

@willcohen

willcohen commented Sep 10, 2022

@lstein I think this can be closed -- this is absolutely fantastic. I'm getting roughly 2 minutes for a 50-step basic txt2img prompt via dream with default settings, and roughly 9 minutes for the same prompt with 250 steps, on a Quadro P1000, which is certainly no speed demon of a card. I'd call this an excellent result, considering that many more machines just got access to all of this functionality!

@lstein
Collaborator Author

lstein commented Sep 11, 2022

This is fantastic news. Thanks for the confirmations!

@lstein lstein changed the title To Test: model loading optimization may allow model to be loaded on 4 GB cards Preliminary confirmation: Image gen now runs on 4 GB cards Sep 11, 2022
@Kolaer

Kolaer commented Sep 14, 2022

I can confirm that image generation works on 4 GB cards:


However, to run it I had to remove the memory check in attention.py
https://github.com/lstein/stable-diffusion/blob/89da371f4841f7e05da5a1672459d700c3920784/ldm/modules/attention.py#L248

It ran fine without it, so I wonder if the check is incorrect/incomplete.

Still amazed that it's now possible to generate high-quality 512x512 images on a laptop's GPU.
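The actual check at attention.py#L248 may work differently; the sketch below is a hypothetical illustration of why a pre-flight VRAM check of this kind can be too conservative. A check that demands room for the full attention matrix at once will reject configurations that run fine when attention is computed in slices, which would explain Kolaer's observation. All function names and numbers here are illustrative assumptions:

```python
# Hypothetical sketch, NOT the actual code from ldm/modules/attention.py.
# Shows how a worst-case memory check can fail while a sliced computation
# would actually fit.

def attention_bytes(batch: int, heads: int, tokens: int, bytes_per_el: int = 2) -> int:
    """Worst-case size of the full (tokens x tokens) attention matrix."""
    return batch * heads * tokens * tokens * bytes_per_el

def naive_check(free_bytes: int, batch: int, heads: int, tokens: int) -> bool:
    """Overly strict: demands room for the whole attention matrix at once."""
    return free_bytes >= attention_bytes(batch, heads, tokens)

def sliced_check(free_bytes: int, batch: int, heads: int, tokens: int, slices: int) -> bool:
    """If attention is computed in `slices` chunks, only 1/slices is live."""
    return free_bytes >= attention_bytes(batch, heads, tokens) // slices

# A 512x512 image becomes a 64x64 latent, i.e. 4096 tokens; assume 8 heads
# and fp16 elements, with only 128 MiB of VRAM currently free.
free = 128 * 1024**2
print(naive_check(free, 1, 8, 4096))       # False: 256 MiB needed at once
print(sliced_check(free, 1, 8, 4096, 4))   # True: 64 MiB per slice fits
```

Under these assumed numbers the naive check refuses to run even though the sliced computation fits comfortably, which matches the symptom of a check that is "incorrect/incomplete" rather than the generation itself running out of memory.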

@Vargol
Contributor

Vargol commented Sep 15, 2022

I've noticed that for me the model now takes about 50% longer to load (60 seconds compared to 40 seconds). I was too busy testing the VRAM fixes for image generation to look into it until recently. @lstein, can you point me to the patch for this so I can have a look sometime?

@lstein lstein closed this as completed Sep 17, 2022