
sd-interim-bayesian-merger

An even more opinionated fork of the original repo by s1dlx, currently undergoing a transformation, hence the name.

Since s1dlx became absent, I started updating the project with things that were missing at the time. That work eventually grew into ideas of my own, which I'll be gradually introducing into this fork.

Don't expect it to always be in a working state, but I generally try to only push when it is. I might still miss things, though.

Everything was done with Gemini and some advice/ideas from ljleb.

Stuff that's new:

  • entirely different merge backend: mecha (everyone say thank you to ljleb for boosting the merge game)
  • pick what components to optimize
  • group/select components/blocks in various ways, customize their bounds
  • new juicer scorer
  • both a1111 and forge support (comfy and swarm eventually)
  • ability to skip during manual scoring

Planned:

  • more (and better) visualizations
  • ability to define custom bounds/behavior for a hyper of choice (e.g. optimizing hypers that behave as true/false)
  • switching between manual and automatic scoring with hotkeys
  • more hotkeys for more behavior like early stopping and other qol
  • adjusting batch size and payload selection during optimization
  • rethinking scoring: categories, a character objective, perceptual similarity metrics (LPIPS)
  • probably trying optuna first, then hyperactive sometime later
  • more that I can't remember or will randomly come up with

wip text


What is this?

An opinionated take on automatic optimisation of stable-diffusion model merging.

The main idea is to treat the model-merging procedure as a black-box function with 26 parameters: one for each block plus base_alpha. We can then apply black-box optimisation techniques; in particular, we focus on Bayesian optimisation with a Gaussian Process emulator. Read more here, here and here.
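For illustration, here is a minimal sketch of that black-box framing using the bayesian-optimization package. The merge_and_score function, the parameter names, and the [0, 1] bounds are assumptions for the sketch, not necessarily how this repo wires things up internally:

```python
from bayes_opt import BayesianOptimization

def merge_and_score(**weights):
    # Hypothetical stand-in for the real pipeline: merge the two models
    # with these weights, generate images, score them, and return the
    # aggregate score. A dummy value keeps the sketch runnable.
    return -sum((w - 0.5) ** 2 for w in weights.values())

# 26 parameters: one weight per block plus base_alpha, each bounded in [0, 1].
pbounds = {f"block_{i}": (0.0, 1.0) for i in range(25)}
pbounds["base_alpha"] = (0.0, 1.0)

optimizer = BayesianOptimization(f=merge_and_score, pbounds=pbounds, random_state=1)
```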

The optimisation process is split into two phases:

  1. exploration: here we sample the 26-parameter hyperspace of block weights (at random for now, with some heuristics in the future). The number of samples is set by the --init_points argument. We use each set of weights to merge the two models, then use the merged model to generate batch_size * number of payloads images, which are then scored.
  2. exploitation: based on the exploratory phase, the optimiser forms an idea of where (i.e. at which set of weights) the optimal merge lies. This information is used to sample --n_iters more sets of weights. This time we don't sample them all in one go: we sample once, merge the models, generate and score the images, and update the optimiser's knowledge of the merging space. This way the optimiser can adapt its strategy step by step.

At the end of the exploitation phase, the set of weights with the highest score is deemed the optimal one.
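Continuing the sketch above, both phases map onto a single maximize call in that package, where init_points and n_iter play the roles of the --init_points and --n_iters arguments described here:

```python
# optimizer as constructed in the sketch above.
# Exploration: init_points sets of weights sampled at random.
# Exploitation: n_iter sequential steps; after each merge-generate-score
# round, the Gaussian Process is updated before suggesting the next set.
optimizer.maximize(init_points=10, n_iter=50)

print(optimizer.max)  # the best score found and its set of weights
```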

OK, How Do I Use It In Practice?

Head to the wiki for all the instructions to get you started.

