
[WIP] Generation backend rewrite #6577

Draft
wants to merge 25 commits into main

Conversation

StAlKeR7779
Contributor

Summary

Rewrite of the generation backend. At this point the backend has been rewritten to the new architecture, but from the user's perspective nothing should change, since the backend converts arguments silently. Later, in a major update, we can take the changes further.
Recreated after a conversation with @dunkeroni, based on #6548 and his ideas.

What is done here:
Simplified the generation logic down to its most basic steps and added injection points (modifiers/overrides) that users can attach to in order to change the generation logic (see the sketch after the list below).
The following logic has been moved from the backend to extensions:

  • Latents preview during generation
  • Rescale CFG
  • Inpainting and inpaint-model handling
  • Guidance models: ControlNet, T2I Adapter, IP Adapter
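
As a rough illustration of the modifier idea (hypothetical names only, not this PR's actual API), an extension attaches callbacks to named injection points and the denoise loop invokes them at fixed places:

```python
from typing import Any, Callable


class ExtensionsManager:
    """Hypothetical registry: extensions attach callbacks to named injection points."""

    def __init__(self) -> None:
        self._modifiers: dict[str, list[Callable[[Any], None]]] = {}

    def add_modifier(self, point: str, callback: Callable[[Any], None]) -> None:
        self._modifiers.setdefault(point, []).append(callback)

    def run_modifiers(self, point: str, ctx: Any) -> None:
        # The denoise loop calls this at fixed points, e.g. "pre_step" / "post_step".
        for callback in self._modifiers.get(point, []):
            callback(ctx)


class PreviewExtension:
    """Hypothetical preview extension: observes latents after every denoising step."""

    def inject(self, manager: ExtensionsManager) -> None:
        manager.add_modifier("post_step", self.on_post_step)

    def on_post_step(self, ctx: Any) -> None:
        print(f"step {ctx.step_index}: latents shape {tuple(ctx.latents.shape)}")
```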

Related Issues / Discussions

None

QA Instructions

The old generation code has not been removed yet, so you can patch the `if` in `latents_from_embeddings` to run the old logic and check/compare the results.

Merge Plan

Add tiled generation support.
Remove old backend code, which currently remains for testing.

Checklist

  • The PR has a short but descriptive title, suitable for a changelog
  • Tests added / updated (if applicable)
  • Documentation added / updated (if applicable)

@dunkeroni @RyanJDick

@github-actions bot added the python (PRs that change python files) and backend (PRs that change backend files) labels on Jul 3, 2024
@RyanJDick
Collaborator

I took an initial read through this today. Before I leave any comments on the code, I think some more design discussion is warranted. I'll start the conversation here - but a new feature proposal ticket might be a better place for this.

First, let's clarify the goals with this PR. To me:

  1. Separation of code by feature for improved readability.
  2. Make it easier to add new features.
  3. Provide an interface for community node contributors to import and extend the diffusion process.

Is this list accurate? How important is 3? We'd go about this differently if we drop 3 as a goal.

Next, here are my top concerns with the current direction:

  • It is hard to anticipate how new features will want to modify the diffusion process. I fear that we will have to keep adding new injection points to support future features. If this happens, the abstraction of injection points will become a hindrance rather than a benefit.
  • Extensions may interfere with one another. It may become more difficult to handle interactions between extensions.
  • If we are aiming for Goal 3, then we need a clearer proposal for how this is going to work. Currently, diffusers_pipeline.py is not part of the public API, and it probably shouldn't be.

Let me know how you are thinking about these things. I'd like to get alignment in some form of design document before we get into the code on this one.

I'm not trying to hold up this effort. I just want to make sure that we've considered it thoroughly. If we're not careful, I think there is a real risk that we add a bunch of complexity and don't achieve the positive outcomes that we are aiming for.

@dunkeroni
Contributor

I would argue that 3 is vital, and not significantly more effort than 2. Invoke's position is already to not provide official support for everything under the sun, and requiring that node authors either make a clone of diffusers_pipeline.py or go use ComfyUI instead is a non-starter. Those are the only other options for the sort of features that this enables. 3 is also unavoidable if we want to make new features easier to add for official functions. There will need to be some generalized extension input on the node if we want to have support for ControlNet, and Controlllite, and ICLight, and the rest. Otherwise get ready for a Denoise Latents node with three dozen input connectors a few years from now.

This also isn't a new or uncharted idea. This PR is a (more technically advanced) version of the Modular Denoise Latents nodes that I have been working on and experimenting with for 8 months now. Those nodes were created because it was becoming infuriatingly complex to implement new research papers in Invoke while comparing or combining their effects. The architecture has been reworked a few times already to facilitate my many (often failed) experiments. As an example, our current compositing implementation in Canvas exists because of tests in that node pack.

As for interactions between extensions: we do not need to guarantee that all extensions are cross-compatible. We already don't guarantee that all nodes work with each other, aside from type restrictions on what gets handed between them. The only hard restriction is that extensions which override and change the structure of the pipeline (e.g. tiled generation) cannot run with other extensions that attempt to override the same point (see the sketch below). Most extensions only modify the data between process steps, and very few of them would actually break any others.
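
A minimal sketch of that restriction, assuming a hypothetical registry where each override point can be claimed by at most one extension (the names are illustrative, not the PR's actual API):

```python
from typing import Any, Callable


class OverrideRegistry:
    """Hypothetical: overrides replace pipeline structure, so only one per point."""

    def __init__(self) -> None:
        self._overrides: dict[str, Callable[..., Any]] = {}

    def add_override(self, point: str, override: Callable[..., Any]) -> None:
        # Two extensions overriding the same point (e.g. the step loop for
        # tiled generation) cannot coexist, so registration fails loudly.
        if point in self._overrides:
            raise ValueError(f"Override point '{point}' is already claimed")
        self._overrides[point] = override
```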

@github-actions bot added the invocations (PRs that change invocations) label on Jul 6, 2024
@StAlKeR7779
Contributor Author

StAlKeR7779 commented Jul 8, 2024

From my perspective, 3 is not the main goal of this PR; I am mostly thinking about simplification and making further edits easier.
Still, I agree with @dunkeroni about exposing this API to users, but I suggest doing it in these steps:

  1. In this PR we assume it is not shared, since we do not provide an extensions input on the node for users
  2. When we make that big change and provide an extensions input, we can say that the API is still unstable/beta and may still change
  3. Later, after looking at more extensions (ic-light, cnet++, ...), we can say with more confidence whether the API is still unstable

While writing this, I saw only two points in the API that are problematic or could change:

  1. What if we want to patch the UNet on the CPU, for example to trade speed for memory?
  2. What if an extension changes the attention processing, like IP-Adapter? We can't abstract the attention processor with confidence while we have only one such extension

Other than this, I feel the API should already be stable, and going forward injection points will only be added, not changed.

@RyanJDick
Collaborator

I'm excited about how this is progressing 🚀

I took a stab at working backwards from this draft PR to a design document so that we can align on some of the important decisions being made here: https://www.notion.so/invokeai/Modular-Stable-Diffusion-Backend-Design-Document-e8952daab5d5472faecdc4a72d377b0d

DM me your email on Discord (ryanjdick) and I'll share the document with you.

@StAlKeR7779 @dunkeroni I'm hoping that you two can review that doc and help with filling in some of the incomplete/unresolved sections.

We have to treat this PR with some extra care, given its scale and scope, but I think the effort will be worth it.

@StAlKeR7779 mentioned this pull request on Jul 12, 2024
RyanJDick added a commit that referenced this pull request Jul 19, 2024
## Summary

Base code of the new modular backend from #6577.
Contains normal generation and regional prompt support.
A preview extension is also included, to test that the extension logic works.

## Related Issues / Discussions


https://invokeai.notion.site/Modular-Stable-Diffusion-Backend-Design-Document-e8952daab5d5472faecdc4a72d377b0d

## QA Instructions

Run with and without the `USE_MODULAR_DENOISE` environment variable set.
Currently only normal and regional conditioning are supported, so just
generate some images and compare with the output on main.

## Merge Plan

Discuss the injection point names a bit more? For example, if the UNet
becomes overridable in the future, the current `pre_unet`/`post_unet`
implies naming that override `unet`, which feels a bit odd.
Also `apply_cfg`: a future implementation could ignore or not use CFG, in
which case `combine_noise_predictions`/`combine_noise` seems more
suitable (a sketch of what this combine step computes follows below).
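
For context, a minimal sketch of the classifier-free guidance combine step as it is usually written (illustrative only; the name `combine_noise_predictions` follows the suggestion above, not the current code):

```python
import torch


def combine_noise_predictions(
    negative_noise_pred: torch.Tensor,
    positive_noise_pred: torch.Tensor,
    guidance_scale: float,
) -> torch.Tensor:
    # Classifier-free guidance: move the prediction away from the negative
    # (unconditioned) prediction toward the positive (conditioned) one.
    return negative_noise_pred + guidance_scale * (
        positive_noise_pred - negative_noise_pred
    )
```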

## Checklist

- [x] _The PR has a short but descriptive title, suitable for a
changelog_
- [x] _Tests added / updated (if applicable)_
- [ ] _Documentation added / updated (if applicable)_
RyanJDick added a commit that referenced this pull request Jul 23, 2024
## Summary

Rescale CFG code from #6577.

## Related Issues / Discussions

#6606 

https://invokeai.notion.site/Modular-Stable-Diffusion-Backend-Design-Document-e8952daab5d5472faecdc4a72d377b0d

## QA Instructions

Run with and without the `USE_MODULAR_DENOISE` environment variable set.
~~Note: for some reason the output differs slightly from run to run, but I
was sometimes able to get the same output on main and this branch.~~
The fix is presented in #6641.

## Merge Plan

~~Nope.~~ Merge #6641 first, to be able to see the output difference
properly.
If you think there should be some kind of tests, feel free to add them.

## Checklist

- [x] _The PR has a short but descriptive title, suitable for a
changelog_
- [ ] _Tests added / updated (if applicable)_
- [ ] _Documentation added / updated (if applicable)_
RyanJDick added a commit that referenced this pull request Jul 23, 2024
## Summary

FreeU code from #6577.
Also fixes an issue where the output was sometimes slightly different between runs.

## Related Issues / Discussions

#6606 

https://invokeai.notion.site/Modular-Stable-Diffusion-Backend-Design-Document-e8952daab5d5472faecdc4a72d377b0d

## QA Instructions

Run with and without the `USE_MODULAR_DENOISE` environment variable set.

## Merge Plan

Nope.
If you think there should be some kind of tests, feel free to add them.

## Checklist

- [x] _The PR has a short but descriptive title, suitable for a
changelog_
- [ ] _Tests added / updated (if applicable)_
- [ ] _Documentation added / updated (if applicable)_
RyanJDick added a commit that referenced this pull request Jul 23, 2024
## Summary

ControlNet code from #6577.

## Related Issues / Discussions

#6606

https://invokeai.notion.site/Modular-Stable-Diffusion-Backend-Design-Document-e8952daab5d5472faecdc4a72d377b0d

## QA Instructions

Run with and without the `USE_MODULAR_DENOISE` environment variable set.

## Merge Plan

Merge #6641 first, to be able to see the output difference properly.
If you think there should be some kind of tests, feel free to add them.

## Checklist

- [x] _The PR has a short but descriptive title, suitable for a
changelog_
- [ ] _Tests added / updated (if applicable)_
- [ ] _Documentation added / updated (if applicable)_
RyanJDick added a commit that referenced this pull request Jul 28, 2024
## Summary

Seamless code from #6577.

## Related Issues / Discussions

#6606 

https://invokeai.notion.site/Modular-Stable-Diffusion-Backend-Design-Document-e8952daab5d5472faecdc4a72d377b0d

## QA Instructions

Run with and without the `USE_MODULAR_DENOISE` environment variable set.

## Merge Plan

Nope.
If you think there should be some kind of tests, feel free to add them.

## Checklist

- [x] _The PR has a short but descriptive title, suitable for a
changelog_
- [ ] _Tests added / updated (if applicable)_
- [ ] _Documentation added / updated (if applicable)_
RyanJDick added a commit that referenced this pull request Jul 28, 2024
## Summary

T2I Adapter code from #6577.

## Related Issues / Discussions

#6606 

https://invokeai.notion.site/Modular-Stable-Diffusion-Backend-Design-Document-e8952daab5d5472faecdc4a72d377b0d

## QA Instructions

Run with and without the `USE_MODULAR_DENOISE` environment variable set.

## Merge Plan

Nope.
If you think there should be some kind of tests, feel free to add them.

## Checklist

- [x] _The PR has a short but descriptive title, suitable for a
changelog_
- [ ] _Tests added / updated (if applicable)_
- [ ] _Documentation added / updated (if applicable)_
RyanJDick added a commit that referenced this pull request Jul 29, 2024
## Summary

Code for inpainting and inpaint model handling from
#6577.
Separated into 2 extensions, as briefly discussed before, so this awaits
discussion of that implementation.

## Related Issues / Discussions

#6606

https://invokeai.notion.site/Modular-Stable-Diffusion-Backend-Design-Document-e8952daab5d5472faecdc4a72d377b0d

## QA Instructions

Run with and without the `USE_MODULAR_DENOISE` environment variable set.
Compare outputs between the backends in the following cases:
- Normal generation with an inpaint model
- Inpainting with an inpaint model
- Inpainting with a normal model

## Merge Plan

Nope.
If you think there should be some kind of tests, feel free to add them.

## Checklist

- [x] _The PR has a short but descriptive title, suitable for a
changelog_
- [ ] _Tests added / updated (if applicable)_
- [ ] _Documentation added / updated (if applicable)_
RyanJDick added a commit that referenced this pull request Jul 31, 2024
## Summary

Code for LoRA patching from #6577.
Additionally, it was implemented so that a LoRA can patch not only `weight`
but also `bias`, because I saw some LoRAs that do this (see the sketch below).
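
A minimal sketch of the idea, assuming a LoRA whose low-rank `up @ down` product patches the weight and that optionally ships a bias delta (illustrative only, not the PR's actual patcher code):

```python
from typing import Optional

import torch


def apply_lora_patch(
    module: torch.nn.Linear,
    up: torch.Tensor,        # (out_features, rank)
    down: torch.Tensor,      # (rank, in_features)
    bias_delta: Optional[torch.Tensor],
    scale: float,
) -> None:
    """Add a scaled low-rank delta to the weight, and optionally patch the bias."""
    with torch.no_grad():
        module.weight += scale * (up @ down)
        if bias_delta is not None:
            if module.bias is None:
                module.bias = torch.nn.Parameter(torch.zeros_like(bias_delta))
            module.bias += scale * bias_delta
```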

## Related Issues / Discussions

#6606 

https://invokeai.notion.site/Modular-Stable-Diffusion-Backend-Design-Document-e8952daab5d5472faecdc4a72d377b0d

## QA Instructions

Run with and without the `USE_MODULAR_DENOISE` environment variable set.

## Merge Plan

Replace the old LoRA patcher with the new one after the review is done.
If you think there should be some kind of tests, feel free to add them.

## Checklist

- [x] _The PR has a short but descriptive title, suitable for a
changelog_
- [ ] _Tests added / updated (if applicable)_
- [ ] _Documentation added / updated (if applicable)_