(Updated 02/17) New research: ControlNet - Adding Conditional Control to Text-to-Image Diffusion Models #7732
-
This needs to be given more attention, as I personally believe this can supercharge the webui. Here are some more examples of what a lot of people may be interested in as well. Also, the depth-to-image model is apparently better than SD 2.0's, as the mask part is trained at a higher resolution.
-
I tested it and it's amazing! Each tool is very powerful and produces results that are faithful to the input image and pose. In particular, pose2image was able to capture poses much better and create accurate images compared to depth models.
-
I tried to test it, but got an OOM error. I guess 6 GB of VRAM isn't enough; maybe once it's an extension it will benefit from auto's optimizations?
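In the meantime, one thing worth trying with the standalone repo: its config.py exposes a memory-saving flag that the gradio demos check to shuffle model parts between CPU and GPU during sampling (flag name as of the repo at the time of writing; this is for the standalone repo, not the webui):

```python
# In ControlNet/config.py (standalone repo):
save_memory = True  # default is False; the gradio demos check this flag
                    # and shift submodules between CPU and GPU to cut VRAM use
```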
-
looks great
-
And merging with other models is also possible!
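For anyone wondering what that looks like in practice, here is a rough sketch of the difference-transfer idea (the ControlNet repo includes its own tooling for this; the file names and the state-dict key mapping below are illustrative assumptions, not the repo's exact code):

```python
# Sketch: apply a ControlNet trained against base SD 1.5 to a different
# fine-tuned checkpoint by adding the (custom - base) weight delta to the
# control branch. File names and key mapping are assumptions for illustration.
import torch

base_sd = torch.load("v1-5-pruned.ckpt", map_location="cpu")["state_dict"]
custom_sd = torch.load("my_finetune.ckpt", map_location="cpu")["state_dict"]
control_sd = torch.load("control_sd15_openpose.pth", map_location="cpu")

transferred = {}
for key, weight in control_sd.items():
    # ControlNet's trainable branch mirrors the UNet encoder, so for every
    # weight it shares with the base model, shift it by the fine-tune's delta.
    base_key = key.replace("control_model.", "model.diffusion_model.")  # assumed mapping
    if base_key in base_sd and base_key in custom_sd:
        transferred[key] = weight + (custom_sd[base_key] - base_sd[base_key])
    else:
        transferred[key] = weight  # zero convs / hint encoder have no base counterpart

torch.save(transferred, "control_my_finetune_openpose.pth")
```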
-
I need this, popular or not!
-
Just a PoC: I wrote a simple extension that brings ControlNet into the webui. Not all features are supported, but at least it works.
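For anyone curious about the shape such an extension takes, here is a minimal sketch against the webui's script API; the ControlNet-specific hook (apply_controlnet) is a hypothetical placeholder, not the actual extension's code:

```python
# extensions/<name>/scripts/controlnet_poc.py
# Minimal skeleton of an always-on webui extension script.
import gradio as gr
import modules.scripts as scripts

class ControlNetPoC(scripts.Script):
    def title(self):
        return "ControlNet PoC"

    def show(self, is_img2img):
        # AlwaysVisible renders the UI as a panel on both txt2img and img2img.
        return scripts.AlwaysVisible

    def ui(self, is_img2img):
        with gr.Accordion("ControlNet PoC", open=False):
            enabled = gr.Checkbox(label="Enable", value=False)
            control_image = gr.Image(label="Control image", type="pil")
        # Whatever is returned here arrives as extra arguments to process().
        return [enabled, control_image]

    def process(self, p, enabled, control_image):
        # Runs before sampling: encode the hint image and patch the
        # ControlNet residuals into the UNet forward pass here.
        if not enabled or control_image is None:
            return
        apply_controlnet(p.sd_model, control_image)  # hypothetical helper
```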
-
Thanks to @Mikubill ❤ openpose ControlNet now works on Colab 🥳 I will add the others.
-
This thing is just revolutionary. I just released my tutorial for ControlNet, based on the extension: 16.) Automatic1111 Web UI
-
Two similar papers came out shortly after ControlNet did: https://github.com/arpitbansal297/Universal-Guided-Diffusion (https://arxiv.org/abs/2302.07121). Edit: T2I-Adapter integration has now been merged into the ControlNet extension. Mikubill/sd-webui-controlnet#140
-
ControlNet - Adding Conditional Control to Text-to-Image Diffusion Models
Paper: https://arxiv.org/abs/2302.05543
GitHub: https://github.com/lllyasviel/ControlNet
From the abstract, this provides a way to train task-specific conditioning even with small datasets (<50k samples), and as fast as fine-tuning a diffusion model locally (one of the examples presented shows a depth model that took only a week of training on a single 3090 Ti).
Several pre-trained models are provided, and training code is included in the GitHub repo to make your own. Highly recommend taking a look at the paper to understand what it's capable of.
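For a feel of the core trick from the paper (a frozen copy of a pretrained block plus a trainable copy, joined through zero-initialized 1x1 convolutions so training starts as an identity and the control signal grows in gradually), here is a simplified PyTorch sketch; the blocks are stand-ins, not the actual SD UNet:

```python
# Simplified sketch of ControlNet's locked-copy / trainable-copy structure.
import copy
import torch.nn as nn

def zero_conv(channels):
    # 1x1 conv initialized to zero: contributes nothing at the start of
    # training, so the pretrained model's behavior is preserved initially.
    conv = nn.Conv2d(channels, channels, kernel_size=1)
    nn.init.zeros_(conv.weight)
    nn.init.zeros_(conv.bias)
    return conv

class ControlledBlock(nn.Module):
    def __init__(self, pretrained_block, channels):
        super().__init__()
        self.locked = pretrained_block              # frozen pretrained weights
        for param in self.locked.parameters():
            param.requires_grad_(False)
        self.trainable = copy.deepcopy(pretrained_block)  # trainable copy
        self.zero_in = zero_conv(channels)          # injects the condition
        self.zero_out = zero_conv(channels)         # returns the residual

    def forward(self, x, condition):
        # y = locked(x) + zero_out(trainable(x + zero_in(condition)))
        residual = self.zero_out(self.trainable(x + self.zero_in(condition)))
        return self.locked(x) + residual
```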
Update
This is now supported in the webui via the extensions below (independent of each other):
https://github.com/Mikubill/sd-webui-controlnet
https://github.com/ThereforeGames/unprompted
A kind user also shared a Blender project to create openpose-like images, so you can skip the preprocessor step for the openpose controlnet. https://toyxyz.gumroad.com/l/ciojz
Update 2
T2I-Adapter integration has now been merged with the ControlNet extension. Mikubill/sd-webui-controlnet#140
Related, Style2Paints V5 preview (using a currently unreleased sketch-to-illustration model, based on the same research): https://github.com/lllyasviel/style2paints/tree/master/V5_preview