Releases · invoke-ai/InvokeAI

24 Jul 23:15

psychedelicious

v4.2.7rc1

3657285

v4.2.7rc1 Pre-release

Pre-release

v4.2.7rc1 includes gallery improvements and some major features focused on upscaling.

Upscaling

We've added a dedicated upscaling tab, support for custom upscaling models, and some new nodes.

Thanks to @RyanJDick (backend implementation), @chainchompa (frontend) and @maryhipp (frontend) for working on this!

Dedicated Upscaling Tab

The new upscaling tab provides a simple and powerful UI to Invoke's MultiDiffusion implementation. This builds on the workflow released in v4.2.6, allowing for memory-efficient upscaling to huge output image sizes.

Upscaling.Tab.mov

We're pretty happy with the results!

4x scale, 4x_NMKD-Siax_200k upscale model, Deliberate_v5 SD1.5 model, KDPM 2 scheduler @ 30 steps, all other settings default

Requirements

You need 3 models installed to use this feature:

An upscale model for the first pass upscale
A main SD model (SD1.5 or SDXL) for the image-to-image
A tile ControlNet model of the same model architecture as your main SD model

If you are missing any of these, you'll see a warning directing you to the model manager to install them. You can search the starter models for upscale, main, and tile to get you started.

Tips

The main SD model architecture has the biggest impact on VRAM usage. For example, SD1.5 @ 2k needs just under 4GB, while SDXL @ 2k needs just under 9GB. VRAM usage increases a small amount as output size increases - SD1.5 @ 8k needs ~4.5GB while SDXL @ 8k needs ~10.5GB.
The upscale and main SD model choices matter. Choose models best suited to your input image or desired output characteristics.
Some schedulers work better than others. KDPM 2 is a good choice.
LoRAs - like a detail-adding LoRA - can make a big impact.
Higher Creativity values give the SD model more leeway in creating new details. This parameter controls denoising start and end percentages.
Higher Structure values tell the SD model to stick closer to the input image's structure. This parameter controls the tile ControlNet.

Custom Upscaling Models

You can now install and use custom upscaling models in Invoke. The excellent spandrel library handles loading and running the models.

Custom.Upscaling.Models.mov

spandrel can do a lot more than upscaling - it supports a wide range of "image to image" models. This includes single-image super resolution like ESRGAN (upscalers) but also things like GFPGAN (face restoration) and DeJPEG (cleans up JPEG compression artifacts).

A complete list of supported architectures can be found here.

Note: We have not enabled the restrictively-licensed architectures, which are denoted with a + symbol in the list.

Installing Models

We've added a few popular upscaling models to the Starter Models tab in the Model Manager - search for "upscale" to find them.

You can install models found online via the Model Manager, just like any other model. OpenModelDB is a popular place to get these models. For most of them, you can copy the model's download link and paste in into the Model Manager to install.

Nodes

Two nodes have been added to support processing images with spandrel - be that upscaling or any of the other tasks these models support.

Image-to-Image - Runs the selected model without any extra processing.
Image-to-Image (Autoscale) - Runs the selected model repeatedly until the desired scale is reached. This node is intended for upscaling models specifically, providing some useful extra functionality:
- If the model overshoots the target scale, the final image will be downscaled to the target scale with Lanczos resampling.
- As a convenience, the output image width and height can be fit to a multiple of 8, as is required for SD. This will only resize down, and may change the aspect ratio slightly.
- If the model doesn't actually upscale the image, the scale parameter will be ignored.

Gallery Improvements

Thanks to @maryhipp and @chainchompa for continued iteration on the gallery!

Cleaner boards UI.
Improved boards and image search UI.
Fixed issues where board counts don't update when images are moved between boards.

Other Changes

Enhancement: When installing starter models, the description is carried over. Thanks @lstein!
Enhancement: Updated translations.
Fix: Model unpatching when running on CPU, causing bad/no outputs.
Fix: Occasional visible seams on images with smooth textures, like skies. MultiDiffusion tiling now uses gradient blending to mitigate this issue.
Fix: Model names overflow the model selection drop-downs.
Internal: Backend SD pipeline refactor (WIP). This will allow contributors to add functionality to Invoke more easily. This will be behind a feature flag until the refactor is complete and tested. Thanks to @StAlKeR7779 for leading the effort, with major contributions from @dunkeroni and @RyanJDick.

Installation and Updating

To install or update to v4.2.7rc1, download the installer and follow the installation instructions.

To update, select the same installation location. Your user data (images, models, etc) will be retained.

Missing models after updating from v3 to v4

See this FAQ.

Error during installation `ModuleNotFoundError: No module named 'controlnet_aux'`

See this FAQ

What's Changed

Add support for Spandrel Image-to-Image models (e.g. ESRGAN, Real-ESRGAN, Swin-IR, DAT, etc.) by @RyanJDick in #6556
Add tiling to SpandrelImageToImageInvocation by @RyanJDick in #6594
Add Spandrel upscale starter models by @RyanJDick in #6605
Fix model unpatching process when running on CPU by @lstein in #6631
Add gradient blending to tile seams in MultiDiffusion by @RyanJDick in #6635
Base of modular backend by @StAlKeR7779 in #6606
add sdxl tile to starter models by @maryhipp in #6634
Fix function call that we forgot to update in #6606 by @RyanJDick in #6636
Bump fastapi-events dependency by @ebr in #6644
fix: update uncategorized board totals when deleting and moving images by @chainchompa in #6646
feat(ui): upscaling tab by @maryhipp in #6633
Math Updates by @hipsterusername in #6648
Modular backend - add rescale cfg by @StAlKeR7779 in #6640
Modular backend - add FreeU by @StAlKeR7779 in #6641
Modular backend - add ControlNet by @StAlKeR7779 in #6642
feat(ui): add upsells for pro edition to settings menu by @maryhipp in #6650
Update Simple Upscale Button to work with spandrel models by @chainchompa in #6649
Simple upscale bugfixes by @chainchompa in #6655
fix(ui): settings menu layout by @psychedelicious in #6654
[MM2] Use typed ModelRecordChanges for model_install() rather than untyped dict by @lstein in #6645
fix(ui): restore pnpm-lock.yaml by @psychedelicious in #6659
feat(ui): upscaling & ad-hoc post-processing misc by @psychedelicious in #6658
fix(ui): model select overflowing when model names are too long by @psychedelicious in #6660
feat(ui): more gallery UX updates by @maryhipp in #6652
fix(ui): few cases where board totals don't updated when moving by @psychedelicious in #6665
ui: translations update from weblate by @weblate in #6653
chore: bump version v4.2.7rc1 by @psychedelicious in #6670

Full Changelog: v4.2.6post1...v4.2.7rc1

Contributors

lstein, weblate, and 8 other contributors

Assets 5

16 Jul 00:24

psychedelicious

v4.2.6post1

7905a46

v4.2.6post1

v4.2.6post1 fixes issues some users may experience with memory management and sporadic black image outputs.

Please see the v4.2.6 release for full release notes.

💾 Installation and Updating

To install or update to v4.2.6post1, download the installer and follow the installation instructions.

To update, select the same installation location. Your user data (images, models, etc) will be retained.

Missing models after updating from v3 to v4

See this FAQ.

Error during installation `ModuleNotFoundError: No module named 'controlnet_aux'`

See this FAQ

What's Changed

fix(backend): revert non-blocking device transfer by @psychedelicious in #6624
chore: bump version to 4.2.6post1 by @psychedelicious in #6625

Full Changelog: v4.2.6...v4.2.6post1

Contributors

psychedelicious

Assets 5

15 Jul 04:31

psychedelicious

v4.2.6

5a0c998

v4.2.6

v4.2.6 includes a handful of fixes and improvements, plus three major changes:

Gallery updates
Tiled upscaling via MultiDiffusion
Checkpoint models work without conversion to diffusers

Gallery Updates

We've made some changes to the gallery, adding features, improving the performance of the app and reducing memory usage. The changes also fix a number of bugs relating to stale data - for example, a board not updating as expected after moving an image to it.

Thanks to @chainchompa and @maryhipp for working on this major effort.

Pagination & Selection

Infinite scroll is dead, long live infinite scroll!

The gallery is now paginated. Selection logic has been updated to work with pagination. An indicator shows how many images are selected and allows you to clear the selection entirely. Arrow keys still navigate.

Gallery.Pagination.and.Selection.mov

The number of images per page is dynamically calculated as the panel is resized, ensuring the panel is always filled with images.

Boards UI Refresh

The bulky tiled boards grid has been replaced by a scrollable list. The boards list panel is now a resizable, collapsible panel.

Boards.List.and.Resizable.Panel.mov

Boards and Image Search

Search for boards by name and images by metadata. The search term is matched against the image's metadata as a string. We landed on full-text search as a flexible yet simple implementation after considering a few methods for search.

Boards.and.Images.Search.mov

Archived Boards

Archive a board to hide it from the main boards list. This is purely an organizational enhancement. You can still interact with archived boards as you would any other board.

Archived.Boards.mov

Image Sorting

You can now change the sort for images to show oldest first. A switch allows starred images to be placed in the list according to their age, instead of always showing them first.

Image.Sorting.mov

Tiled Upscaling via `MultiDiffusion`

MultiDiffusion is a fairly straightforward technique for tiled denoising. The gist is similar to other tiled upscaling methods - split the input image up in to tiles, process each independently, and stitch them back together. The main innovation for MultiDiffusion is to do this in latent space, blending the tensors together continuously. This results in excellent consistency across the output image, with no seams.

This feature is exposed as a Tiled MultiDiffusion Denoise Latents node, currently classified as a beta version. It works much the same as the OG Denoise Latents node. You can find an example workflow in the workflow library's default workflows.

We are still thinking about to expose this in the linear UI. Most likely, we expose this with very minimal settings. If you want to tweak it, use the workflow.

Thanks to @RyanJDick for designing and implementing MultiDiffusion.

How to use it

This technique is fundamentally the same as normal img2img. Appropriate use of conditioning and control will greatly improve the output. The one hard requirement is to use the Tile ControlNet model.

Besides that, here are some tips from our initial testing:

Use a detail-adding or style LoRAs.
Use a base model best suited for the desired output style.
Prompts make a difference.
The initial upscaling method makes a difference.
Scheduler makes a difference. Some produce softer outputs.

VRAM Usage

This technique can upscale images to very large sizes without substantially increasing VRAM usage beyond what you'd see for a "normal" sized generation. The VRAM bottlenecks then become the first VAE encode (Image to Latents) and final VAE decode (Latents to Image) steps.

You may run into OOM errors during these steps. The solution is to enable tiling using the toggle on the Image to Latents and Latents to Image nodes. This allows the VAE operations to be done piecewise, similar to the tiled denoising process, without using gobs of VRAM.

There's one caveat - VAE tiling often introduces inconsistency across tiles. Textures and colors may differ from tile to tile. This is a function of diffusers' handling of VAE tiling, not the new tiled denoising process. We are investigating ways to improve this.

Takeaway: If your GPU can handle non-tiled VAE encode and decode for a given output size, use that for best results.

Checkpoint models work without conversion to `diffusers`

The required conversion of checkpoint format models to diffusers format has long been a pain point. The diffusers library now supports loading single-file (checkpoint) models directly, and we have removed the mandatory checkpoint-to-diffusers conversion step.

The main user-facing change is that there is no longer a conversion cache directory.

Major thanks to @lstein for getting this working.

📈 Patch Nodes for v4.2.6

Enhancements

When downloading image metadata, graphs or workflows, the JSON file includes the image name and type of data. Thanks @jstnlowe!
Add clear_queue_on_startup config setting to clear problematic queues. This is useful for a rare edge case where your queue is full of items that somehow crash the app. Set this to true, and the queue will clear before it has time to attempt to execute the problematic item. Thanks @steffy-lo!
Performance and memory efficiency improvements for LoRA patching and model offloading.
Addition of a simplified model installation methods to the Invocation API: download_and_cache_model, load_local_model and load_remote_model. These methods allow models to be used without needing them to be added to the model manager. For example, we are now using these methods to load ESRGAN models.
Support for probing and loading SDXL VAE checkpoint.
Updated gallery UI.
Checkpoint models work without conversion to diffusers.
When using a VAE in tiled mode, you may now select the tile size.

Fixes

Fix handling handling of 0-step denoising process.
If a control image's processed version is missing when the app loads, it is now re-processed.
Fixed an issue where a model's size could be misreported as 0, possibly causing memory issues.
Fixed an issue where images - especially large images - may fail to delete.

Performance improvements

Improved LoRA patching.
Improved RAM <-> VRAM model transfer performance.

Internal changes

The DenoiseLatentsInvocation has had its internal methods split up to support tiled upscaling via MultiDiffusion. This included some amount of file shuffling and renaming. The invokeai package's exported classes should still be the same. Please let us know if this has broken an import for you.
Internal cleanup, intending to eliminate circular import issues. There's a lot left to do for this issue, but we are making progress.

💾 Installation and Updating

To install or update to v4.2.6, download the installer and follow the installation instructions.

To update, select the same installation location. Your user data (images, models, etc) will be retained.

Missing models after updating from v3 to v4

See this FAQ.

Error during installation `ModuleNotFoundError: No module named 'controlnet_aux'`

See this FAQ

What's Changed

Prefixed JSON filenames with the image UUID by @jstnlowe in #6486
feat(ui): control layers internals cleanup by @psychedelicious in #6487
LoRA patching optimization by @lstein in #6439
fix(ui): re-process control image if processed image is missing on page load by @psychedelicious in #6494
Split up latent.py (code reorganization, no functional changes) by @RyanJDick in #6491
Add simplified model manager install API to InvocationContext by @lstein in #6132
fix: Some imports from previous PR's by @blessedcoolant in #6501
Improve RAM<->VRAM memory copy performance in LoRA patching and elsewhere by @lstein in #6490
Fix DEFAULT_PRECISION handling by @RyanJDick in #6492
added route to install huggingface models from model marketplace by @chainchompa in #6515
Model hash validator by @brandonrising in #6520
Tidy SilenceWarnings context manager by @RyanJDick in #6493
[#6333] Add clear_queue_on_startup config to clear problematic queues by @steffy-lo in #6502
[MM] Add support for probing and loading SDXL VAE checkpoint files by @lstein in #6524
Add TiledMultiDiffusionDenoiseLatents invocation (for upscaling workflows) by @RyanJDick in https://github.com/invoke-ai/InvokeAI/...

Contributors

lstein, weblate, and 11 other contributors

Assets 5

13 Jul 05:06

psychedelicious

v4.2.6rc1

3ecd14f

v4.2.6rc1 Pre-release

Pre-release

v4.2.6 includes a handful of fixes and improvements, plus three major changes:

Gallery updates
Tiled upscaling via MultiDiffusion
Checkpoint models work without conversion to diffusers

Known Issues

Our last release, v4.2.5, was quickly pulled after a black image issue on MPS (macOS) was discovered. We also had reports of CUDA (Nvidia) GPUs getting unexpected OOM (Out of Memory) errors.

The MPS issue is resolved in this release, but we haven't been able to replicate unexpected OOMs on Linux or Windows. We did fix one issue that may have been a factor.

If you get OOMs on this alpha release with settings that worked fine on v4.2.4 - or have any other issues - please let us know via GH issues or discord.

Gallery Updates

Thanks to @chainchompa and @maryhipp for working on this major effort.

Pagination & Selection

Infinite scroll is dead, long live infinite scroll!

Gallery.Pagination.and.Selection.mov

The number of images per page is dynamically calculated as the panel is resized, ensuring the panel is always filled with images.

Boards UI Refresh

The bulky tiled boards grid has been replaced by a scrollable list. The boards list panel is now a resizable, collapsible panel.

Boards.List.and.Resizable.Panel.mov

Boards and Image Search

Boards.and.Images.Search.mov

Archived Boards

Archive a board to hide it from the main boards list. This is purely an organizational enhancement. You can still interact with archived boards as you would any other board.

Archived.Boards.mov

Image Sorting

You can now change the sort for images to show oldest first. A switch allows starred images to be placed in the list according to their age, instead of always showing them first.

Image.Sorting.mov

Tiled Upscaling via `MultiDiffusion`

MultiDiffusion is a fairly straightforward technique for tiled denoising. The gist is similar to other tiled upscaling methods - split the input image up in to tiles, process each independently, and stitch them back together. The main innovation for MultiDiffusion is to do this in latent space, blending the tensors together continually. This results in excellent consistency across the output image, with no seams.

This feature is exposed as a Tiled MultiDiffusion Denoise Latents node, currently classified as a beta version. It works much the same as the OG Denoise Latents node. Here's a workflow to get you started: sd15_multi_diffusion_esrgan_x2_upscale.json

We are still thinking about to expose this in the linear UI. Most likely, we expose this with very minimal settings. If you want to tweak it, use the workflow.

Thanks to @RyanJDick for designing and implementing MultiDiffusion.

How to use it

This technique is fundamentally the same as normal img2img. Appropriate use of conditioning and control will greatly improve the output. The one hard requirement is to use the Tile ControlNet model.

Besides that, here are some tips from our initial testing:

Use a detail-adding or style LoRAs.
Use a base model best suited for the desired output style.
Prompts make a difference.
The initial upscaling method makes a difference.
Scheduler makes a difference. Some produce softer outputs.

VRAM Usage

There's one caveat - VAE tiling often introduces inconsistency across tiles. Textures and colors may differ from tile to tile. This is a function of the diffusers handling of VAE tiling, not the tiled denoising process introduced in v4.2.5. We are investigating ways to improve this.

Takeaway: If your GPU can handle non-tiled VAE encode and decode for a given output size, use that for best results.

Checkpoint models work without conversion to diffusers

The required conversion of checkpoint format models to diffusers format has long been a pain point. Diffusers now supports loading single-file (checkpoint) models directly, and we have removed the mandatory checkpoint-to-diffusers conversion step.

The main user-facing change is that there is no longer a conversion cache directory!

Major thanks to @lstein for getting this working.

📈 Patch Nodes for v4.2.6

Enhancements

When downloading image metadata, graphs or workflows, the JSON file includes the image name and type of data. Thanks @jstnlowe!
Add clear_queue_on_startup config setting to clear problematic queues. This is useful for a rare edge case where your queue is full of items that somehow crash the app. Set this to true, and the queue will clear before it has time to attempt to execute the problematic item. Thanks @steffy-lo!
Performance and memory efficiency improvements for LoRA patching and model offloading.
Addition of a simplified model installation methods to the Invocation API: download_and_cache_model, load_local_model and load_remote_model. These methods allow models to be used without needing them to be added to the model manager. For example, we are now using these methods to load ESRGAN models.
Support for probing and loading SDXL VAE checkpoint.
Updated gallery UI.
Checkpoint models work without conversion to diffusers.
When using a VAE in tiled mode, you may now select the tile size.

Fixes

Fix handling handling of 0-step denoising process.
If a control image's processed version is missing when the app loads, it is now re-processed.
Fixed an issue where a model's size could be misreported as 0, possibly causing memory issues.
Fixed an issue where images - especially large images - may fail to delete.

Performance improvements

Improved LoRA patching.
Improved RAM <-> VRAM model transfer performance.

Internal changes

The DenoiseLatentsInvocation has had its internal methods split up to support tiled upscaling via MultiDiffusion. This included some amount of file shuffling and renaming. The invokeai package's exported classes should still be the same. Please let us know if this has broken an import for you.
Internal cleanup, intending to eliminate circular import issues. There's a lot left to do for this issue, but we are making progress.

💾 Installation and Updating

To install or update to v4.2.6rc1, download the installer and follow the installation instructions.

To update, select the same installation location. Your user data (images, models, etc) will be retained.

Missing models after updating from v3 to v4

See this FAQ.

Error during installation `ModuleNotFoundError: No module named 'controlnet_aux'`

See this FAQ

What's Changed

Prefixed JSON filenames with the image UUID by @jstnlowe in #6486
feat(ui): control layers internals cleanup by @psychedelicious in #6487
LoRA patching optimization by @lstein in #6439
fix(ui): re-process control image if processed image is missing on page load by @psychedelicious in #6494
Split up latent.py (code reorganization, no functional changes) by @RyanJDick in #6491
Add simplified model manager install API to InvocationContext by @lstein in #6132
fix: Some imports from previous PR's by @blessedcoolant in #6501
Improve RAM<->VRAM memory copy performance in LoRA patching and elsewhere by @lstein in https://github.com/invoke-ai/InvokeAI/pull...

Contributors

lstein, weblate, and 11 other contributors

Assets 5

11 Jul 01:35

psychedelicious

v4.2.6a1

de1235c

v4.2.6a1 Pre-release

Pre-release

v4.2.6a1 includes a handful of fixes and improvements, plus three major changes:

Gallery updates
Tiled upscaling via MultiDiffusion
Checkpoint models work without conversion to diffusers

Known Issues

Our last release, v4.2.5, was quickly pulled after a black image issue on MPS (macOS) was discovered. We also had reports of CUDA (Nvidia) GPUs getting unexpected OOM (Out of Memory) errors.

The MPS issue is resolved in this release, but we haven't been able to replicate unexpected OOMs on Linux or Windows. We did fix one issue that may have been a factor.

If you get OOMs on this alpha release with settings that worked fine on v4.2.4 - or have any other issues - please let us know via GH issues or discord.

Gallery Updates

Thanks to @chainchompa and @maryhipp for working on this major effort.

Pagination & Selection

Infinite scroll is dead, long live infinite scroll!

Gallery.Pagination.and.Selection.mov

The number of images per page is dynamically calculated as the panel is resized, ensuring the panel is always filled with images.

Boards UI Refresh

The bulky tiled boards grid has been replaced by a scrollable list. The boards list panel is now a resizable, collapsible panel.

Boards.List.and.Resizable.Panel.mov

Boards and Image Search

Boards.and.Images.Search.mov

Archived Boards

Archive a board to hide it from the main boards list. This is purely an organizational enhancement. You can still interact with archived boards as you would any other board.

Archived.Boards.mov

Image Sorting

You can now change the sort for images to show oldest first. A switch allows starred images to be placed in the list according to their age, instead of always showing them first.

Image.Sorting.mov

Tiled Upscaling via `MultiDiffusion`

MultiDiffusion is a fairly straightforward technique for tiled denoising. The gist is similar to other tiled upscaling methods - split the input image up in to tiles, process each independently, and stitch them back together. The main innovation for MultiDiffusion is to do this in latent space, blending the tensors together continually. This results in excellent consistency across the output image, with no seams.

We are still thinking about to expose this in the linear UI. Most likely, we expose this with very minimal settings. If you want to tweak it, use the workflow.

Thanks to @RyanJDick for designing and implementing MultiDiffusion.

How to use it

This technique is fundamentally the same as normal img2img. Appropriate use of conditioning and control will greatly improve the output. The one hard requirement is to use the Tile ControlNet model.

Besides that, here are some tips from our initial testing:

Use a detail-adding or style LoRAs.
Use a base model best suited for the desired output style.
Prompts make a difference.
The initial upscaling method makes a difference.
Scheduler makes a difference. Some produce softer outputs.

VRAM Usage

Takeaway: If your GPU can handle non-tiled VAE encode and decode for a given output size, use that for best results.

Checkpoint models work without conversion to diffusers

The main user-facing change is that there is no longer a conversion cache directory!

Major thanks to @lstein for getting this working.

📈 Patch Nodes for v4.2.5

Enhancements

When downloading image metadata, graphs or workflows, the JSON file includes the image name and type of data. Thanks @jstnlowe!
Add clear_queue_on_startup config setting to clear problematic queues. This is useful for a rare edge case where your queue is full of items that somehow crash the app. Set this to true, and the queue will clear before it has time to attempt to execute the problematic item. Thanks @steffy-lo!
Performance and memory efficiency improvements for LoRA patching and model offloading.
Addition of a simplified model installation methods to the Invocation API: download_and_cache_model, load_local_model and load_remote_model. These methods allow models to be used without needing them to be added to the model manager. For example, we are now using these methods to load ESRGAN models.
Support for probing and loading SDXL VAE checkpoint.
Updated gallery UI.
Checkpoint models work without conversion to diffusers.
When using a VAE in tiled mode, you may now select the tile size.

Fixes

Fix handling handling of 0-step denoising process.
If a control image's processed version is missing when the app loads, it is now re-processed.
Fixed an issue where a model's size could be misreported as 0, possibly causing memory issues.

Performance improvements

Improved LoRA patching.
Improved RAM <-> VRAM model transfer performance.

Internal changes

The DenoiseLatentsInvocation has had its internal methods split up to support tiled upscaling via MultiDiffusion. This included some amount of file shuffling and renaming. The invokeai package's exported classes should still be the same. Please let us know if this has broken an import for you.
Internal cleanup, intending to eliminate circular import issues. There's a lot left to do for this issue, but we are making progress.

💾 Installation and Updating

To install or update to v4.2.6a1, download the installer and follow the installation instructions.

To update, select the same installation location. Your user data (images, models, etc) will be retained.

Missing models after updating from v3 to v4

See this FAQ.

Error during installation `ModuleNotFoundError: No module named 'controlnet_aux'`

See this FAQ

What's Changed

Prefixed JSON filenames with the image UUID by @jstnlowe in #6486
feat(ui): control layers internals cleanup by @psychedelicious in #6487
LoRA patching optimization by @lstein in #6439
fix(ui): re-process control image if processed image is missing on page load by @psychedelicious in #6494
Split up latent.py (code reorganization, no functional changes) by @RyanJDick in #6491
Add simplified model manager install API to InvocationContext by @lstein in #6132
fix: Some imports from previous PR's by @blessedcoolant in #6501
Improve RAM<->VRAM memory copy performance in LoRA patching and elsewhere by @lstein in #6490
Fix DEFAULT_PRECISION handling by @RyanJDick in https://github.com/in...

Contributors

lstein, ebr, and 10 other contributors

Assets 5

27 Jun 01:38

psychedelicious

v4.2.5

4fca626

v4.2.5 Pre-release

Pre-release

🚨 macOS users may get black images when using LoRAs or IP Adapters. Users with CUDA GPUs may get unexpected OOMs. We are investigating. 🚨

v4.2.5 includes a handful of fixes and improvements, plus one exciting beta node - tiled upscaling via MultiDiffusion.

If you missed v4.2.0, please review its release notes to get up to speed on Control Layers.

Tiled Upscaling via `MultiDiffusion`

MultiDiffusion is a fairly straightforward technique for tiled denoising. The gist is similar to other tiled upscaling methods - split the input image up in to tiles, process each independently, and stitch them back together. The main innovation for MultiDiffusion is to do this in latent space, blending the tensors together continually. This results in excellent consistency across the output image, with no seams.

We are still thinking about to expose this in the linear UI. Most likely, we expose this with very minimal settings. If you want to tweak it, use the workflow.

How to use it

This technique is fundamentally the same as normal img2img. Appropriate use of conditioning and control will greatly improve the output. The one hard requirement is to use the Tile ControlNet model.

Besides that, here are some tips from our initial testing:

Use a detail-adding or style LoRAs.
Use a base model best suited for the desired output style.
Prompts make a difference.
The initial upscaling method makes a difference.
Scheduler makes a difference. Some produce softer outputs.

VRAM Usage

Takeaway: If your GPU can handle non-tiled VAE encode and decode for a given output size, use that for best results.

📈 Patch Nodes for v4.2.5

Enhancements

When downloading image metadata, graphs or workflows, the JSON file includes the image name and type of data. Thanks @jstnlowe!
Add clear_queue_on_startup config setting to clear problematic queues. This is useful for a rare edge case where your queue is full of items that somehow crash the app. Set this to true, and the queue will clear before it has time to attempt to execute the problematic item. Thanks @steffy-lo!
Performance and memory efficiency improvements for LoRA patching and model offloading.
Addition of a simplified model installation methods to the Invocation API: download_and_cache_model, load_local_model and load_remote_model. These methods allow models to be used without needing them to be added to the model manager. For example, we are now using these methods to load ESRGAN models.
Support for probing and loading SDXL VAE checkpoint.

Fixes

Fix handling handling of 0-step denoising process.
If a control image's processed version is missing when the app loads, it is now re-processed.

Performance improvements

Improved LoRA patching.
Improved RAM <-> VRAM model transfer performance.

Internal changes

The DenoiseLatentsInvocation has had its internal methods split up to support tiled upscaling via MultiDiffusion. This included some amount of file shuffling and renaming. The invokeai package's exported classes should still be the same. Please let us know if this has broken an import for you.

💾 Installation and Updating

To install or update to v4.2.5, download the installer and follow the installation instructions.

To update, select the same installation location. Your user data (images, models, etc) will be retained.

Missing models after updating from v3 to v4

See this FAQ.

Error during installation `ModuleNotFoundError: No module named 'controlnet_aux'`

See this FAQ

What's Changed

Prefixed JSON filenames with the image UUID by @jstnlowe in #6486
feat(ui): control layers internals cleanup by @psychedelicious in #6487
LoRA patching optimization by @lstein in #6439
fix(ui): re-process control image if processed image is missing on page load by @psychedelicious in #6494
Split up latent.py (code reorganization, no functional changes) by @RyanJDick in #6491
Add simplified model manager install API to InvocationContext by @lstein in #6132
fix: Some imports from previous PR's by @blessedcoolant in #6501
Improve RAM<->VRAM memory copy performance in LoRA patching and elsewhere by @lstein in #6490
Fix DEFAULT_PRECISION handling by @RyanJDick in #6492
added route to install huggingface models from model marketplace by @chainchompa in #6515
Model hash validator by @brandonrising in #6520
Tidy SilenceWarnings context manager by @RyanJDick in #6493
[#6333] Add clear_queue_on_startup config to clear problematic queues by @steffy-lo in #6502
[MM] Add support for probing and loading SDXL VAE checkpoint files by @lstein in #6524
Add TiledMultiDiffusionDenoiseLatents invocation (for upscaling workflows) by @RyanJDick in #6522
Update prevention exception message by @hipsterusername in #6543
Fix handling handling of 0-step denoising process by @RyanJDick in #6544
chore: bump version v4.2.5 by @psychedelicious in #6547

New Contributors

@jstnlowe made their first contribution in #6486

Full Changelog: v4.2.4...v4.2.5

Contributors

lstein, psychedelicious, and 7 other contributors

Assets 5

04 Jun 23:43

psychedelicious

v4.2.4

ff01cea

v4.2.4

v4.2.4 brings one frequently requested feature and a host of fixes and improvements, mostly focused on performance and internal code quality.

If you missed v4.2.0, please review its release notes to get up to speed on Control Layers.

Image Comparison

The image viewer now supports comparing two images using a Slider, Side-by-Side or Hover UI.

To enter the comparison UI, select a compare image using one of these methods:

Right click an image and click Select for Compare.
Hold alt (option on mac) while clicking a gallery image to select it as the compare image.
Hold alt (option on mac) and use the arrow keys to select the comparison image.

Press C to swap the images and M to cycle through the comparison modes. Press Escape or Z to exit the comparison UI and return to the single image viewer.

When comparing images of different aspect ratios or sizes, the compare image will be stretched to fit the viewer image. Disable the toggle button at the top-left to instead contain the compare image within the viewer image.

Screen.Recording.2024-06-05.at.9.26.00.am.mov

📈 Patch Nodes for v4.2.4

Enhancements

The queue item detail view now updates when it finishes. The finished (completed, failed or canceled) session is displayed.
Updated translations. @Harvester62 @Vasyanator @BrunoCdot @gallegonovato @Atalanttore @hugoalh
Docs updates. @hsm207 @cdpath

Fixes

Fixed problem when using a latents from the blend latents node for denoising with certain schedulers which made images drastically different, even with an alpha of 0.
Fixed unnecessarily strict constraints for ControlNet and IP Adapter weights in the Control Layers UI. This prevented layers with weights outside the range of 0-1 from recalling.
Fixed error when editing non-main models (e.g. LoRAs).
Fixed the SDXL prompt concat flag from not being set when recalling prompts.
Fixed model metadata recall not working when a model has a different key. This can happen if the model was uninstalled and reinstalled. When recalling, we fall back on the model's name, base and type, if the key doesn't match an existing model.

Performance improvements

Big thanks to @lstein for these very impactful improvements!

Substantially improved performance when moving models between RAM and VRAM. For example, an SDXL model RAM -> VRAM -> RAM roundtrip tested at ~0.8s, down from ~3s. That's about 75% faster!
Fixed bug with VRAM lazy offloading which caused inefficient VRAM cache usage.
Reduced VRAM requirements when using IP Adapter.

Internal changes

Modularize the queue processor.
Use pydantic models for events instead of plain dicts.
Improved handling of pydantic invocation unions.
Updated ML dependencies. @Malrama

💾 Installation and Updating

To install or update to v4.2.4, download the installer and follow the installation instructions.

To update, select the same installation location. Your user data (images, models, etc) will be retained.

Missing models after updating from v3 to v4

See this FAQ.

Error during installation `ModuleNotFoundError: No module named 'controlnet_aux'`

See this FAQ

What's Changed

feat: update queue items session on complete by @psychedelicious in #6419
Break apart session processor and the running of each session into se… by @brandonrising in #6382
fix(ui): isLocal erroneously hardcoded in error toast by @psychedelicious in #6436
fix typo by @cdpath in #6255
Optimize RAM to VRAM transfer by @lstein in #6312
docs: fix a typo by @hsm207 in #6395
Update deps to their lastest versions by @Malrama in #6327
fix(ui): parameter not set translation by @psychedelicious in #6441
refactor(events): use pydantic schemas for events by @psychedelicious in #5748
feat(events): restore full invocation in event payloads by @psychedelicious in #6447
fix(ui): edit variant for main models only by @psychedelicious in #6446
feat(events): register event schemas by @psychedelicious in #6448
feat(events): add missing classvar to events, add validators for deserialization of events by @psychedelicious in #6451
Update TI handling for compatibility with transformers 4.40.0 by @RyanJDick in #6449
[MM]Fix bug in offload_unlocked_models() call by @lstein in #6450
docs: add FAQ for fixing controlnet_aux by @psychedelicious in #6459
fix(ui): remove overly strict constraints on control adapter weight by @psychedelicious in #6460
fix: openapi stuff by @psychedelicious in #6454
feat(ui): image compare by @psychedelicious in #6464
[feat] Reduce peak VRAM memory usage of IP adapter by @lstein in #6453
fix(ui): metadata recall fixes by @psychedelicious in #6480
fix(nodes): blend latents with weight=0 with DPMSolverSDEScheduler by @psychedelicious in #6482
ui: translations update from weblate by @weblate in #6440
chore: v4.2.4 by @psychedelicious in #6485

New Contributors

@cdpath made their first contribution in #6255

Full Changelog: v4.2.3...v4.2.4

Contributors

lstein, Atalanttore, and 12 other contributors

Assets 5

23 May 22:27

psychedelicious

v4.2.3

c3f28f7

v4.2.3

If you missed v4.2.0, please review its release notes to get up to speed on Control Layers.

📈 Patch Nodes for v4.2.3

Spellcheck is re-enabled on prompt boxes
DB maintenance script removed from launcher (it currently does not work)
Reworked toasts. When a toast of a given type is triggered, if another toast of that type is already being displayed, it is updated instead of creating another toast. The old behaviour was painful in situations where you queue up many generations that all immediately fail, or install a lot of models at once. In these situations, you'd get a wall of toasts. Now you get only 1.
Fixed: Control layer checkbox correctly indicates that it enables or disables the layer
Fixed: Disabling Regional Guidance layers didn't work
Fixed: Excessive warnings in terminal when uploading images
Fixed: When loading a workflow, if an image, board or model for an input for that workflow no longer exists, the workflow will execute but error.

For example, say you save a workflow that has a certain model set for a node, then delete the model. When you load that workflow, the model is missing but the workflow doesn't detect this. You can run the workflow, and it will fail when it attempts to use the nonexistent model.

With this fix, when a workflow is loaded, we check for the existence of all images, boards and models referenced by the workflow. If something is missing, that input is reset.
Docs updates @hsm207
Translations updates @gallegonovato @Harvester62 @dvanzoerlandt

💾 Installation and Updating

To install or update to v4.2.3, download the installer and follow the installation instructions.

To update, select the same installation location. Your user data (images, models, etc) will be retained.

Missing models after updating from v3 to v4

See this FAQ.

What's Changed

docs: fix link to Invoke AI's models site by @hsm207 in #6413
feat(ui): workflow resource check by @psychedelicious in #6417
feat(ui): restore spellcheck on prompt boxes by @psychedelicious in #6418
feat(ui): toasts rework by @psychedelicious in #6415
ui: translations update from weblate by @weblate in #6372
Update error boundary to link to support ticket for non-local by @maryhipp in #6422
fix(ui): regional guidance layers not disabling correctly by @psychedelicious in #6427
fix: remove db maintenance script from launcher by @psychedelicious in #6431
feat(api): downgrade metadata parse warnings to debug by @psychedelicious in #6426
fix(ui): 'undefined' being used for metadata on uploaded images by @psychedelicious in #6433
fix(ui): initial image layers always ignored by @psychedelicious in #6434
chore: v4.2.3 by @psychedelicious in #6432
ui: translations update from weblate by @weblate in #6428

Full Changelog: v4.2.2post1...v4.2.3

Contributors

weblate, hsm207, and 5 other contributors

Assets 5

21 May 02:49

psychedelicious

v4.2.2post1

aa0c59b

v4.2.2post1

This release brings many fixes and enhancements, including two long-awaited features: undo/redo in workflows and load workflow from any image.

If you missed v4.2.0, please review its release notes to get up to speed on Control Layers.

📈 Patch Nodes for v4.2.2post1

v4.2.2 had a critical bug related to notes nodes & missing templates in workflows. That is fixed in v4.2.2post1.

✨ Undo/redo in Workflows

Undo/redo redo now available in the workflow editor. There's some amount of tuning to be done with how actions are grouped.

For example, when you move a node around, do we allow you to undo each pixel of movement, or do we group the position changes as one action? When you are typing a prompt, do we undo each letter, word, or the whole change at once?

Currently, we group like changes together. It's possible some things are grouped when they shouldn't be, or should be grouped but are not. Your feedback will be very useful in tuning the behaviour so it un-does the right changes.

✨ Load Workflow from Any Image

Starting with v4.2.2, graphs are embedded in all images generated by Invoke. Images generated in the workflow editor also have the enriched workflow embedded separately. The Load Workflow button will load the enriched workflow if it exists, else it will load the graph.

You'll see a new Graph tab in the metadata viewer showing the embedded graph.

Graph vs Workflow

Graphs are used by the backend and contain minimal data. Workflows are an enrich data format that includes a representation of the graph plus extra information, including things like:

Title, description, author, etc
Node positions
Custom node and field labels

This new feature embeds the graph in every image - including images generated on the Generation or Canvas tabs.

Canvas Caveat

This functionality is available only for individual canvas generations - not the full composition. Why is that?

Consider what goes into a full canvas composition. It's the product of any number of graphs, with any amount of drawing and erasing between each graph execution. It's not possible to consolidate this into a single graph.

When you generate on canvas, your images for the given bounding box are added to a staging area, which allows you to cycle through images and commit or discard the image. The staging area also allows you to save a candidate generation. It is these images that can be loaded as a workflow, because they are the product of a single graph execution.

👷 Other Fixes and Enhancements

Min/max LoRA weight values extended (-10 to +10) @H0onnn
Denoising strength and layer opacity are retained when sending image to initial image @steffy-lo
SDXL T2I Adapter only blocks invoking when dimensions aren't multiple of 32 (was erroneously 64)
Improved UX when manipulating edges in workflows
Connected inputs on nodes collapse, hiding the nonfunctional UI component
Use ctrl/cmd-shift-v to paste copied nodes with input edges
Docs updates @hsm207
Fix: visible seams when outpainting
Fix: edge case that could prevent workflows from loading if user hadn't opened the workflows tab yet
Fix: minor jank/inefficiency with control adapter auto-process (control layers only)
Internal: utility to create graph objects without going crazy
Internal: rewritten connection validation logic for workflows with full test coverage
Internal: rewritten edge connection interactions
Internal: revised field type format

💾 Installation and Updating

To install or update to v4.2.2post1, download the installer and follow the installation instructions.

To update, select the same installation location. Your user data (images, models, etc) will be retained.

Missing models after updating from v3 to v4

See this FAQ.

What's Changed

fix: Fix Outpaint not applying the expanded mask correctly by @blessedcoolant in #6370
feat(ui): graph builder by @psychedelicious in #6361
docs: fix install reqs link by @psychedelicious in #6374
feat(ui): SDXL clip skip by @psychedelicious in #6378
[Refactor] Update min and max values for LoRACard weight input by @H0onnn in #6383
feat(ui): workflows undo/redo by @psychedelicious in #6379
feat(ui): prevent connections to direct-only inputs by @psychedelicious in #6387
feat(ui): copy/paste input edges when copying node by @psychedelicious in #6385
fix(ui): allow image dims multiple of 32 with SDXL and T2I adapter by @psychedelicious in #6366
revert(ui): SDXL clip skip by @psychedelicious in #6389
fix(ui): field ordering & display by @psychedelicious in #6390
feat(ui): store workflow in generation tab images by @psychedelicious in #6384
feat(worker): add nullable user_id and project_id to invocation error events by @maryhipp in #6388
docs: fix link to. install reqs by @hsm207 in #6392
fix(ui): control adapter auto-process jank by @psychedelicious in #6393
feat(ui): connection validation rework by @psychedelicious in #6386
fix(ui): edge case resulting in no node templates when loading workfl… by @psychedelicious in #6397
feat: write-only canvas metadata by @psychedelicious in #6404
fix(ui): fix t2i adapter dimensions error message by @psychedelicious in #6391
feat(ui): better field types by @psychedelicious in #6396
fix(ui): workflow edges not saved by @psychedelicious in #6403
[#6351] ui: retain denoise strength and opacity when changing image by @steffy-lo in #6407
chore: v4.2.2 by @psychedelicious in #6410
fix(ui): crash when using notes nodes or missing node/field templates by @psychedelicious in #6412

New Contributors

@H0onnn made their first contribution in #6383
@hsm207 made their first contribution in #6392
@steffy-lo made their first contribution in #6407

Full Changelog: v4.2.1...v4.2.2post1

Contributors

hsm207, psychedelicious, and 4 other contributors

Assets 5

20 May 22:26

psychedelicious

v4.2.2

66c9f47

v4.2.2

This release brings many fixes and enhancements, including two long-awaited features: undo/redo in workflows and load workflow from any image.

If you missed v4.2.0, please review its release notes to get up to speed on Control Layers.

📈 Patch Nodes for v4.2.2

✨ Undo/redo in Workflows

Undo/redo redo now available in the workflow editor. There's some amount of tuning to be done with how actions are grouped.

✨ Load Workflow from Any Image

You'll see a new Graph tab in the metadata viewer showing the embedded graph.

Graph vs Workflow

Graphs are used by the backend and contain minimal data. Workflows are an enrich data format that includes a representation of the graph plus extra information, including things like:

Title, description, author, etc
Node positions
Custom node and field labels

This new feature embeds the graph in every image - including images generated on the Generation or Canvas tabs.

Canvas Caveat

This functionality is available only for individual canvas generations - not the full composition. Why is that?

👷 Other Fixes and Enhancements

Min/max LoRA weight values extended (-10 to +10) @H0onnn
Denoising strength and layer opacity are retained when sending image to initial image @steffy-lo
SDXL T2I Adapter only blocks invoking when dimensions aren't multiple of 32 (was erroneously 64)
Improved UX when manipulating edges in workflows
Connected inputs on nodes collapse, hiding the nonfunctional UI component
Use ctrl/cmd-shift-v to paste copied nodes with input edges
Docs updates @hsm207
Fix: visible seams when outpainting
Fix: edge case that could prevent workflows from loading if user hadn't opened the workflows tab yet
Fix: minor jank/inefficiency with control adapter auto-process (control layers only)
Internal: utility to create graph objects without going crazy
Internal: rewritten connection validation logic for workflows with full test coverage
Internal: rewritten edge connection interactions
Internal: revised field type format

💾 Installation and Updating

To install or update to v4.2.2, download the installer and follow the installation instructions.

To update, select the same installation location. Your user data (images, models, etc) will be retained.

Missing models after updating from v3 to v4

See this FAQ.

What's Changed

fix: Fix Outpaint not applying the expanded mask correctly by @blessedcoolant in #6370
feat(ui): graph builder by @psychedelicious in #6361
docs: fix install reqs link by @psychedelicious in #6374
feat(ui): SDXL clip skip by @psychedelicious in #6378
[Refactor] Update min and max values for LoRACard weight input by @H0onnn in #6383
feat(ui): workflows undo/redo by @psychedelicious in #6379
feat(ui): prevent connections to direct-only inputs by @psychedelicious in #6387
feat(ui): copy/paste input edges when copying node by @psychedelicious in #6385
fix(ui): allow image dims multiple of 32 with SDXL and T2I adapter by @psychedelicious in #6366
revert(ui): SDXL clip skip by @psychedelicious in #6389
fix(ui): field ordering & display by @psychedelicious in #6390
feat(ui): store workflow in generation tab images by @psychedelicious in #6384
feat(worker): add nullable user_id and project_id to invocation error events by @maryhipp in #6388
docs: fix link to. install reqs by @hsm207 in #6392
fix(ui): control adapter auto-process jank by @psychedelicious in #6393
feat(ui): connection validation rework by @psychedelicious in #6386
fix(ui): edge case resulting in no node templates when loading workfl… by @psychedelicious in #6397
feat: write-only canvas metadata by @psychedelicious in #6404
fix(ui): fix t2i adapter dimensions error message by @psychedelicious in #6391
feat(ui): better field types by @psychedelicious in #6396
fix(ui): workflow edges not saved by @psychedelicious in #6403
[#6351] ui: retain denoise strength and opacity when changing image by @steffy-lo in #6407
chore: v4.2.2 by @psychedelicious in #6410

New Contributors

@H0onnn made their first contribution in #6383
@hsm207 made their first contribution in #6392
@steffy-lo made their first contribution in #6407

Full Changelog: v4.2.1...v4.2.2

Contributors

hsm207, psychedelicious, and 4 other contributors

Assets 5

Releases: invoke-ai/InvokeAI

v4.2.7rc1

Upscaling

Dedicated Upscaling Tab

Requirements

Tips

Custom Upscaling Models

Installing Models

Nodes

Gallery Improvements

Other Changes

Installation and Updating

Missing models after updating from v3 to v4

Error during installation ModuleNotFoundError: No module named 'controlnet_aux'

What's Changed

Contributors

v4.2.6post1

💾 Installation and Updating

Missing models after updating from v3 to v4

Error during installation ModuleNotFoundError: No module named 'controlnet_aux'

What's Changed

Contributors

v4.2.6

Gallery Updates

Pagination & Selection

Boards UI Refresh

Boards and Image Search

Archived Boards

Image Sorting

Tiled Upscaling via MultiDiffusion

How to use it

VRAM Usage

Checkpoint models work without conversion to diffusers

📈 Patch Nodes for v4.2.6

Enhancements

Fixes

Performance improvements

Internal changes

💾 Installation and Updating

Missing models after updating from v3 to v4

Error during installation ModuleNotFoundError: No module named 'controlnet_aux'

What's Changed

Contributors

v4.2.6rc1

Known Issues

Gallery Updates

Pagination & Selection

Boards UI Refresh

Boards and Image Search

Archived Boards

Image Sorting

Tiled Upscaling via MultiDiffusion

How to use it

VRAM Usage

Checkpoint models work without conversion to diffusers

📈 Patch Nodes for v4.2.6

Enhancements

Fixes

Performance improvements

Internal changes

💾 Installation and Updating

Missing models after updating from v3 to v4

Error during installation ModuleNotFoundError: No module named 'controlnet_aux'

What's Changed

Contributors

v4.2.6a1

Known Issues

Gallery Updates

Pagination & Selection

Boards UI Refresh

Boards and Image Search

Archived Boards

Image Sorting

Tiled Upscaling via MultiDiffusion

How to use it

VRAM Usage

Checkpoint models work without conversion to diffusers

📈 Patch Nodes for v4.2.5

Enhancements

Fixes

Error during installation `ModuleNotFoundError: No module named 'controlnet_aux'`

Error during installation `ModuleNotFoundError: No module named 'controlnet_aux'`

Tiled Upscaling via `MultiDiffusion`

Checkpoint models work without conversion to `diffusers`

Error during installation `ModuleNotFoundError: No module named 'controlnet_aux'`

Tiled Upscaling via `MultiDiffusion`

Error during installation `ModuleNotFoundError: No module named 'controlnet_aux'`

Tiled Upscaling via `MultiDiffusion`

Error during installation `ModuleNotFoundError: No module named 'controlnet_aux'`

Tiled Upscaling via `MultiDiffusion`

Error during installation `ModuleNotFoundError: No module named 'controlnet_aux'`

Error during installation `ModuleNotFoundError: No module named 'controlnet_aux'`