Arbitrary-steps Image Super-resolution via Diffusion Inversion

Zongsheng Yue, Kang Liao, Chen Change Loy

⭐ If you've found InvSR useful for your research or projects, please show your support by starring this repo. Thanks! 🤗

This study presents a new image super-resolution (SR) technique based on diffusion inversion, aiming at harnessing the rich image priors encapsulated in large pre-trained diffusion models to improve SR performance. We design a \textit{Partial noise Prediction} strategy to construct an intermediate state of the diffusion model, which serves as the starting sampling point. Central to our approach is a deep noise predictor to estimate the optimal noise maps for the forward diffusion process. Once trained, this noise predictor can be used to initialize the sampling process partially along the diffusion trajectory, generating the desirable high-resolution result. Compared to existing approaches, our method offers a flexible and efficient sampling mechanism that supports an arbitrary number of sampling steps, ranging from one to five. Even with a single sampling step, our method demonstrates superior or comparable performance to recent state-of-the-art approaches.

Update

_ 2024.12.14: Add .

2024.12.13: Add and .
2024.12.11: Create this repo.

Requirements

Python 3.10, Pytorch 2.4.0, xformers 0.0.27.post2
More detail (See environment.yaml)
A suitable conda environment named invsr can be created and activated with:

conda create -n invsr python=3.10
conda activate invsr
pip install torch==2.4.0 torchvision==0.19.0 torchaudio==2.4.0 --index-url https://download.pytorch.org/whl/cu121
pip install -U xformers==0.0.27.post2 --index-url https://download.pytorch.org/whl/cu121
pip install -e ".[torch]"
pip install -r requirements.txt

Applications

👉 Real-world Image Super-resolution

👉 General Image Enhancement

👉 AIGC Image Enhancement

Inference

🚀 Fast testing

python inference_invsr.py -i [image folder/image path] -o [result folder] --num_steps 1

To deal with large images, e.g., 1k---->4k, we recommend adding the option --chopping_size 256.
Other options:
- Specify the pre-downloaded SD Turbo Model: --sd_path.
- Specify the pre-downloaded noise predictor: --started_ckpt_path.
- The number of sampling steps: --num_steps.
- If your GPU memory is limited, please add the option --chopping_bs 1.

🚃 Online Demo

You can try our method through an online demo:

python app.py

✈️ Reproducing our paper results

Synthetic dataset of ImageNet-Test: Google Drive.
Real data for image super-resolution: RealSRV3 | RealSet80
To reproduce the quantitative results on Imagenet-Test and RealSRV3, please add the color fixing options by --color_fix wavelet.

Training

🐢 Preparing stage

Download the finetuned LPIPS model from this link and put it in the folder of "weights".
Prepare the config file:
- SD-Turbo path: configs.sd_pipe.params.cache_dir.
- Training data path: data.train.params.data_source.
- Validation data path: data.val.params.dir_path (low-quality image) and data.val.params.extra_dir_path (high-quality image).
- Batchsize: configs.train.batch and configs.train.microbatch (total batchsize = microbatch * #GPUS * num_grad_accumulation)

🐬 Begin training

CUDA_VISIBLE_DEVICES=0,1,2,3 torchrun --standalone --nproc_per_node=4 --nnodes=1 main.py --save_dir [Logging Folder]

🐳 Resume from interruption

CUDA_VISIBLE_DEVICES=0,1,2,3 torchrun --standalone --nproc_per_node=4 --nnodes=1 main.py --save_dir [Logging Folder] --resume save_dir/ckpts/model_xx.pth

License

This project is licensed under NTU S-Lab License 1.0. Redistribution and use should follow this license.

Acknowledgement

This project is based on BasicSR and diffusers. Thanks for their awesome works.

Contact

If you have any questions, please feel free to contact me via zsyzam@gmail.com.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
assets		assets
basicsr		basicsr
configs		configs
datapipe		datapipe
latent_lpips		latent_lpips
scripts		scripts
src		src
testdata		testdata
utils		utils
LICENSE		LICENSE
README.md		README.md
app.py		app.py
cog.yaml		cog.yaml
environment.yaml		environment.yaml
inference_invsr.py		inference_invsr.py
main.py		main.py
predict.py		predict.py
requirements.txt		requirements.txt
sampler_invsr.py		sampler_invsr.py
setup.py		setup.py
trainer.py		trainer.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Arbitrary-steps Image Super-resolution via Diffusion Inversion

Update

Requirements

Applications

👉 Real-world Image Super-resolution

👉 General Image Enhancement

👉 AIGC Image Enhancement

Inference

🚀 Fast testing

🚃 Online Demo

✈️ Reproducing our paper results

Training

🐢 Preparing stage

🐬 Begin training

🐳 Resume from interruption

License

Acknowledgement

Contact

About

Releases

Packages

Languages

License

zsyOAOA/InvSR

Folders and files

Latest commit

History

Repository files navigation

Arbitrary-steps Image Super-resolution via Diffusion Inversion

Update

Requirements

Applications

👉 Real-world Image Super-resolution

👉 General Image Enhancement

👉 AIGC Image Enhancement

Inference

🚀 Fast testing

🚃 Online Demo

✈️ Reproducing our paper results

Training

🐢 Preparing stage

🐬 Begin training

🐳 Resume from interruption

License

Acknowledgement

Contact

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages