Official Implementation of DOODL (End-to-End Diffusion Latent Optimization Improves Classifier Guidance)

What is DOODL?

DOODL (Direct Optimization of Diffusion Latents) is a variant of classifier guidance that directly optimizes diffusion latents x_T instead of using model-based gradients to guide denoising. This is done be leveraging the EDICT algorithm and MemCNN library to construct a diffusion process that can be backpropagated through with constant memory cost w.r.t the number of diffusion steps without significant runtime increase. The control of this optimization allows a variety of guidance modes to be incorporated. Check out our paper for more details and don't hesitate to reach out with questions!

Setup

HF Auth token

Paste a copy of a suitable HF Auth Token into hf_auth with no new line (to be read by the following code in edict_functions.py)

with open('hf_auth', 'r') as f:
    auth_token = f.readlines()[0].strip()

Example file at ./hf_auth

abc123abc123

Environment

Run conda env create -f environment.yaml, activate that conda env (conda activate doodl). Run jupyter with that conda env active

FGVC models

FGVC models can be downloaded from the WS-DAN repo and saved at fgvc_ws_dan_helpers/checkpoints/

Experimentation

Check out this notebook for examples of how to use DOODL.

Other Files

doodl.py has the core functionality of DOODL
my_half_diffusers is a very slightly changed version of the HF Diffusers repo
fgvc_ws_dan_helpers/ gives access to the WSDAN Model.
memcnn/ is a very lightly modified version of the excellent MemCNN library. Thank you to the original MemCNN authors!

Citation

If you find our work useful in your research, please cite the following works:

@misc{wallace2023endtoend,
      title={End-to-End Diffusion Latent Optimization Improves Classifier Guidance}, 
      author={Bram Wallace and Akash Gokul and Stefano Ermon and Nikhil Naik},
      year={2023},
      eprint={2303.13703},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

@article{wallace2022edict,
  title={EDICT: Exact Diffusion Inversion via Coupled Transformations},
  author={Wallace, Bram and Gokul, Akash and Naik, Nikhil},
  journal={arXiv preprint arXiv:2211.12446},
  year={2022}
}

License

Our code is BSD-3 licensed. See LICENSE.txt for details.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
example_ims		example_ims
fgvc_ws_dan_helpers		fgvc_ws_dan_helpers
memcnn		memcnn
my_half_diffusers		my_half_diffusers
.gitignore		.gitignore
CODEOWNERS		CODEOWNERS
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE.txt		LICENSE.txt
README.md		README.md
SECURITY.md		SECURITY.md
demo.ipynb		demo.ipynb
doodl.py		doodl.py
environment.yaml		environment.yaml
helper_functions.py		helper_functions.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Official Implementation of DOODL (End-to-End Diffusion Latent Optimization Improves Classifier Guidance)

What is DOODL?

Setup

HF Auth token

Environment

FGVC models

Experimentation

Other Files

Citation

License

About

Releases

Packages

Languages

License

salesforce/DOODL

Folders and files

Latest commit

History

Repository files navigation

Official Implementation of DOODL (End-to-End Diffusion Latent Optimization Improves Classifier Guidance)

What is DOODL?

Setup

HF Auth token

Environment

FGVC models

Experimentation

Other Files

Citation

License

About

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages