Skip to content

de-id/diffusers-papers

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

60 Commits
 
 

Repository files navigation

Denoising Diffusion Probabilistic Models Papers

Papers club from the AI team in D-ID - this time Diffusion Model(DM).

Diffusion Models were first introduced in Deep Unsupervised Learning using Nonequilibrium Thermodynamics. However, it took until Generative Modeling by Estimating Gradients of the Data Distribution (Song et al., 2019, Stanford University), and then Denoising Diffusion Probabilistic Models (Ho et al., 2020, Google Brain) who independently improved the approach.

A good explnantion on what are Diffusion Models and why they are intresting can be found in Diffusion-Models Tutorial (CVPR 2022).

מועדון קריאת מאמרים שלנו - כל ההרצאות בעיברית

Title Paper / Resource Year Why is it interesting? Asignee Recording Slides
Denoising Diffusion Probabilistic Models Denoising Diffusion Probabilistic Models 2020
read whyhigh quality image synthesis results using diffusion probabilistic models, a class of latent variable models inspired by considerations from nonequilibrium thermodynamics.
@talbenha zoom(@NnH10JK) slides
The Annotated Diffusion Model The Annotated Diffusion Model
read why
self-work -- --
Colorization, Inpainting, Uncropping, and JPEG restoration Palette: Image-to-Image Diffusion Models 2021
read why A unified framework for image-to-image translation based on conditional diffusion models and evaluates this framework on four challenging image-to-image translation tasks, namely colorization, inpainting, uncropping, and JPEG restoration
@ArnoBen zoom (6CbWY6e*) slides
Rethinking Diffusion Models Design Elucidating the Design Space of Diffusion-Based Generative Models 2020
read whyKarras, the StyleGAN author is doing a back to the roots rethinking design choices of diffusion models, creating a well justified baseline archtecture
@orgoro zoom1(.m0gN7.?) zoom2(S^*c0ai3) slides
Super-Resolution Image Super-Resolution via Iterative Refinement 2021
read whyhigh quality image synthesis results using diffusion probabilistic models, a class of latent variable models inspired by considerations from nonequilibrium thermodynamics.
self-work -- --
Classifier (+ Classifier-Free) Diffusion Guidance Diffusion Models Beat GANs on Image Synthesis & Classifier-Free Diffusion Guidance 2021
read why DM achieve image sample quality superior to the current SOTA GAN models by improving the U-Net architecture, as well as introducing classifier (+calssifier free) guidance
@talbenha zoom(?JS330&C) slides
Text2Image ImageGen 2022
read why text-to-image synthesis
@alon.mengi zoom(7hB61@CU) slides
Efficient DM (Stable Diffusion) High-Resolution Image Synthesis with Latent Diffusion Models 2022
read why Apply DM in the latent space of powerful pretrained autoencoders to enable training on limited computational resources while retaining their quality and flexibility
@ShiraBaronn zoom(U!+B+7g+) slides
Imagic Imagic: Text-Based Real Image Editing with Diffusion Models 2022
read whyApply complex (e.g., non-rigid) text-guided semantic edits to a single real image
@Ganitk zoom(%1x7WWl*) slides
Text2Video Imagen Video: High Definition Video Generation with Diffusion Models 2022
read whya text-conditional video generation system based on a cascade of video diffusion models
@maysteinfeld zoom($Y=U45cT) slides
TTS-Diffusion Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech 2021
read whyText-to-speech model with score-based decoder producing mel-spectrograms by gradually transforming noise predicted by encoder and aligned with text input by means of Monotonic Alignment Search.
@amitay-nachmani zoom(@3yMN0gC) slides
3D Shape Synthesis LION: Latent Point Diffusion Models for 3D Shape Generation 2022
read whyHierarchical Latent Point Diffusion Model for 3D shape generation. LION is set up as a variational autoencoder (VAE) with a hierarchical latent space that combines a global shape latent representation with a point-structured latent space.
@matan-feldman zoom(q=v@4WYg) slides
DreamFusion DreamFusion: Text-to-3D using 2D Diffusion 2022
read whyDreamFusion use a pretrained 2D text-to-image diffusion model to perform text-to-3D synthesis
@ShiraBaronn zoom(9gZqV*2Y) slides
FMRI-to-Image with SD High-resolution image reconstruction with latent diffusion models from human brain activity 2023
read whyReconstruct images from FMRI using stable diffusion
@Ganitk zoom(B5J0vf?+) slides
Few cool papers 😎 Control Net, InstructPix2Pix, DreamBooth, Textual-Inversion, Prompt-to-Prompt 2023
read whyClosing the seminar with 5 cool papers
@talbenha zoom(r+52hd5@) slides

Releases

No releases published

Packages

No packages published