Guided Image-to-Image Translation papers

Feel free to send a PR or issue. (constantly updating)

Class Label Guided
Action Unit Guided
Facial Landmark Guided
Pose Guided Person Image Generation
Segmentation Map Guided Scene Image Generation
Texture Patch Guided
Example Guided
Attention Guided
Mask Guided
Text Guided
Audio Guided

Class Label Guided

Model	Paper	Conference	Arxiv	Code
IcGAN	Invertible Conditional GANs for image editing	NeurIPSW 2016	1611.06355	Guim3/IcGAN
Conditional CycleGAN	Conditional CycleGAN for Attribute Guided Face Image Generation	ECCV 2018	1705.09966
StarGAN	StarGAN: Uniﬁed Generative Adversarial Networks for Multi-Domain Image-to-Image Translation	CVPR 2018	1711.09020	yunjey/StarGAN
AGUIT	Attribute Guided Unpaired Image-to-Image Translation with Semi-supervised Learning		1904.12428	imlixinyang/AGUIT
AttGAN	AttGAN: Facial Attribute Editing by Only Changing What You Want	TIP 2019	1711.10678	LynnHo/AttGAN-Tensorflow
SGGAN	Sparsely Grouped Multi-task Generative Adversarial Networks for Facial Attribute Manipulation	MM 2018	1805.07509	zhangqianhui/Sparsely-Grouped-GAN
RelGAN	RelGAN: Multi-Domain Image-to-Image Translation via Relative Attributes	ICCV 2019	1908.07269	elvisyjlin/RelGAN-PyTorch, willylulu/RelGAN

Action Unit Guided

Model	Paper	Conference	Arxiv	Code
GANimation	GANimation: Anatomically-aware Facial Animation from a Single Image	ECCV 2018	1807.09251	albertpumarola/GANimation

Facial Landmark Guided

Model	Paper	Conference	Arxiv	Code
G2GAN	Geometry Guided Adversarial Facial Expression Synthesis	MM 2018	1712.03474
CMM-Net	Every Smile is Unique: Landmark-Guided Diverse Smile Generation	CVPR 2018	1802.01873
C2GAN	Cycle In Cycle Generative Adversarial Networks for Keypoint-Guided Image Generation	MM 2019	1908.00999	Ha0Tang/C2GAN
	Few-Shot Adversarial Learning of Realistic Neural Talking Head Models	ICCV 2019	1905.08233	grey-eye/talking-heads

Pose Guided Person Image Generation

Model	Paper	Conference	Arxiv	Code
PG2	Pose Guided Person Image Generation	NeurIPS 2017	1705.09368	charliememory/Pose-Guided-Person-Image-Generation
PoseGAN	Deformable GANs for Pose-Based Human Image Generation	CVPR 2018	1801.00055	AliaksandrSiarohin/pose-gan
VUnet	A Variational U-Net for Conditional Appearance and Shape Generation	CVPR 2018	1804.04694	CompVis/vunet
PoseWarp	Synthesizing Images of Humans in Unseen Poses	CVPR 2018	1804.07739	posewarp-cvpr2018
DPIG	Disentangled Person Image Generation	CVPR 2018	1712.02621	charliememory/Disentangled-Person-Image-Generation
FD-GAN	FD-GAN: Pose-guided Feature Distilling GAN for Robust Person Re-identification	NeurIPS 2018	1810.02936	yxgeee/FD-GAN
PN-GAN	Pose-Normalized Image Generation for Person Re-identification	ECCV 2018	1712.02225	naiq/PN_GAN
GestureGAN	GestureGAN for Hand Gesture-to-Gesture Translation in the Wild	MM 2018	1808.04859	Ha0Tang/GestureGAN
PATN	Progressive Pose Attention for Person Image Generation	CVPR 2019	1904.03349	tengteng95/Pose-Transfer
SPT	Unsupervised Person Image Generation with Semantic Parsing Transformation	CVPR 2019	1904.03379	SijieSong/person_generation_spt
	Coordinate-based Texture Inpainting for Pose-Guided Human Image Generation	CVPR 2019	1811.11459	project
IntrinsicFlow	Dense intrinsic appearance flow for human pose transfer	CVPR 2019	1903.11326	ly015/intrinsic_flow
TriangleGAN	Gesture-to-Gesture Translation in the Wild via Category-Independent Conditional Maps	MM 2019	1907.05916	yhlleo/TriangleGAN
Pix2pixHD + Temporal Smoothing + FaceGAN	Everybody Dance Now	ICCV 2019	1808.07371	project
LiquidWarpingGAN	Liquid warping gan: A unified framework for human motion imitation, appearance transfer and novel view synthesis	ICCV 2019	1909.12224	svip-lab/impersonator
Global-Flow-Local-Attention	Deep Image Spatial Transformation for Person Image Generation	CVPR 2020	2003.00696	RenYurui/Global-Flow-Local-Attention
ADGAN	Controllable Person Image Synthesis With Attribute-Decomposed GAN	CVPR 2020	2003.12267	menyifang/ADGAN
CoCosNet	Cross-domain Correspondence Learning for Exemplar-based Image Translation	CVPR 2020	2004.05571	microsoft/CoCosNet
SMIS	Semantically Multi-modal Image Synthesis	CVPR 2020	2003.12697	Seanseattle/SMIS
MISC	MISC: Multi-Condition Injection and Spatially-Adaptive Compositing for Conditional Person Image Synthesis	CVPR 2020	cvpr20
Warp3d_Reposing	Reposing Humans by Warping 3D Features	CVPR 2020 Workshop	2006.04898	MKnoche/warp3d_reposing
	Wish You Were Here: Context-Aware Human Generation	CVPR 2020	2005.10663
PoseStylizer	Generating Person Images with Appearance-aware Pose Stylizer	IJCAI 2020	2007.09077	siyuhuang/PoseStylizer
XingGAN	XingGAN for Person Image Generation	ECCV 2020	2007.09278	Ha0Tang/XingGAN

Segmentation Map Guided Scene Image Generation

Model	Paper	Conference	Arxiv	Code
CRN	Photographic Image Synthesis with Cascaded Refinement Networks	ICCV 2017	1707.09405	CQFIO/PhotographicImageSynthesis
CrossNet	Predicting Ground-Level Scene Layout from Aerial Imagery	CVPR 2017	1612.02709	viibridges/crossnet
SIMS	Semi-parametric Image Synthesis	CVPR 2018	1804.10992	xjqicuhk/SIMS
Pix2PixHD	High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs	CVPR 2018	1711.11585	NVIDIA/pix2pixHD
X-Fork & X-Seq	Cross-View Image Synthesis using Conditional GANs	CVPR 2018	1803.03396	kregmi/cross-view-image-synthesis
Vid2Vid	Video-to-Video Synthesis	NeurIPS 2018	1808.06601	NVIDIA/vid2vid
SPADE	Semantic Image Synthesis with Spatially-Adaptive Normalization	CVPR 2019	1903.07291	NVlabs/SPADE
SelectionGAN	Multi-Channel Attention Selection GAN with Cascaded Semantic Guidance for Cross-View Image Translation	CVPR 2019	1904.06807	Ha0Tang/SelectionGAN
Art2Real	Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-to-Image Translation	CVPR 2019	1811.10666	aimagelab/art2real
	Mask-Guided Portrait Editing with Conditional GANs	CVPR 2019	1905.10346	cientgu/Mask_Guided_Portrait_Editing
Seg2Vid	Video Generation from Single Semantic Label Map	CVPR 2019	1903.04480	junting/seg2vid
	Semantic Bottleneck Scene Generation		1911.11357
Few-shot Vid2Vid	Few-shot Video-to-Video Synthesis	NeurIPS 2019	1910.12713	NVlabs/few-shot-vid2vid
CC-FPSE	Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image Synthesis	NeurIPS 2019	1910.06809	xh-liu/CC-FPSE
SEAN	SEAN: Image Synthesis with Semantic Region-Adaptive Normalization	CVPR 2020	1911.12861	ZPdesu/SEAN
BachGAN	BachGAN: High-Resolution Image Synthesis from Salient Object Layout	CVPR 2020	2003.11690	Cold-Winter/BachGAN
	Panoptic-based Image Synthesis	CVPR 2020	2004.10289
SMIS	Semantically Multi-modal Image Synthesis	CVPR 2020	2003.12697	Seanseattle/SMIS
GAN Compression	GAN Compression: Efficient Architectures for Interactive Conditional GANs	CVPR 2020	2003.08936	mit-han-lab/gan-compression
LGGAN	Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation	CVPR 2020	1912.12215	Ha0Tang/LGGAN
TSIT	TSIT: A Simple and Versatile Framework for Image-to-Image Translation	ECCV 2020	2007.12072	EndlessSora/TSIT
SegVAE	Controllable Image Synthesis via SegVAE	ECCV 2020	2007.08397	yccyenchicheng/SegVAE
SESAME	SESAME: Semantic Editing of Scenes by Adding, Manipulating or Erasing Objects	ECCV 2020	2004.04977
Style Semantics	Controlling Style and Semantics in Weakly-Supervised Image Generation	ECCV 2020	1912.03161	dariopavllo/style-semantics

Texture Patch Guided

Model	Paper	Conference	Arxiv	Code
TextureGAN	TextureGAN: Controlling Deep Image Synthesis with Texture Patches	CVPR 2018	1706.02823	janesjanes/Pytorch-TextureGAN
Guided-pix2pix	Guided Image-to-Image Translation with Bi-Directional Feature Transformation	ICCV 2019	1910.11328	vt-vl-lab/Guided-pix2pix

Example Guided

Model	Paper	Conference	Arxiv	Code
EG-UNIT	Exemplar Guided Unsupervised Image-to-Image Translation	ICLR 2019	1805.11145	charliememory/EGSC-IT
Pix2pixSC	Example-Guided Style-Consistent Image Synthesis from Semantic Labeling	CVPR 2019	1906.01314	cxjyxxme/pix2pixSC

Attention Guided

Model	Paper	Conference	Arxiv	Code
DA-GAN	DA-GAN: Instance-level Image Translation by Deep Attention Generative Adversarial Networks	CVPR 2018	1802.06454
Attention-GAN	Attention-GAN for Object Transfiguration in Wild Images	ECCV 2018	1803.06798
UAIT	Unsupervised Attention-guided Image to Image Translation	NeurIPS 2018	1806.02311	AlamiMejjati/Unsupervised-Attention-guided-Image-to-Image-Translation
	Show, Attend and Translate: Unsupervised Image Translation with Self-Regularization and Attention	TIP 2019	1806.06195
AttentionGAN	Attention-Guided Generative Adversarial Networks for Unsupervised Image-to-Image Translation	IJCNN 2019	1903.12296	Ha0Tang/AttentionGAN
U-GAT-IT	U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation	ICLR 2020	1907.10830	taki0112/UGATIT, znxlwm/UGATIT-pytorch

Mask Guided

Model	Paper	Conference	Arxiv	Code
ContrastGAN	Generative Semantic Manipulation with Mask-Contrasting GAN	ECCV 2018	1708.00315
InstaGAN	Instance-aware image-to-image translation	ICLR 2019	1812.10889	sangwoomo/instagan
INIT	Towards Instance-level Image-to-Image Translation	CVPR 2019	1905.01744	project

Text Guided

Model	Paper	Conference	Arxiv	Code
ControlGAN	Controllable Text-to-Image Generation	NeurIPS 2019	1909.07083	mrlibw/ControlGAN
DMIT	Multi-mapping Image-to-Image Translation via Learning Disentanglement	NeurIPS 2019	1909.07877	Xiaoming-Yu/DMIT
ManiGAN	ManiGAN: Text-Guided Image Manipulation		1912.06203
RefinedGAN	Image-to-Image Translation with Text Guidance		2002.05235

Audio Guided

Model	Paper	Conference	Arxiv	Code
X2Face	X2Face: A Network for Controlling Face Generation using Images, Audio, and Pose Codes	ECCV 2018	1807.10550	oawiles/X2Face

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Guided Image-to-Image Translation papers

Class Label Guided

Action Unit Guided

Facial Landmark Guided

Pose Guided Person Image Generation

Segmentation Map Guided Scene Image Generation

Texture Patch Guided

Example Guided

Attention Guided

Mask Guided

Text Guided

Audio Guided

Files

README.md

Latest commit

History

README.md

File metadata and controls

Guided Image-to-Image Translation papers

Class Label Guided

Action Unit Guided

Facial Landmark Guided

Pose Guided Person Image Generation

Segmentation Map Guided Scene Image Generation

Texture Patch Guided

Example Guided

Attention Guided

Mask Guided

Text Guided

Audio Guided