ArtFusion: Controllable Arbitrary Style Transfer using Dual Conditional Latent Diffusion Models
Official PyTorch Implementation
Author: Dar-Yen Chen
This implementation is based on the CompVis/latent-diffusion repository.
Our paper presents the first learning-based diffusion model for arbitrary style transfer. ArtFusion exhibits outstanding controllability and faithful representation of artistic details.
ArtFusion gives users the flexibility to balance source content and reference style in the output, catering to diverse stylization preferences: results range from distinct content structures to pronounced stylization.
ArtFusion can capture core style characteristics that are typically overlooked by state-of-the-art methods, such as the blurry edges typical of Impressionist art, the texture of oil paint, and characteristic brush strokes.
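As a rough illustration of the content/style trade-off, the sketch below shows one plausible way dual classifier-free guidance could combine separate content- and style-conditioned noise predictions. The denoise wrapper, the null conditions, and the exact guidance formulation are assumptions for illustration and do not necessarily match the implementation in this repository.

def dual_guided_eps(denoise, z_t, t, c_content, c_style, s_content=1.0, s_style=3.0):
    # Hypothetical two-dimensional classifier-free guidance.
    # denoise(z_t, t, content_cond, style_cond) is an assumed wrapper around the
    # model's noise prediction; None stands for a dropped (null) condition.
    # A larger s_style pushes the result toward the reference style, while a
    # larger s_content preserves more of the source structure.
    eps_uncond = denoise(z_t, t, None, None)
    eps_content = denoise(z_t, t, c_content, None)
    eps_style = denoise(z_t, t, None, c_style)
    return (eps_uncond
            + s_content * (eps_content - eps_uncond)
            + s_style * (eps_style - eps_uncond))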
Create and activate the conda environment:
conda env create -f environment.yaml
conda activate artfusion
The style dataset we use is WikiArt from Kaggle, which is collected from WIKIART.
The content dataset is MS COCO 2017.
Please download and place the datasets as:
└── datasets
├── ms_coco
└── wiki-art
Download the first-stage VAE used in LDM and place it at ./checkpoints/vae/kl-f16.ckpt.
Then run the training command:
python main.py \
--name experiment_name \
--base ./configs/kl16_content12.yaml \
--basedir ./checkpoints \
-t True \
--gpus 0,
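The --gpus 0, argument follows the PyTorch Lightning convention used by the latent-diffusion training script this repository builds on; to train on multiple GPUs, a comma-separated list such as --gpus 0,1, should presumably work.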
The pretrained model can be downloaded here. Please place it in the folder ./checkpoints/artfusion/.
Inference can be done via the notebook.
Run the following to register the conda environment as a kernel for the Jupyter notebook:
python -m ipykernel install --user --name artfusion
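As a rough sketch of what loading the pretrained model inside the notebook might look like, following the CompVis/latent-diffusion conventions this repository builds on (the checkpoint filename below is a placeholder; the actual notebook code may differ):

import torch
from omegaconf import OmegaConf
from ldm.util import instantiate_from_config  # helper provided by the latent-diffusion codebase

# Build the model from the training config and load the pretrained weights.
config = OmegaConf.load("./configs/kl16_content12.yaml")
model = instantiate_from_config(config.model)
ckpt = torch.load("./checkpoints/artfusion/artfusion.ckpt", map_location="cpu")  # filename is an assumption
model.load_state_dict(ckpt["state_dict"], strict=False)
model = model.cuda().eval()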
This project is released under the MIT License.
If you find this repository useful for your research, please cite it using the following BibTeX entry:
@misc{chen2023artfusion,
title={ArtFusion: Controllable Arbitrary Style Transfer using Dual Conditional Latent Diffusion Models},
author={Dar-Yen Chen},
year={2023},
eprint={2306.09330},
archivePrefix={arXiv},
primaryClass={cs.CV}
}