GitHub - jonathandinu/ai4artists: A list of AI Art courses, tools, libraries, people, and places.

Resources at the intersection of AI AND Art. Mainly tools and tutorials but also with some inspiring people and places thrown in too!

For a broader resource covering more general creative coding tools (that you might want to use with what is listed here), check out terkelg/awesome-creative-coding or thatcreativecode.page. For resources on AI and deep learning in general, check out ChristosChristofidis/awesome-deep-learning and https://github.com/dair-ai.

Learning

Courses

General Deep Learning

Deep Generative Modeling

Creative Coding and New Media

⭐️ Deep Learning for Art, Aesthetics, and Creativity (MIT)
Machine Learning for the Web (ITP/NYU)
Art and Machine Learning (CMU)
New Media Installation: Art that Learns (CMU)
Introduction to Computational Media (ITP/NYU)
- Media course
- Code course

Videos

Books

Tutorials and Blogs

Deep Learning

Generative Art

Papers/Methods

Diffusion models (and text-to-image)

SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations: Paper predating Stable Diffusion describing a method for image synthesis and editing with diffusion based models.
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
High-Resolution Image Synthesis with Latent Diffusion Models: Original paper that introduced Stable Diffusion and started it all.
Prompt-to-Prompt Image Editing with Cross-Attention Control: Edit Stable Diffusion outputs by editing the original prompt.
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion: Similar to prompt-to-prompt but instead takes an input image and a text description. Kinda like Style Transfer... but with Stable diffusion.
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation: Similar to Textual Inversion but instead focused on manipulating subject based images (i.e. this thing/person/etc. but underwater).
Novel View Synthesis with Diffusion Models
AudioGen: Textually Guided Audio Generation
Make-A-Video: Text-to-Video Generation without Text-Video Data
Imagic: Text-Based Real Image Editing with Diffusion Models
MDM: Human Motion Diffusion Model
Soft Diffusion: Score Matching for General Corruptions
Multi-Concept Customization of Text-to-Image Diffusion: Like DreamBooth but capable of synthesizing multiple concepts.
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers
Elucidating the Design Space of Diffusion-Based Generative Models (EDM)
Tackling the Generative Learning Trilemma with Denoising Diffusion GANs
Imagen Video: High Definition Video Generation with Diffusion Models

Neural Radiance fields (and NeRF like things)

Structure-from-Motion Revisited: prior work on sparse modeling (still needed/useful for NeRF)
Pixelwise View Selection for Unstructured Multi-View Stereo: prior work on dense modeling (NeRF kinda replaces this)
DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation
Deferred Neural Rendering: Image Synthesis using Neural Textures
Neural Volumes: Learning Dynamic Renderable Volumes from Images
⭐️ NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis: The paper that started it all...
Neural Radiance Fields for Unconstrained Photo Collections: NeRF in the wild (alternative to MVS)
Nerfies: Deformable Neural Radiance Fields: Photorealistic NeRF from casual in-the-wild photos and videos (like from a cellphone)
Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields: NeRF... but BETTER FASTER HARDER STRONGER
Depth-supervised NeRF: Fewer Views and Faster Training for Free: Train NeRF models faster with fewer images by leveraging depth information
Instant Neural Graphics Primitives with a Multiresolution Hash Encoding: caching for NeRF training to make it rlllly FAST
Understanding Pure CLIP Guidance for Voxel Grid NeRF Models: text-to-3D using CLIP
NeRF-SLAM: Real-Time Dense Monocular SLAM with Neural Radiance Fields: NeRF for robots (and cars)
nerf2nerf: Pairwise Registration of Neural Radiance Fields: pretrained NeRF
The One Where They Reconstructed 3D Humans and Environments in TV Shows
ClimateNeRF: Physically-based Neural Rendering for Extreme Climate Synthesis
Realistic one-shot mesh-based head avatars
Neural Point Catacaustics for Novel-View Synthesis of Reflections
3D Moments from Near-Duplicate Photos
NeRDi: Single-View NeRF Synthesis with Language-Guided Diffusion as General Image Priors

3D and point clouds

Unconditional Image Synthesis

Conditional Image Synthesis (and inverse problems)

GAN inversion (and editing)

Latent Space Interpretation

Image Matting

Tools

Generative Modeling

NVIDIA Imaginaire: 2D Image synthesis library
NVIDIA Omniverse: The platform for creating and operating metaverse applications
mmgeneration
Modelverse: Content-Based Search for Deep Generative Models
PaddleGAN

Creative ML

Deep Learning Frameworks

Runtimes/Deployment

Text-to-Image

⭐️ Stable Diffusion
Imagen
DALLE 2
VQGAN+CLIP
Parti
Muse: Text-To-Image Generation via Masked Generative Transformers: More efficient than diffusion or autoregressive text-to-image models used masked image modeling w/ transformers

Stable Diffusion (SD)

Dream Studio: Official Stability AI cloud hosted service.
⭐️ Stable Diffusion Web UI: A user friendly UI for SD with additional features to make common workflows easy.
AI render (Blender): Render scenes in Blender using a text prompt.
Dream Textures (Blender): Plugin to render textures, reference images, and background with SD.
lexica.art - SD Prompt Search.
koi (Krita): SD plugin for Krita for img2img generation.
Alpaca (Photoshop): Photoshop plugin (beta).
Christian Cantrell's Plugin (Photoshop): Another Photoshop plugin.
Stable Diffusion Studio: Animation focused frontend for SD.
DeepSpeed-MII: Low-latency and high-throughput inference for a variety (20,000+) models/tasks, including SD.

Neural Radiance Fields

Creative Coding

Frameworks

⭐️ Processing (Java) and p5.js (Javascript)
openFrameworks (C++)
Cinder (C++)
nannou (Rust)

Visual Programming Languages

Datasets

Permissively Licensed/Open Access

LAION Datasets: Various very large scale image-text pairs datasets (notably used to train the open source Stable Diffusion models).
LAION-Face
Unsplash Images
Pixabay
Pexels
Open Images: Open Images is a dataset of ~9M images annotated with image-level labels, object bounding boxes, object segmentation masks, visual relationships, and localized narratives:
Mozilla Common Voice: 17,127 validated hours of transcribed speech covering 104 languages. Additionally many of the recorded hours in the dataset also include demographic metadata like age, sex, and accent that can help improve the accuracy of speech recognition engines.
Flickr Commons: Flickr Commons is a unique collection of historical photography from over 100 cultural institutions from all around the world, all with no known copyright restrictions.
Internet Archive: Internet Archive is a non-profit library of millions of free books, movies, software, music, websites, and more.
Wikimedia Commons: a collection of 106,323,506 freely usable media files to which anyone can contribute.
Prelinger Archives
Getty Library Open Content Program: Making images from Getty’s collections freely available for study, teaching, and enjoyment.
Smithsonian Open Access
Public Domain Review: Focused on works now fallen into the public domain, the vast commons of out-of-copyright material that everyone is free to enjoy, share, and build upon without restrictions.
Library of Congress
Biodiversity Heritage Library
The Met Open Access
The National Gallery of Art Open Access
Art Institute of Chicago Open Access
NY Public Library Public Domain Collections
Museum für Kunst und Gewerbe Hamburg Steintorplatz
FairFace
Conceptual Captions
Quick, Draw!
Open Images
Visual Question Answering
TensorFlow Flowers
Stanford Online Products dataset
DeepMind 3d Shapes
PASS: An ImageNet replacement for self-supervised pretraining without humans which can be used for high-quality pretraining while significantly reducing privacy concerns.

Faces/People (restricted licenses)

Other

Brutus Light Field

Products/Apps

Artbreeder
Midjourney
DALLE 2 (OpenAI)
Runway - AI powered video editor.
Facet AI - AI powered image editor.
Adobe Sensei - AI powered features for the Creative Cloud suite.
NVIDIA AI Demos
ClipDrop and cleanup.pictures

Artists

A non-exhaustive list of people doing interesting things at the intersection of art, ML, and design.

Institutions/Places

Related lists and collections

Machine Learning for Art
Tools and Resources for AI Art (pharmapsychotic) - Big list of Google Colab notebooks for generative text-to-image techniques as well as general tools and resources.
Awesome Generative Deep Art - A curated list of Generative Deep Art / Generative AI projects, tools, artworks, and models

Contributing

Contributions are welcome! Read the contribution guidelines first.

Name		Name	Last commit message	Last commit date
Latest commit History 116 Commits
images		images
.gitattributes		.gitattributes
LICENSE		LICENSE
README.md		README.md

License

jonathandinu/ai4artists

Folders and files

Latest commit

History

Repository files navigation

Contents