Skip to content

Text-to-Image Synthesis using Multimodal (VQGAN + CLIP) Architectures

Notifications You must be signed in to change notification settings

sbmagar13/VQGAN-CLIP-Text-to-Image

Repository files navigation

VQGAN-CLIP-Text-to-Image

Project is here: Link to Colab
All step-by-step explanations of codes: Link to the post

Read story on Medium: Link to Medium
I guarantee you that you'll completely understand every single step. And complete an advanced GAN project.

text prompt(input): #A man fighting with a bull

Output: After 100 epochs
crop:


result: