This project develops an application that generates images and music from user input, using pre-trained deep learning models accessed through an Application Programming Interface (API). Users enter text prompts, and the models generate a corresponding image and music track.
The system is divided into two parts: getting the input from the user, and displaying the generated output to the user.
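As a rough sketch, these two parts could be wired together in a single Flask route like the one below; the route, form-field, and template names are illustrative assumptions, not necessarily the names used in the actual aigen application.

```python
from flask import Flask, render_template, request

app = Flask(__name__)

@app.route("/", methods=["GET", "POST"])
def index():
    if request.method == "POST":
        # Part 1: get the text prompts from the user.
        image_prompt = request.form["image_prompt"]
        music_prompt = request.form["music_prompt"]
        # (the pre-trained models would be called via the Inference API here)
        # Part 2: display the generated output to the user.
        return render_template("result.html",
                               image_prompt=image_prompt,
                               music_prompt=music_prompt)
    return render_template("index.html")
```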
The music and image are generated using publicly available pre-trained models accessed through the Hugging Face Inference API.
Image Generating Model: Stable Diffusion is a latent text-to-image diffusion model capable of generating photo-realistic images given any text input.
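As an illustration, the Inference API can be queried with a plain HTTP POST. The sketch below assumes the runwayml/stable-diffusion-v1-5 checkpoint and a placeholder API token, both of which may differ from what the application actually uses.

```python
import requests

# Model ID and token are assumptions for illustration; any hosted
# Stable Diffusion checkpoint is queried the same way.
API_URL = "https://api-inference.huggingface.co/models/runwayml/stable-diffusion-v1-5"
HEADERS = {"Authorization": "Bearer <YOUR_HF_API_TOKEN>"}

def generate_image(prompt: str) -> bytes:
    # The Inference API returns the generated image as raw bytes on success.
    response = requests.post(API_URL, headers=HEADERS, json={"inputs": prompt})
    response.raise_for_status()
    return response.content

if __name__ == "__main__":
    with open("output.png", "wb") as f:
        f.write(generate_image("Boat with sunrise"))
```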
Music Generating Model: Riffusion is a latent text-to-image diffusion model capable of generating spectrogram images given any text input. These spectrograms can be converted into audio clips.
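The same request pattern works for Riffusion, with the extra step of turning the returned spectrogram image into audio. The sketch below is only an approximation: it assumes the riffusion/riffusion-model-v1 checkpoint and inverts the spectrogram with Griffin-Lim, whereas Riffusion's reference implementation uses its own amplitude scaling, so the resulting audio quality will differ.

```python
import io

import numpy as np
import requests
import torch
import torchaudio
from PIL import Image

# Model ID and token are assumptions for illustration.
API_URL = "https://api-inference.huggingface.co/models/riffusion/riffusion-model-v1"
HEADERS = {"Authorization": "Bearer <YOUR_HF_API_TOKEN>"}

def generate_spectrogram(prompt: str) -> Image.Image:
    # The Inference API returns the spectrogram as raw image bytes.
    response = requests.post(API_URL, headers=HEADERS, json={"inputs": prompt})
    response.raise_for_status()
    return Image.open(io.BytesIO(response.content)).convert("L")

def spectrogram_to_audio(image: Image.Image) -> torch.Tensor:
    # Flip so low frequencies sit in row 0, treat pixel brightness as
    # magnitude, and invert with Griffin-Lim as a rough stand-in for
    # Riffusion's own reconstruction pipeline.
    pixels = np.flipud(np.array(image)).copy()
    magnitudes = torch.from_numpy(pixels).float() / 255.0
    n_fft = (magnitudes.shape[0] - 1) * 2  # freq bins = n_fft // 2 + 1
    griffin_lim = torchaudio.transforms.GriffinLim(n_fft=n_fft, power=1.0)
    return griffin_lim(magnitudes)

if __name__ == "__main__":
    waveform = spectrogram_to_audio(generate_spectrogram("fun disco"))
    torchaudio.save("output.wav", waveform.unsqueeze(0), 44100)
```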
SAMPLE 1: Input ‘Boat with sunrise’ and ‘Pleasant heavy metal’
SAMPLE 2: Input ‘Sparrow in tree abstract’ and ‘fun disco’
- Download the repository or clone it locally
- Create a Python virtual environment using
python -m venv venv
- Once created, activate the virtual environment using
source venv/bin/activate (macOS/Linux) or venv\Scripts\activate (Windows)
- Install the packages from requirements.txt using
pip install -r requirements.txt
- Run the application using
flask --app aigen run