Skip to content

Speech-to-Image is an AI application that converts spoken prompts into images. By capturing voice input, it transforms recognized speech into text, which is then used to generate visuals. Built with Streamlit, this project offers a user-friendly interface for an engaging and interactive creative experience.

Notifications You must be signed in to change notification settings

jaidh01/Speech-to-Image

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

🎤 Speech-to-Image Generation

📘 Overview

Speech-to-Image is a unique application that converts spoken input into AI-generated images. Utilizing speech recognition to capture user prompts, the project processes the recognized text and generates corresponding images through text-to-image modeling. Built using Streamlit for a user-friendly interface, this project highlights voice-driven creativity and AI interaction.

🔧 Features

  • Voice Recognition: Capture prompts through speech.
  • AI-Powered Image Generation: Translates text into visuals.
  • Interactive UI: Simple interface for speech-to-image experience.

📂 Repository Structure

  • SI_streamlit.py: Main script for the Streamlit app, integrating speech recognition and image generation functionalities.

🛠️ Getting Started

  1. Clone the Repository:
    git clone https://github.com/jaidh01/Speech-to-Image.git
  2. Run the Application:
    streamlit run SI_streamlit.py

Requirements

  • Python 3.8+
  • Streamlit
  • Speech Recognition libraries

🤝 Contributions

Contributions are welcome! Fork the repository, enhance it, and submit a pull request.

About

Speech-to-Image is an AI application that converts spoken prompts into images. By capturing voice input, it transforms recognized speech into text, which is then used to generate visuals. Built with Streamlit, this project offers a user-friendly interface for an engaging and interactive creative experience.

Resources

Stars

Watchers

Forks

Languages