TTS
Foundational model for human-like, expressive TTS
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Zero-Shot Speech Editing and Text-to-Speech in the Wild
🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.
MARS5 speech model (TTS) from CAMB.AI
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
A generative speech model for daily dialogue.
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.