Releases: VRCWizard/TTS-Voice-Wizard
v1.6.8.1
v1.6.8.1
- minor fixes to chatgpt integration tab
v1.6.8
- OpenAI TTS
- Available with Pro and with your own API key
- ChatGPT Integration (Text to ChatGPT to Speech)
- Available with Pro and with your own API key. (with pro you can send a prompt and your current message. With your own key you can send full unlimited context)
- Code optimization to reduce response times, a couple ms have been shaved off of several TTS and STT options
- Faster responses from pro tts options and deepgram speech to text
- VRChat chatbox typing indicator toggle
- Minor UI changes
- Deepgram silence scale
- tooltips added for some ui elements
- Bug fixes
- fixed issue when switched stt methods when whisper was running
v1.6.7.3
- Deepgram Aura TTS (english voices only) available on the dev branch of Voice Wizard Pro
- Listen to voices here: https://developers.deepgram.com/docs/tts-models
- Added button to add progressbar for media integration
- Added tooltips to media integration buttons and some general setting buttons
- Added Albanian language for TTS though Azure/Pro and Speech to Text via Azure.
- Added "/TTSVoiceWizard/TextToText" OSC endpoint
- hotkeys no longer trigger multiple times when held down
- Removed zh-CN-XiaoxuanNeural voice from azure voice list
v1.6.8
v1.6.8 (current)
- OpenAI TTS
- Available with Pro and with your own API key
- ChatGPT Integration (Text to ChatGPT to Speech)
- Available with Pro and with your own API key. (with pro you can send a prompt and your current message. With your own key you can send full unlimited context)
- Code optimization to reduce response times, a couple ms have been shaved off of several TTS and STT options
- Faster responses from pro tts options and deepgram speech to text
- VRChat chatbox typing indicator toggle
- Minor UI changes
- Deepgram silence scale
- tooltips added for some ui elements
- Bug fixes
- fixed issue when switched stt methods when whisper was running
v1.6.7.3
- Deepgram Aura TTS (english voices only) available on the dev branch of Voice Wizard Pro
- Listen to voices here: https://developers.deepgram.com/docs/tts-models
- Added button to add progressbar for media integration
- Added tooltips to media integration buttons and some general setting buttons
- Added Albanian language for TTS though Azure/Pro and Speech to Text via Azure.
- Added "/TTSVoiceWizard/TextToText" OSC endpoint
- hotkeys no longer trigger multiple times when held down
- Removed zh-CN-XiaoxuanNeural voice from azure voice list
v1.6.7.3 Deepgram Aura
- Deepgram Aura TTS (english voices only) available on the dev branch of Voice Wizard Pro
- Listen to voices here: https://developers.deepgram.com/docs/tts-models
- Added button to add progressbar for media integration
- Added tooltips to media integration buttons and some general setting buttons
- Added Albanian language for TTS though Azure/Pro and Speech to Text via Azure.
- Added "/TTSVoiceWizard/TextToText" OSC endpoint
- hotkeys no longer trigger multiple times when held down
- Removed zh-CN-XiaoxuanNeural voice from azure voice list
v1.6.7.1
- Fix for playing local files on spotify that don't have an album artist
v1.6.7
v1.6.7 (current)
- Updated the CoreOSC package to allow for receiving utf8 characters via OSC
- Changed the elements of the progress bar variable
{progressBar E:◯ L:13}
to use more fitting utf characters. ┣━━━━┫ vs |-----------|
v1.6.6.8
- 2 optional override bools added for OSC to TTS endpoint
/TTSVoiceWizard/TextToSpeech
bool useChatbox
bool useKillFrenzyAvatarText
The bools allow you to optionally forward text the VRC Chatbox or Killfrenzy Avatar Text without adjusting the in app settings. When sending an OSC message to this endpoint you may still send just a string or both bools (sending only one of the bools will not override the settings).
Example of possible combinations in TouchOSC:
v1.6.6.7
- fixed vrchat counter causing timeout if different counters were activated in quick succession.
- spotify exceptions should always give a message.
v1.6.6.5
- minor bug fixes
- ElevenLabs turbo model added
- removed voices for TikTok API that no longer functioned (optimus prime voices)
- New Azure Voices:
es-ES | Spanish (Spain) | XimenaNeural (Female)
fr-CA | French (Canada) | ThierryNeural (Male)
fr-FR | French (France) | VivienneNeural (Female)
it-IT | Italian (Italy) | GiuseppeNeural (Male)
ko-KR | Korean (Korea) | HyunsuNeural (Male)
pt-BR | Portuguese (Brazil) | ThalitaNeural (Female)
zh-CN-XiaoxiaoDialectsNeural
de-DE | German (Germany) | de-DE-FlorianMultilingualNeural (Male)
de-DE | German (Germany) | de-DE-SeraphinaMultilingualNeural (Female)
en-US | English (United States) | en-US-AvaMultilingualNeural (Female)
en-US | English (United States) | en-US-EmmaMultilingualNeural (Female)
fr-FR | French (France) | fr-FR-RemyMultilingualNeural (Male)
en-US | English (United States) | en-US-BrianMultilingualNeural (Male)
en-US | English (United States) | en-US-AndrewMultilingualNeural (Male)
fr-FR | French (France) | fr-FR-VivienneMultilingualNeural (Female)
zh-CN | Chinese (Mandarin, Simplified) | zh-CN-XiaoxiaoMultilingualNeural (Female)
zh-CN | Chinese (Mandarin, Simplified) | zh-CN-XiaochenMultilingualNeural (Female)
zh-CN | Chinese (Mandarin, Simplified) | zh-CN-YunyiMultilingualNeural (Male)
v1.6.5.2
- uberduck voice selection fix (although some voices in the list no longer function)
- now using latest version of windows media controller, (report any crashing and include event viewer details)
- {albumArtist} added for windows media and Spotify media integration
v1.6.5
- vrchat "/chatbox/typing" indicator added for whisper and deepgram (if VAD is active)
- fix for whisper timestamps being offset for VAD so super short activations wouldn't be recognized
- possible fix for group in word replacement tab breaking on some computers
- hotkeys backend refactored, new hotkeys added for changing voice presets quickly
v1.6.6.8
- 2 optional override bools added for OSC to TTS endpoint
/TTSVoiceWizard/TextToSpeech
bool useChatbox
bool useKillFrenzyAvatarText
The bools allow you to optionally forward text the VRC Chatbox or Killfrenzy Avatar Text without adjusting the in app settings. When sending an OSC message to this endpoint you may still send just a string or both bools (sending only one of the bools will not override the settings).
v1.6.6.7
v1.6.6.5
v1.6.6.5 (current)
- minor bug fixes
- ElevenLabs turbo model added
- removed voices for TikTok API that no longer functioned (optimus prime voices)
- New Azure Voices:
es-ES | Spanish (Spain) | XimenaNeural (Female)
fr-CA | French (Canada) | ThierryNeural (Male)
fr-FR | French (France) | VivienneNeural (Female)
it-IT | Italian (Italy) | GiuseppeNeural (Male)
ko-KR | Korean (Korea) | HyunsuNeural (Male)
pt-BR | Portuguese (Brazil) | ThalitaNeural (Female)
zh-CN-XiaoxiaoDialectsNeural
de-DE | German (Germany) | de-DE-FlorianMultilingualNeural (Male)
de-DE | German (Germany) | de-DE-SeraphinaMultilingualNeural (Female)
en-US | English (United States) | en-US-AvaMultilingualNeural (Female)
en-US | English (United States) | en-US-EmmaMultilingualNeural (Female)
fr-FR | French (France) | fr-FR-RemyMultilingualNeural (Male)
en-US | English (United States) | en-US-BrianMultilingualNeural (Male)
en-US | English (United States) | en-US-AndrewMultilingualNeural (Male)
fr-FR | French (France) | fr-FR-VivienneMultilingualNeural (Female)
zh-CN | Chinese (Mandarin, Simplified) | zh-CN-XiaoxiaoMultilingualNeural (Female)
zh-CN | Chinese (Mandarin, Simplified) | zh-CN-XiaochenMultilingualNeural (Female)
zh-CN | Chinese (Mandarin, Simplified) | zh-CN-YunyiMultilingualNeural (Male)
v1.6.5.2
- uberduck voice selection fix (although some voices in the list no longer function)
- now using latest verson of windows media controller, report any crashing and include event viewer details
- {albumArtist} added for windows media and spotify media integration
v1.6.5
- vrchat "/chatbox/typing" indicator added for whisper and deepgram (if VAD is active)
- fix for whisper timestamps being offset for VAD so super short activations wouldn't be recognized
- possible fix for group in word replacement tab breaking on some computers
- hotkeys backend refactored, new hotkeys added for changing voice presets quickly
v1.6.6.1
This is a demo of a different whisper library, it has not been thoroughly tested so there are likely bugs
v1.6.6.1 (current)
- more robust output purification and removal
- metrics on by default for this version
- metrics changed to be more visible and distinguishable
- fixed model download loop for models that were considered too small to be real
v1.6.6
- Fixed memory leak in previous version (which caused crashing, oopsie)
Min VAD Duration (s)
andMin Avg Confidence
have been added to help fight against hallucinated outputs, you can enable debug text to see everything that gets filtered out
v1.6.5.9
- Whisper Library Changed from https://github.com/Const-me/Whisper to https://github.com/sandrohanea/whisper.net (so complete rework behind the scenes)
- Whisper can be "Recognize Once" or continuous
- Whisper model selection now uses absolute path
- Whisper can be GPU (and CPU) or just CPU
- Whisper now supports most if not all models here https://huggingface.co/ggerganov/whisper.cpp/tree/main (including quantized models)
- Whisper now supports distil models: https://huggingface.co/collections/distil-whisper/distil-whisper-models-65411987e6727569748d2eb6
v1.6.6 New Whisper Library
This is a demo of a different whisper library, it has not been thoroughly tested so there are likely bugs
v1.6.6 (current)
- Fixed memory leak in previous version (which caused crashing, oopsie)
Min VAD Duration (s)
andMin Avg Confidence
have been added to help fight against hallucinated outputs, you can enable debug text to see everything that gets filtered out
v1.6.5.9
- Whisper Library Changed from https://github.com/Const-me/Whisper to https://github.com/sandrohanea/whisper.net (so complete rework behind the scenes)
- Whisper can be "Recognize Once" or continuous
- Whisper model selection now uses absolute path
- Whisper can be GPU (and CPU) or just CPU
- Whisper now supports most if not all models here https://huggingface.co/ggerganov/whisper.cpp/tree/main (including quantized models)
- Whisper now supports distil models: https://huggingface.co/collections/distil-whisper/distil-whisper-models-65411987e6727569748d2eb6