Releases: KoljaB/RealtimeSTT
Releases · KoljaB/RealtimeSTT
v0.3.93
v0.3.92
v0.3.91
v0.3.9
RealtimeSTT v0.3.9 Release Notes
🚀 New Features
Batched Transcription
- Added support for batched transcription in both main and real-time models which improves performance and efficiency
- New parameters introduced:
batch_size
: Controls the batch size for main transcription tasks.realtime_batch_size
: Configures batch size for real-time transcription.
This feature is designed to speed up processing. I can't say yet if there may be cases where batching overhead impacts performance negatively. It looked promising for me in initial tests, but I need your feedback! Please report if you get into any issues or notice even slower transcription due to batching.
v0.3.81
RealtimeSTT 0.3.81
Enhanced CLI Interface
- Introduced the
-sed
command for improved speech end detection - Added the
-l
command to set the language - Implemented the
-L
command to quickly display a list of all available audio input devices - Enabled setting the input device index .
- Improved piping support for seamless with
>
or|
v0.3.7
RealtimeSTT 0.3.7
- fixed a bug to make client terminate gracefully (logged websocket error in debug mode before)
- reworked the CLI interfaces and added shorter commands (for example --writechunks is now -W or --write, for more information please look into the Client Server Readme)
v0.3.6
RealtimeSTT 0.3.6
- more logging for client/server:
Additional parameters for server:- --use_extended_logging, writes extensive log messages for the recording worker, that processes the audio chunks
- --debug, enables debug logging for detailed server operations
- --logchunks, enables logging of incoming audio chunks (periods)
- --writechunks, saves received audio chunks to a WAV file
Additional parameters for client: - --debug, enables debug logging for detailed client operations
- --writechunks, saves recorded audio chunks to a WAV file
- more logging for AudioToTextRecorder when called with use_extended_logging = True
- new init_realtime_after_seconds parameter for AudioToTextRecorder to finetune the default of 0.2s
v0.3.5
v0.3.4
v0.3.2
RealtimeSTT 0.3.2
New Features:
- server/stt_server.py and AudioToTextRecorderClient class now support wake words (all parameters and callbacks of AudioToTextRecorder should now have been already implemented into AudioToTextRecorderClient class, please write an issue if you miss a functionality)
- update microphone reconnect