Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.
-
Updated
Oct 16, 2024 - Python
Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.
Unlock the potential of Google's Gemini AI models with this versatile toolkit. Offering seamless chat, text generation, and multimodal interactions, supporting various file types, including PDF's, images, videos, audio, text and more. Enjoy real-time responses, customizable parameters, and easy integration for diverse AI tasks.
Uses the powerful WhisperS2T and Ctranslate2 libraries to batch transcribe multiple files
A simple Django project to demonstrate Google Speech Recognition.
Converts speech to text from any audio/video file
Web Chat Robot based on LLama3.2-1B Model at Server-side Deployment with Continuous Conversation
WhisperAudioTranscriber is an asynchronous audio recording and transcription tool built using Python. It utilizes the Hugging Face API, specifically leveraging the powerful capabilities of OpenAI's Whisper model
Add a description, image, and links to the audio-transcribing topic page so that developers can more easily learn about it.
To associate your repository with the audio-transcribing topic, visit your repo's landing page and select "manage topics."