Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Updated Nov 25, 2024 - Python
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
An audio/acoustic activity detection and audio segmentation tool
Gecko - A Tool for Effective Annotation of Human Conversations
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
Statistical model-based Voice Activity Detection
A stream-translator fork with VAD-based audio slicing and GPT / Gemini translation.
Efficient voice activity detection algorithm using long-term speech information
Binary classification of audio recordings to detect human voices. Implemented using PyTorch and Librosa.
Spoofing voice detection: 2nd YAICON
End to end AWS SageMaker application for detecting the AWS Polly voice in an audio recording using Gluon and MXNet.
iOS Voice Activity Detection (VAD). Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
A p5.js experiment that uses voice detection and cursor movement to multiply creative content in a variety of colours.
TranscribeTube is a Python tool that transcribes and generates subtitles for videos from local files or YouTube links using Hugging Face models. It features an interactive Gradio web interface, allowing users to easily upload videos, select languages, and download subtitles in SRT format.
A Python project that handles speech commands and retrieves results from Google or Wikipedia based on the spoken input. Functions are organized in separate files, with a single raw file to execute the project. This repository is intended for project purposes and will be updated with additional features in the future.
Config files for my GitHub profile.