A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
-
Updated
Dec 13, 2024 - Python
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
In defence of metric learning for speaker recognition
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
The authors' implementation of the "Neural Head Reenactment with Latent Pose Descriptors" (CVPR 2020) paper.
[WACV 2024] "CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer"
Speaker identification with VGGVox network
Python toolkit for speech processing
Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021
Voxceleb1 i-vector based speaker recognition system
Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments
[IJCAI2022] Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast
Official implementation of the ICASSP 2024 paper: Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification
Voice gender classifier using ECAPA-TDNN
This project partially embodies the state-of-the-art practices in speaker verification technology up until 2020, while attaining the state-of-the-art performance on the VoxCeleb1 test sets.
Few-shot learning experiments mostly on speaker recognition.
SOTA method for self-supervised speaker verification leveraging a large-scale pretrained ASR model.
说话人识别仓库-说话人表征-ResNet/VGGVox || a ready-to-use repo for Speaker Verification / Speaker Embedding with xvector
A benchmark analysis of some Speaker Verification techniques based on Deep Learning.
Add a description, image, and links to the voxceleb topic page so that developers can more easily learn about it.
To associate your repository with the voxceleb topic, visit your repo's landing page and select "manage topics."