GitHub - DakeQQ/Voice-Activity-Detection-VAD-ONNX: Utilizes ONNX Runtime for speech activity detection.

Voice-Activity-Detection-VAD-ONNX

Speech activity detection powered by ONNX Runtime for high-performance applications.

Supported Model:
- FSMN
- Silero (Optimized for enhanced parallel computing performance)
Recommendation and Note:
- It is recommended to use the Audio Denoiser for optimal performance in noisy environments.
End-to-End Processing:
- This model includes internal STFT processing.
- Input: Raw audio
- Output: Detected speech timestamps
Resources:
- Download Models
- Explore More Projects

OS	Device	Backend	Model	Real-Time Factor (Chunk Size: 512 or 32ms)
Ubuntu-24.04	Desktop	CPU i3-12300	FSMN f32	0.0047
Ubuntu-24.04	Desktop	CPU i3-12300	Silero f32	0.0026

通过 ONNX Runtime 实现高性能的语音活动检测。

Name		Name	Last commit message	Last commit date
Latest commit History 94 Commits
FSMN		FSMN
Silero		Silero
LICENSE		LICENSE
README.md		README.md