A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing, etc.
Updated Sep 19, 2024 - Python
Computes the MWER (minimum word error rate) loss using beam search and a negative sampling strategy.
Podcast Summarizer with LLM Technology
Python wrapper implementations of various speech-to-text backends (paraformer, whisper_online, whisper_offline, funasr), used to serve the kuon repository.