Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

update speechio whisper ft results #1605

Merged
merged 5 commits into from
Apr 30, 2024
Merged

Conversation

yuekaizhang
Copy link
Collaborator

Fine-tuning whisper-large-v2 using multi-hans-zh dataset, exclude datatang 200h (which is not open sourced any more), updating wenetspeech (according to wenet-e2e/WenetSpeech#54)

Currently, rank 8 according to https://github.com/SpeechColab/Leaderboard

Rank 排名 Model 模型 CER 字错误率 Date 时间
1 ximalaya_api_zh 1.72% 2023.12
2 aliyun_ftasr_api_zh 1.85% 2023.12
3 microsoft_batch_zh 2.40% 2023.12
4 bilibili_api_zh 2.90% 2023.09
5 tencent_api_zh 3.18% 2023.12
6 iflytek_lfasr_api_zh 3.32% 2023.12
7 aispeech_api_zh 3.62% 2023.12
8 whisper-large-ft-v1 4.45% 2024.04
9 baidu_pro_api_zh 7.29% 2023.12
Split Greedy Search
Datasets
alimeeting eval 23.45
alimeeting test 25.42
aishell-1 dev 0.78
aishell-1 test 0.83
aishell-2 dev 2.75
aishell-2 test 2.93
aishell-4 test 17.11
magicdata dev 2.68
magicdata test 2.33
kespeech-asr dev phase1 4.97
kespeech-asr dev phase2 2.02
kespeech-asr test 6.34
WenetSpeech dev 5.06
WenetSpeech test meeting 8.38
WenetSpeech test net 6.94

@yuekaizhang yuekaizhang requested a review from JinZr April 24, 2024 11:05
@JinZr
Copy link
Collaborator

JinZr commented Apr 24, 2024

thank you so much!

i'll look into this pr tonight!

egs/multi_zh-hans/ASR/whisper/train.py Show resolved Hide resolved
egs/multi_zh-hans/ASR/prepare.sh Outdated Show resolved Hide resolved
egs/multi_zh-hans/ASR/whisper/train.py Outdated Show resolved Hide resolved
egs/wenetspeech/ASR/whisper/train.py Outdated Show resolved Hide resolved
egs/wenetspeech/ASR/whisper/train.py Show resolved Hide resolved
@JinZr
Copy link
Collaborator

JinZr commented Apr 30, 2024

Thank you yuekai!

I left a few comments at the PR, please check those at your convenience.

best
jin

@yuekaizhang
Copy link
Collaborator Author

Thank you yuekai!

I left a few comments at the PR, please check those at your convenience.

best jin

@JinZr Done. Thanks.

Copy link
Collaborator

@JinZr JinZr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@JinZr JinZr merged commit 6d7c1d1 into k2-fsa:master Apr 30, 2024
203 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants