This repository is an attempt to examine all aspects of the wav2vec2 model, from preprocessing and fine-tuning to language-model construction, testing, and serving.
-
Datasets:
- Sharif-Wav2vec2-v1 was fine-tuned on Mozilla Common Voice.
- Sharif-Wav2vec2-v2 was fine-tuned on BigFarsdat, DeepMine, FarsSpon, and Mozilla Common Voice (AGP Dataset).
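For context, the Common Voice portion could be loaded with the 🤗 `datasets` library roughly as sketched below; the dataset id and version (`mozilla-foundation/common_voice_11_0`) are an assumption, not necessarily the exact release used here.

```python
# Minimal sketch (assumed dataset id/version): load the Farsi split of Mozilla Common Voice.
from datasets import load_dataset, Audio

common_voice = load_dataset(
    "mozilla-foundation/common_voice_11_0",  # assumption; pick the release you actually use
    "fa",                                    # Farsi configuration
    split="train",
)

# wav2vec2 expects 16 kHz audio, so resample on the fly.
common_voice = common_voice.cast_column("audio", Audio(sampling_rate=16_000))
print(common_voice[0]["sentence"])
```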
-
Corpus: Most of our textual data was taken from the naab corpus, a huge collection of Farsi text.
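As an illustration of how this corpus could feed language-model training later (the MakingLM step), the sketch below streams naab and dumps raw text to a file; the Hub id `SLPL/naab` and the `text` column name are assumptions.

```python
# Sketch (assumed Hub id and column name): stream naab and write plain text
# that an n-gram toolkit such as KenLM can consume.
from datasets import load_dataset

naab = load_dataset("SLPL/naab", split="train", streaming=True)

with open("lm_corpus.txt", "w", encoding="utf-8") as f:
    for i, example in enumerate(naab):
        f.write(example["text"].strip() + "\n")
        if i >= 1_000_000:  # the corpus is huge; cap the dump for this example
            break
```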
-
System Config: The model was fine-tuned on an NVIDIA GeForce RTX 3060 (12 GB).
Order of use (a minimal fine-tuning sketch follows this list):
- Preprocessing
- Fine-tuning
- MakingLM
- Test Model
- client
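As a rough guide, the fine-tuning step could be wired up with 🤗 `transformers` along these lines; the vocabulary file, special tokens, and hyperparameters are placeholders, not the exact settings used in this repo.

```python
# Sketch of the fine-tuning setup: a CTC head on top of the XLSR-53 checkpoint.
# vocab.json and all hyperparameters are placeholders.
from transformers import (
    Wav2Vec2CTCTokenizer,
    Wav2Vec2FeatureExtractor,
    Wav2Vec2Processor,
    Wav2Vec2ForCTC,
)

tokenizer = Wav2Vec2CTCTokenizer(
    "vocab.json",                 # character vocabulary built during preprocessing
    unk_token="[UNK]",
    pad_token="[PAD]",
    word_delimiter_token="|",
)
feature_extractor = Wav2Vec2FeatureExtractor(
    feature_size=1, sampling_rate=16_000, padding_value=0.0,
    do_normalize=True, return_attention_mask=True,
)
processor = Wav2Vec2Processor(feature_extractor=feature_extractor, tokenizer=tokenizer)

model = Wav2Vec2ForCTC.from_pretrained(
    "facebook/wav2vec2-large-xlsr-53",
    ctc_loss_reduction="mean",
    pad_token_id=processor.tokenizer.pad_token_id,
    vocab_size=len(processor.tokenizer),
)
model.freeze_feature_encoder()    # keep the convolutional feature encoder frozen

# A transformers.Trainer with a CTC padding collator then runs the actual training.
```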
-
🤗 Several models were fine-tuned during this work, which is why results differ between the code examples; insert the path to your own model where needed (see the inference sketch after the links below). To make a fair comparison between existing wav2vec2 models, we prepared a standard test set of varied, representative data, which will soon be released alongside our paper.
You can find the fine-tuned models and related resources at these addresses:
- Base Model: https://huggingface.co/facebook/wav2vec2-large-xlsr-53
- Base Paper: https://arxiv.org/abs/2006.13979
- Language Model: KenLM (https://github.com/kpu/kenlm, https://kheafield.com/code/kenlm/)
- Other wav2vec2 models info: https://github.com/facebookresearch/fairseq/tree/main/examples/wav2vec#wav2vec-20
- Our Standard Farsi Test Set: coming soon :hourglass_flowing_sand:
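To try a fine-tuned checkpoint, inference can be as simple as the sketch below; the model id and audio file name are placeholders for your own paths.

```python
# Sketch: transcribe a Farsi audio file with a fine-tuned checkpoint via the ASR pipeline.
# The model id and "sample.wav" are placeholders.
from transformers import pipeline

asr = pipeline(
    "automatic-speech-recognition",
    model="path/or/hub-id-of-your-fine-tuned-model",
)
print(asr("sample.wav")["text"])
```

If a checkpoint ships with a KenLM decoder (built in the MakingLM step), `transformers.Wav2Vec2ProcessorWithLM` can apply LM-boosted beam-search decoding instead of plain greedy CTC.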
Thanks to Sadra Sabouri for his collaboration :handshake::handshake:
Also, I would like to thank Mehrdad Farahani for his normalizer and dictionary 🤝
⭐ Give us a star if you found this repo useful.
🙋‍♀️ Open an issue if you have any comments.
🥰 Feel free to open a pull request adding your feature. We'll be more than happy to accept it.