Test wav2vec2 models by Microphone

Overview

This repository contains a script that records speech from your microphone and recognizes it using wav2vec2 models.

Demo

Installation

Clone this repository:

git clone https://github.com/egorsmkv/test-wav2vec2-by-microphone
cd test-wav2vec2-by-microphone

Install Python requirements:

Linux

# The author has successfully tested the project with wave==0.0.2, torch==1.11.0, torchaudio==0.11.0, sox==1.4.1, pyaudio==0.2.11, pyctcdecode==0.3.0, and transformers==4.19.2

pip install https://github.com/kpu/kenlm/archive/master.zip
pip install wave torch torchaudio pyaudio sox pyctcdecode transformers
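To see whether your environment matches the versions listed above, a small check along these lines may help. This is a minimal sketch, not part of the repository: the `compare_versions` helper is hypothetical, and it assumes the pip distribution names match the names used in the install command.

```python
from importlib import metadata

# Versions the README reports as tested on Linux.
TESTED_VERSIONS = {
    "wave": "0.0.2",
    "torch": "1.11.0",
    "torchaudio": "0.11.0",
    "sox": "1.4.1",
    "pyaudio": "0.2.11",
    "pyctcdecode": "0.3.0",
    "transformers": "4.19.2",
}


def compare_versions(tested=TESTED_VERSIONS):
    """Return {package: (tested_version, installed_version_or_None)}."""
    report = {}
    for name, wanted in tested.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            # Package is not installed (or is registered under another name).
            installed = None
        report[name] = (wanted, installed)
    return report


if __name__ == "__main__":
    for name, (wanted, installed) in compare_versions().items():
        status = "missing" if installed is None else installed
        note = "" if installed == wanted else "  <- differs from tested version"
        print(f"{name}: tested {wanted}, installed {status}{note}")
```

A different version does not necessarily mean the script will fail; the list only records what the author tested.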

macOS

brew install portaudio sox

pip install https://github.com/kpu/kenlm/archive/master.zip
pip install wave pyctcdecode transformers
pip install --global-option='build_ext' --global-option='-I/usr/local/include' --global-option='-L/usr/local/lib' pyaudio

To install torch and torchaudio on macOS, first install conda or miniconda (I recommend miniconda), then install the torch libraries:

For Intel:

conda install pytorch torchaudio -c pytorch

For M1:

pip3 install torch torchaudio

If you have problems installing pyaudio, check out this link. The following command works for me:

pip3 install --global-option='build_ext' --global-option='-I/opt/homebrew/Cellar/portaudio/19.7.0/include/' --global-option='-L/opt/homebrew/Cellar/portaudio/19.7.0/lib/' pyaudio

Running

# Run the loop (the script records speech and then recognizes it)
# Use Ctrl-C to stop the script
python run.py --model_id Yehor/wav2vec2-xls-r-300m-uk-with-small-lm --record_seconds 15
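The two flags in the command above could be handled roughly as follows. This is only a sketch of how such a command line might be parsed, not the actual contents of run.py: the `build_parser` and `chunks_to_read` helpers, the 16 kHz sample rate, the 1024-frame buffer, and the default of 15 seconds are all assumptions made for illustration.

```python
import argparse

SAMPLE_RATE = 16_000  # assumption: 16 kHz input, which wav2vec2 models commonly expect
CHUNK_SIZE = 1024     # assumption: hypothetical number of frames read per buffer


def build_parser():
    """Build a parser for the two flags shown in the README command."""
    parser = argparse.ArgumentParser(
        description="Record speech from the microphone and recognize it."
    )
    parser.add_argument("--model_id", required=True,
                        help="Model identifier, e.g. a Hugging Face Hub ID")
    parser.add_argument("--record_seconds", type=int, default=15,
                        help="Length of each recording in seconds (default is an assumption)")
    return parser


def chunks_to_read(record_seconds, sample_rate=SAMPLE_RATE, chunk_size=CHUNK_SIZE):
    """Number of full buffers needed to cover the requested recording length."""
    return int(sample_rate / chunk_size * record_seconds)


if __name__ == "__main__":
    # Parse an explicit argument list so the sketch runs without command-line input.
    args = build_parser().parse_args([
        "--model_id", "Yehor/wav2vec2-xls-r-300m-uk-with-small-lm",
        "--record_seconds", "15",
    ])
    print(args.model_id, args.record_seconds, chunks_to_read(args.record_seconds))
```

A longer `--record_seconds` value gives you more time to speak per iteration, at the cost of waiting longer before each transcription appears.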

Help

If you run into any problems, create an issue in the repository.
The script is currently tested on Linux and macOS; on Windows, you need to modify the script slightly.

Acknowledgements

PyAudio: https://people.csail.mit.edu/hubert/pyaudio/
wave: https://pythonhosted.org/Wave/
