gfcc-speech-kaldi

If you use this code or part of it, please cite us!

The success of any ASR system is dependent on the availability of its training data. However, output in low resource-speaking language is deteriorated due to the absence of sufficient signals processing features. In case of Indian tonal languages like Punjabi, one such difficulty is nearly zero resource conditions and language differences exist due to speaking and vocal tract length differences between children and audlt speech data.

The code attempts to create the Punjabi Children ASR structure with mismatched settings, with rigorous sound methods such as the Mel frequency cepstral factor (MFCC) and more noise robust methods employing gammatone frequency cepstral factor (GFCC).

Puneet Bawa , Virender Kadyan, "Noise robust in-domain children speech enhancement for automatic Punjabi recognition system under mismatched conditions" doi: https://doi.org/10.1016/j.apacoust.2020.107810

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
conf		conf
feat		feat
featbin		featbin
transform		transform
LICENSE		LICENSE
README.md		README.md
test.sh		test.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

gfcc-speech-kaldi

About

Releases

Packages

Contributors 2

Languages

License

puneetbawa/gfcc-speech-kaldi

Folders and files

Latest commit

History

Repository files navigation

gfcc-speech-kaldi

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages