Bachelor Thesis -
Phoneme classification and alignment
through recognition on TIMIT

My bachelor thesis on Phoneme recognition and alignment on the TIMIT dataset

Abstract

In this work we explore a hybrid between ANNs and DTW for phoneme alignment on the TIMIT dataset. The idea is to use the output probabilities of a neural phoneme recognition model together with a probability-based DTW in order to align phonemes. For phoneme recognition we achieve 18.1% FER which is an 4.0% improvement over the state- of-the-art. Our alignment results in a 86.3% phoneme boundary accuracy with a 20ms tolerance. Furthermore phoneme classification based on recordings of single phonemes is being tried resulting in an accuracy of 66.68%. Apart from that we introduce the CyclicPlateauScheduler, a new learning rate scheduler combining triangular cyclic learning rates with ReduceLROnPlateau.

CNN experiments

The code for the initial CNN experiments can be found here

Name		Name	Last commit message	Last commit date
Latest commit History 123 Commits
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bachelor Thesis -
Phoneme classification and alignment
through recognition on TIMIT

Abstract

CNN experiments

About

Packages

Languages

License

lischilpp/bachelor-thesis-phoneme-recognition-alignment

Folders and files

Latest commit

History

Repository files navigation

Bachelor Thesis - Phoneme classification and alignment through recognition on TIMIT

Abstract

CNN experiments

About

Topics

Resources

License

Stars

Watchers

Forks

Packages 0

Languages

Bachelor Thesis -
Phoneme classification and alignment
through recognition on TIMIT

Packages