Python toolkit for Visual Speech Recognition
Updated Jun 10, 2020 (Python)
Multimodal speech recognition for phoneme-level prediction using audio-visual data from the TCD-TIMIT dataset. The audio subnetwork uses RNNs with bidirectional LSTMs, and the video subnetwork uses CNN-LSTMs.
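As a rough illustration of how the feature sizes in such a two-stream design might line up, here is a small shape-bookkeeping sketch in plain Python. It is not taken from the repository. The fusion by concatenation, the unidirectional video LSTM, the 64x64 mouth crop, the three conv/pool stages, the 32 channels, the hidden sizes, and the 39 phoneme classes are all assumptions chosen for illustration, and the names `conv_out`, `fusion_shapes`, and `FusionShapes` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FusionShapes:
    """Per-frame feature sizes at each stage of a hypothetical two-stream model."""
    audio_out: int   # bidirectional LSTM output: forward + backward states
    video_out: int   # LSTM output on top of per-frame CNN features
    fused: int       # concatenation of both streams (assumed fusion method)
    logits: int      # one score per phoneme class


def conv_out(size: int, kernel: int, stride: int = 1, padding: int = 0) -> int:
    """Spatial output size of one convolution or pooling layer."""
    return (size + 2 * padding - kernel) // stride + 1


def fusion_shapes(audio_hidden: int, video_hidden: int, n_phonemes: int) -> FusionShapes:
    audio_out = 2 * audio_hidden   # a BiLSTM concatenates both directions
    video_out = video_hidden       # assumed unidirectional LSTM over CNN features
    return FusionShapes(audio_out, video_out, audio_out + video_out, n_phonemes)


# Hypothetical sizes, chosen only for illustration.
side = 64                          # assumed square mouth-region crop
for _ in range(3):                 # three conv(3x3, pad 1) + pool(2x2, stride 2) stages
    side = conv_out(side, kernel=3, padding=1)
    side = conv_out(side, kernel=2, stride=2)

cnn_features = 32 * side * side    # 32 assumed output channels, flattened per frame
shapes = fusion_shapes(audio_hidden=256, video_hidden=256, n_phonemes=39)
print(side, cnn_features, shapes)
```

The point of the sketch is only that the bidirectional audio stream contributes twice its hidden size to the fused vector, while the video stream's CNN reduces each frame to a flat feature vector before the LSTM sees it.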