Image-Annotation-Speech

Explaining the contents of an image in the form of speech through caption generation using Inception-V3 model for image feature extraction, LSTM model for caption generation and Goggle Text-To-Speech API and playsound library for text to speech conversion.