This project describes the contents of an image as speech. It generates a caption using an Inception-V3 model for image feature extraction and an LSTM model for caption generation, then converts the caption to speech with the Google Text-to-Speech API and the playsound library.
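As a rough illustration of the caption-then-speak flow, here is a minimal sketch, not the notebook's actual code. It assumes greedy word-by-word decoding and `startseq`/`endseq` marker tokens, which are common conventions but are not confirmed by this README. The names `greedy_caption`, `speak`, and `toy_scores` are hypothetical, and `toy_scores` is a stand-in for the trained Inception-V3 + LSTM model.

```python
from typing import Callable, Dict, List


def greedy_caption(
    next_word_scores: Callable[[List[str]], Dict[str, float]],
    max_len: int = 20,
    start: str = "startseq",  # assumed start-of-caption token
    end: str = "endseq",      # assumed end-of-caption token
) -> str:
    """Build a caption by repeatedly picking the highest-scoring next word."""
    words = [start]
    for _ in range(max_len):
        scores = next_word_scores(words)
        best = max(scores, key=scores.get)
        if best == end:
            break
        words.append(best)
    # Drop the start token before returning the caption text.
    return " ".join(words[1:])


def speak(caption: str, path: str = "caption.mp3") -> None:
    """Convert a caption to speech; needs the gTTS and playsound packages and network access."""
    from gtts import gTTS            # imported lazily so the rest runs without it
    from playsound import playsound

    gTTS(text=caption, lang="en").save(path)
    playsound(path)


def toy_scores(words: List[str]) -> Dict[str, float]:
    """Hypothetical stand-in for the real model: follows a fixed script."""
    script = ["a", "dog", "runs", "endseq"]
    target = script[min(len(words) - 1, len(script) - 1)]
    return {w: (1.0 if w == target else 0.0) for w in script}


caption = greedy_caption(toy_scores)
print(caption)
```

In the real pipeline, the scoring function would wrap the LSTM decoder conditioned on the Inception-V3 image features, and `speak(caption)` would be called on the result.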
To view or edit the full model, visit my Kaggle notebook: Image Annotation Kaggle Notebook
Upvotes & Suggestions are appreciated!