Skip to content

Latest commit

 

History

History
12 lines (5 loc) · 477 Bytes

README.md

File metadata and controls

12 lines (5 loc) · 477 Bytes

Image-Annotation-Speech

Explaining the contents of an image in the form of speech through caption generation using Inception-V3 model for image feature extraction, LSTM model for caption generation and Goggle Text-To-Speech API and playsound library for text to speech conversion.

To view/edit the full model,visit my kaggle notebook : Image Annotation Kaggle Notebook

Upvotes & Suggestions are appreciated!