Welcome to the Medical Image Captioning Tool repository!
This repository contains the documentation, design specifications, implementation details, and related tools for an image captioning tool that generates natural language captions for chest X-ray images.
You can find the official model implementation in this Kaggle notebook: Link
The model architecture is inspired by "Show and Tell" by Vinyals et al. [1] and is built with the TensorFlow library.
The overall model consists of two stages (a minimal sketch of how they fit together follows this list):
1- Image feature extraction, using CheXNet.
2- Caption generation, using an LSTM.
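For orientation, here is a minimal sketch of how such a two-stage model could be wired together in TensorFlow/Keras using a merge-style decoder. The vocabulary size, caption length, and layer widths below are illustrative assumptions, not the exact values from the notebook:

```python
import tensorflow as tf
from tensorflow.keras import layers, Model

VOCAB_SIZE = 5000   # report vocabulary size (assumption)
MAX_LEN = 40        # maximum caption length in tokens (assumption)
FEATURE_DIM = 1024  # size of the pooled CheXNet feature vector

# Stage 1 output: a precomputed CheXNet feature vector per image.
image_input = layers.Input(shape=(FEATURE_DIM,), name="chexnet_features")
img_embed = layers.Dense(256, activation="relu")(image_input)

# Stage 2: an LSTM encodes the partial caption, the two representations
# are merged, and a softmax predicts the next word.
caption_input = layers.Input(shape=(MAX_LEN,), name="caption_tokens")
txt_embed = layers.Embedding(VOCAB_SIZE, 256, mask_zero=True)(caption_input)
txt_encoded = layers.LSTM(256)(txt_embed)

merged = layers.add([img_embed, txt_encoded])
next_word = layers.Dense(VOCAB_SIZE, activation="softmax")(merged)

model = Model([image_input, caption_input], next_word)
model.compile(loss="sparse_categorical_crossentropy", optimizer="adam")
```

At inference time the decoder runs word by word, feeding each predicted token back in until an end-of-sequence token or `MAX_LEN` is reached.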
CheXNet: a DenseNet-121 convolutional neural network pre-trained on a large chest X-ray (CXR) dataset.
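A minimal sketch of stage 1 in tf.keras, assuming a DenseNet-121 backbone with global average pooling; the CheXNet weight file path is hypothetical and stands in for weights fine-tuned on the NIH dataset:

```python
from tensorflow.keras.applications import DenseNet121

# DenseNet-121 without its classification head; global average pooling
# yields one 1024-dimensional feature vector per image.
backbone = DenseNet121(include_top=False, weights="imagenet",
                       input_shape=(224, 224, 3), pooling="avg")
# backbone.load_weights("chexnet_weights.h5")  # hypothetical CheXNet weights

def extract_features(image_batch):
    """Map a batch of preprocessed X-rays to 1024-d feature vectors."""
    return backbone(image_batch, training=False)
```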
The project also contains code for an attention LSTM layer [2], although it is not integrated into the final model.
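The repository's layer is not reproduced here, but a Bahdanau-style additive attention layer in the spirit of "Show, Attend and Tell" [2] might look like this sketch (all names and sizes are illustrative):

```python
import tensorflow as tf
from tensorflow.keras import layers

class BahdanauAttention(layers.Layer):
    """Additive attention over spatial CNN features (illustrative sketch)."""

    def __init__(self, units):
        super().__init__()
        self.W1 = layers.Dense(units)  # projects image features
        self.W2 = layers.Dense(units)  # projects the decoder hidden state
        self.V = layers.Dense(1)       # scores each spatial location

    def call(self, features, hidden):
        # features: (batch, num_locations, feature_dim)
        # hidden:   (batch, hidden_dim), the decoder LSTM state
        hidden_t = tf.expand_dims(hidden, 1)
        scores = self.V(tf.nn.tanh(self.W1(features) + self.W2(hidden_t)))
        weights = tf.nn.softmax(scores, axis=1)
        # Weighted sum of feature vectors: the context for the next word.
        context = tf.reduce_sum(weights * features, axis=1)
        return context, weights
```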
Two datasets are used in this project: the National Institutes of Health (NIH) Chest X-ray dataset, to train the CNN feature extractor (CheXNet), and the Chest X-rays (Indiana University) dataset [4], to train the caption generator. The same pipeline can also be trained on other medical imaging datasets.
- The BLEU score on the test set is 0.64 (an evaluation sketch follows this list).
- Model loss: decreased from 12 to 2.0831 over training.
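For reference, a corpus-level BLEU score of this kind can be computed with NLTK. The tokenized reports below are toy examples, and unigram/bigram weights are used so the tiny example does not degenerate to zero; the repository's exact BLEU configuration is not specified here:

```python
from nltk.translate.bleu_score import corpus_bleu

# One list of reference reports per image, plus one generated caption each
# (toy data; real evaluation would use the Indiana University test split).
references = [[["no", "acute", "cardiopulmonary", "abnormality"]]]
hypotheses = [["no", "acute", "cardiopulmonary", "disease"]]

score = corpus_bleu(references, hypotheses, weights=(0.5, 0.5))
print(f"BLEU: {score:.2f}")
```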
- tensorflow
- keras
- numpy
- h5py
- progressbar2
These requirements can be installed with: `pip install -r requirements.txt`
[1] Oriol Vinyals, Alexander Toshev, Samy Bengio, Dumitru Erhan. Show and Tell: A Neural Image Caption Generator.
[2] Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhutdinov, Richard Zemel, Yoshua Bengio. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention.
[3] Kaggle, Official Model Implementation.
[4] Official Dataset Link, Chest X-rays (Indiana University).