Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"
Image Caption Generator is a project that aims to generate descriptive captions for input images using advanced predictive techniques
Image Captioning With MobileNet-LLaMA 3
Text-Image-Text is a bidirectional system that enables seamless retrieval of images based on text descriptions, and vice versa. It leverages state-of-the-art language and vision models to bridge the gap between textual and visual representations.
Karpathy Splits json files for image captioning
"AutoImageCaption-CNNvsResNet" leverages the Flickr 8k Dataset to automate image captioning, comparing CNN+LSTM and ResNet+GRU models using BLEU scores for performance evaluation.
🚀 Image Caption Generator Project 🚀 🧠 Building a customized LSTM neural network encoder model with Dropout, Dense, RepeatVector, and Bidirectional LSTM layers; sequence feature layers with Embedding, Dropout, and Bidirectional LSTM layers; an attention mechanism using dot-product, softmax attention scores, ...
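The dot-product/softmax attention mentioned above (and in several repos below) reduces to a few lines: score each encoder state against the decoder query, normalize the scores with softmax, and take the weighted sum. A pure-Python sketch with toy vectors (the function name and dimensions are illustrative assumptions):

```python
import math

def soft_attention(query, keys, values):
    """Dot-product soft attention:
    scores  = query . key_i            (one per encoder state)
    weights = softmax(scores)          (sum to 1)
    context = sum_i weights_i * value_i
    """
    scores = [sum(q * k for q, k in zip(query, key)) for key in keys]
    m = max(scores)                              # subtract max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    weights = [e / z for e in exps]
    context = [sum(w * v[d] for w, v in zip(weights, values))
               for d in range(len(values[0]))]
    return weights, context
```

In a captioning decoder, `query` would be the current RNN hidden state and `keys`/`values` the CNN feature map locations; real implementations batch this with matrix operations.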
Comparative analysis of image captioning models using RNN, BiLSTM, and Transformer architectures on the Flickr8K dataset, with InceptionV3 for image feature extraction.
An image captioning web application that combines React.js for the front-end with Flask and Node.js for the back-end, built on the MERN stack. Users can upload images and instantly receive automatic captions; authenticated users get extra features such as caption translation and text-to-speech.
Image Caption Generator using Python | Flickr Dataset | Deep Learning (CNN & RNN)
Caption Generation using Flickr8k dataset by @jbrownlee and image generation from caption prompt using pretrained models
In this capstone project, we create a deep learning model that can describe the contents of an image as speech, via caption generation with an attention mechanism on the Flickr8K dataset.
An Image Captioning implementation of a CNN Encoder and an RNN Decoder in PyTorch.
Image Caption Generator, a project that aims to generate descriptive captions for input images using advanced predictive techniques.
The project generates Arabic captions from the Arabic Flickr8K dataset, using a pre-trained CNN (MobileNet-V2) and an LSTM model, together with a set of NLP preprocessing steps. The aim is to lay initial groundwork for helping children with learning difficulties.
Implementation of Image Captioning Model using CNNs and LSTMs
Image Captioning is the task of describing the content of an image in words. This task lies at the intersection of computer vision and natural language processing.
Image captioning of Flickr 8k dataset using Attention and Merge model
CaptionBot: sequence-to-sequence modelling where the encoder is a CNN (ResNet-50) and the decoder is an LSTMCell with a soft attention mechanism.
Generate captions from images