Video caption usage

preparocess data

python prepro_feats.py --output_dir data/feats/resnet152_origin --model resnet152 --n_frame_steps 40 --gpu 4,5

then, use video-classification-3d-cnn-pytorch to extract features from video. Then mean pool to get a 2048 dim feature for each video, save each feature as {video id}_c3d.npy, such as video0_c3d.npy, put them under data/feats/resnet152_origin.

Then write src-train.txt, tgt-train.txt, src-val.txt, tgt-val.txt, src-test.txt, tgt-test.txt, put them under data\src_train.txt has video id line by line, such as video0 \n video1, tgt_train.txt has captions line by line, each line is a caption corresponding to video id in src_train.txt. Other files' format is the same as these two.

train

python train.py -model_type video -data data/msrvtt/video -save_model data/save/model -gpuid 4 -batch_size 180 -max_grad_norm 20 -dim_vid 4096 -rnn_size 1024 -optim adam -learning_rate 0.001 -epochs 250  -dropout 0.5 -global_attention mlp -encoder_type brnn

translate

python translate.py -data_type video -model data/nmt/model_acc_41.25_ppl_35.38_e12.pt -src_dir data/feats/resnet152_origin -src data/src-test.txt -output pred.txt -gpu 1

eval

python eval.py  -video_ids data/src-test.txt -pred pred.txt

OpenNMT-py: Open-Source Neural Machine Translation

This is a Pytorch port of OpenNMT, an open-source (MIT) neural machine translation system. It is designed to be research friendly to try out new ideas in translation, summary, image-to-text, morphology, and many other domains.

Codebase is relatively stable, but PyTorch is still evolving. We currently recommend forking if you need to have stable code.

OpenNMT-py is run as a collaborative open-source project. It is maintained by Sasha Rush (Cambridge, MA), Ben Peters (Saarbrücken), and Jianyu Zhan (Shenzhen). The original code was written by Adam Lerer (NYC). We love contributions. Please consult the Issues page for any Contributions Welcome tagged post.

@inproceedings{opennmt,
  author    = {Guillaume Klein and
               Yoon Kim and
               Yuntian Deng and
               Jean Senellart and
               Alexander M. Rush},
  title     = {OpenNMT: Open-Source Toolkit for Neural Machine Translation},
  booktitle = {Proc. ACL},
  year      = {2017},
  url       = {https://doi.org/10.18653/v1/P17-4012},
  doi       = {10.18653/v1/P17-4012}
}

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
coco-caption		coco-caption
onmt		onmt
tools		tools
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md
data		data
eval.py		eval.py
prepro_feats.py		prepro_feats.py
preprocess.py		preprocess.py
train.py		train.py
translate.py		translate.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Video caption usage

OpenNMT-py: Open-Source Neural Machine Translation

Table of Contents

Requirements

Features

Quickstart

Step 1: Preprocess the data

Step 2: Train the model

Step 3: Translate

Pretrained embeddings (e.g. GloVe)

Pretrained Models

Citation

About

Releases

Packages

Languages

License

xiadingZ/video-caption-openNMT.pytorch

Folders and files

Latest commit

History

Repository files navigation

Video caption usage

OpenNMT-py: Open-Source Neural Machine Translation

Table of Contents

Requirements

Features

Quickstart

Step 1: Preprocess the data

Step 2: Train the model

Step 3: Translate

Pretrained embeddings (e.g. GloVe)

Pretrained Models

Citation

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages