This repository contains the datasets used in Zero Shot Detection by Pengkai Zhu, Hanxiao Wang, Tolga Bolukbasi and Venkatesh Saligrama.
@article{Zhu_2019,
title={Zero Shot Detection},
DOI={10.1109/tcsvt.2019.2899569},
journal={IEEE Transactions on Circuits and Systems for Video Technology},
publisher={Institute of Electrical and Electronics Engineers (IEEE)},
author={Zhu, Pengkai and Wang, Hanxiao and Saligrama, Venkatesh},
year={2019}
}
The scripts can download Pascal VOC or MSCOCO and split it into four parts as in the paper:
- Train: seen in train
- Test-Seen: seen in val/test
- Test-Unseen: unseen in train&val&test
- Test-Mix: both seen & unseen in val/test
The dataset is split based on assigned seen categories names. We provide
the splits we used in the paper in seen_names
subfolder.
The attributes for Pascal VOC will be downloaded and extracted automatically
when running get_voc_zsd_dataset.sh
. We also provide the attributes we use
in the paper in the attributes
subfolder:
coco_w2v.txt
: w2v attributes for coco categoriescoco_w2v_voc.txt
: projected w2v attributes for coco categories (mirroring VOC attributes similarity)voc.txt
: labelled attributes (from aP&Y) for VOC categoriesvoc_w2v.txt
: w2v attributes for VOC categories
- Preliminary:
numpy
bash get_voc_zsd_dataset.sh $zsd-data-dir # $zsd-data-dir: directory for saving pascal ZSD dataset
The dataset will be downloaded to $zsd-data-dir
and the split sets will be saved
in 1010split
subfolder by default. If you already downloaded the dataset or would like
to try some other splits, just run:
python zsd_split.py --dataset voc --data_dir $zsd-data-dir --name_file voc.names \
--seen_name_file seen_names/voc/${choose another split} \
--save_dir ${split save name} \
bash get_coco_zsd_dataset.sh $zsd-data-dir # $zsd-data-dir: directory for saving coco ZSD dataset