Dataset Preparation

Note that the following steps are required only if you want to prepare the annotations from parent repository.

Here I have outlined the steps to prepare the datasets of RefCOCO, RefCOCO+ and RefCOCOg and pretrained weights of visual backbone:

The project directory is $ROOT

Annotations

Current directory is located at $DATA_PREP=$ROOT/data to generate annotations.

Download the cleaned data and extract them into "data" folder
Prepare images from COCO train2014, and unzip the annotations. At this point the directory should look like:

$DATA_PREP
|-- refcoco
    |-- instances.json
    |-- refs(google).p
    |-- refs(unc).p
|-- refcoco+
    |-- instances.json
    |-- refs(unc).p
|-- refcocog
    |-- instances.json
    |-- refs(google).p
    |-- refs(umd).p
|-- images
	|-- train2014

After that, you should run $DATA_PREP/data_process.py to generate the annotations. For example, to generate the annotations for RefCOCO, you can run the code:

cd $ROOT/data_prep
python data_process.py --data_root $DATA_PREP --output_dir $DATA_PREP --dataset refcoco --split unc --generate_mask

At this point the directory $DATA_PREP should look like:

   $ROOT/data
   |-- refcoco
       |-- instances.json
       |-- refs(google).p
       |-- refs(unc).p
   |-- refcoco+
       |-- instances.json
       |-- refs(unc).p
   |-- refcocog
       |-- instances.json
       |-- refs(google).p
       |-- refs(umd).p
   |-- anns
       |-- refcoco
       |-- refcoco+
       |-- refcocog
   |-- masks
       |-- refcoco
       |-- refcoco+
       |-- refcocog
   |-- images
       |-- train2014
   |-- weights
       |-- pretrained_weights

Pretrained Weights of Visual Backbone

We provide the pretrained weights of vgg and darknet backbone, which are trained by darknet-yolov3. We remove all images appearing in the val+test splits of RefCOCO, RefCOCO+ and RefCOCOg. You should download the following weights into $DATA_PREP/weights.

Pretrained Weights of Backbone	keras version	darknet version
DarkNet53-yolov3	OneDrive, Baidu Cloud (password:xvue)	OneDrive, Baidu Cloud (password:az2j)
Vgg16-yolov3	OneDrive, Baidu Cloud(password:wdb8)	OneDrive, Baidu Cloud(password:4tml)

The weights of darknet pretrained on the whole train set of MS-COCO are also released as following, which will boost the performance of REC around 2~3% in practice. Notbly, we release it for training on other datasets like referit, but do not advise to use it for RefCOCO, RefCOCO+ and RefCOCOg. (it is incorrect to use the COCO pre-trained backbone on RefCOCO, RefCOCO+, and RefCOCOg datasets) .

Pretrained Weights of Backbone	keras version	darknet version
DarkNet-yolov3 (COCO pre-trained)	OneDrive, Baidu Cloud (password:l9q6)	Link

Tips: In our practice, different checkpoints partly varies the performence of model. if you require other checkpoints, please contact us.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Dataset Preparation

Annotations

Pretrained Weights of Visual Backbone

Files

README.md

Latest commit

History

README.md

File metadata and controls

Dataset Preparation

Annotations

Pretrained Weights of Visual Backbone