This directory should include the following subdirectories:
- The original instructions:
default/
- For object-related tokens ablations:
mask_object/
replace_object/
random_mask_for_object/
: the controlled trial
- For direction-related tokens ablations:
mask_direction/
replace_direction/
random_mask_for_direction/
- For numeric tokens ablations:
numeric_default/
: subsets of instructions that contains numeric tokensmask_numeric/
replace_numeric/
random_mask_for_numeric/
- The visual features for default setting and for visual environment ablations:
img_features/
For the object/direction/numeric token ablations, there should be 5 subdirectories under each setting, and corresponding directory name would be in the format of {label}{mask_rate:.2f}_{repeat_idx}
. By default, mask_rate = 1.00
, and repeat_idx
spans from 0~4.
We provide the processed RxR-en instructions for the ablations covered in our study, which can be downloaded by the following script:
cd Diagnose_VLN/
python data_processing/download_data.py --download_instructions --instruction_dataset rxr
Alternatively, you can follow the readme and run the instruction processing scripts in ~/Diagnose_VLN/data_processing/process_instructions/
to generate the ablated instructions from scratch.
We provide the processed visual features for the ablations covered in our study, which can be downloaded by the following script. The compressed file would be ~2.4G, and contains the CLIP ViT-B/32 features in the above listed ablation settings.
cd Diagnose_VLN/
python data_processing/download_data.py --download_image_features --image_fearture_dataset rxr
Alternatively, you can follow the readme and run the visual features precomputing scripts in ~/Diagnose_VLN/data_processing/Matterport3DSimulator/
to generate the ablated visual features from scratch.