Skip to content

Latest commit

 

History

History
24 lines (18 loc) · 1.74 KB

README.md

File metadata and controls

24 lines (18 loc) · 1.74 KB

Data Preparation

Notes on Video Data format

Since the original VideoDataloader of MMAction requires decord for efficient video loading which is non-trivial to compile, this repo only supports raw frame format of videos. Therefore, you have to extract frames from raw videos. We will find another libaries and support VideoLoader soon.

Supported datasets

The rawframe_dataset loads data in a general manner by preparing a .txt file which contains the directory path of frames, total number of a certain video, and the groundtruth label. After that, specify the data_root and image_tmpl of config files. See the sample below:

shot_put/c5-PBp04AQI 299 298
marching/5OEnoefcO1Y 299 192
dancing_ballet/pR1jxLvjcgU 249 84
motorcycling/0dC3o90WYHs 299 199
hoverboarding/RVkof6bxvg0 278 157
playing_piano/H3JzOkvTrJk 297 241

Such general loader might help your experiment with other dataset e.g. UCF101 or custom dataset.

Prepare annotations

  • Kinetics400 contains ~240k training videos and ~19k validation videos. See the guide of original MMAction to generate annotations.
  • Something-Someting has 2 versions which you have to apply on their website. See the guide of TSM to generate annotations.

Thank original MMAction and TSM repo for kindly providing preprocessing scripts.