GitHub - yogesh-iitj/RefRelations

Implementation of the paper Few-Shot Referring Relationships in Videos (CVPR 2023)

Requirements

Use Python >= 3.8.5. Conda is recommended: https://docs.anaconda.com/anaconda/install/linux/
Use PyTorch 1.7.0 with CUDA 10.2, or higher
Install the other requirements from 'requirements.txt'

To set up the environment:

  # create new env fsrr
  $ conda create -n fsrr python=3.8.5

  # activate fsrr
  $ conda activate fsrr

  # install pytorch, torchvision
  $ conda install pytorch==1.7.0 torchvision==0.8.0 cudatoolkit=10.2 -c pytorch

  # install other dependencies
  $ pip install -r requirements.txt
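
After installation, a quick sanity check can confirm that the environment matches the versions listed under Requirements. The script below is a minimal sketch and not part of the repository: the names `parse_version` and `check_environment` are hypothetical, and treating a missing CUDA device as a problem is an assumption of this sketch.

```python
import re
import sys

# Minimum versions taken from the Requirements section of this README.
MIN_PYTHON = (3, 8, 5)
MIN_TORCH = (1, 7, 0)


def parse_version(text):
    """Turn a version string such as '1.7.0+cu102' into the tuple (1, 7, 0)."""
    parts = []
    for piece in text.split("+")[0].split(".")[:3]:
        match = re.match(r"\d+", piece)  # keep leading digits, drop 'a0', 'rc1', ...
        parts.append(int(match.group()) if match else 0)
    while len(parts) < 3:  # pad '1.7' to (1, 7, 0)
        parts.append(0)
    return tuple(parts)


def check_environment():
    """Return a list of problems found; an empty list means the checks passed."""
    problems = []
    if sys.version_info[:3] < MIN_PYTHON:
        found = ".".join(str(n) for n in sys.version_info[:3])
        problems.append(f"Python {found} found, README asks for >= 3.8.5")
    try:
        import torch
    except Exception as exc:  # missing package or broken native libraries
        problems.append(f"PyTorch could not be imported: {exc}")
        return problems
    if parse_version(str(torch.__version__)) < MIN_TORCH:
        problems.append(f"PyTorch {torch.__version__} found, README asks for >= 1.7.0")
    if not torch.cuda.is_available():
        # Assumption of this sketch: a GPU is expected for training.
        problems.append("PyTorch does not see a CUDA device")
    return problems


if __name__ == "__main__":
    for line in check_environment() or ["environment looks consistent with the README"]:
        print(line)
```

Run it inside the activated `fsrr` environment; any line it prints other than the final confirmation points at a requirement that is not met.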

Training

Preparing dataset

Download the ImageNet-VidVRD and VidOR datasets from https://xdshang.github.io/docs/imagenet-vidvrd.html and https://xdshang.github.io/docs/vidor.html
Split videos into frames:

  $ python video_to_frame.py
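
The arguments and output layout of `video_to_frame.py` are not described in this README. For readers who want to see what frame splitting involves, here is a minimal stand-alone sketch that is not the repository's script. It assumes `ffmpeg` is installed and on the PATH, that the videos are `.mp4` files, and that one output folder per video is wanted; the function names and the directory layout are hypothetical.

```python
import subprocess
from pathlib import Path


def ffmpeg_command(video_path, out_dir):
    """Build the ffmpeg call that writes every frame of one video as a JPEG."""
    return [
        "ffmpeg",
        "-i", str(video_path),            # input video
        "-q:v", "2",                      # high JPEG quality
        str(Path(out_dir) / "%06d.jpg"),  # 000001.jpg, 000002.jpg, ...
    ]


def extract_frames(video_root, frames_root, pattern="*.mp4", dry_run=True):
    """Mirror the video folder tree under frames_root, one folder per video.

    With dry_run=True nothing is executed or created; the commands that
    would run are only collected and returned.
    """
    video_root, frames_root = Path(video_root), Path(frames_root)
    commands = []
    for video in sorted(video_root.rglob(pattern)):
        # videos/0000/clip.mp4 -> frames/0000/clip/ (assumed layout)
        out_dir = frames_root / video.relative_to(video_root).with_suffix("")
        command = ffmpeg_command(video, out_dir)
        commands.append(command)
        if not dry_run:
            out_dir.mkdir(parents=True, exist_ok=True)
            subprocess.run(command, check=True)
    return commands
```

Calling `extract_frames("videos", "frames")` lists the planned commands; pass `dry_run=False` to execute them. Check the frame naming expected by the feature-extraction scripts before relying on this layout.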

Extract Faster R-CNN features:

  $ sh data_preparation/vidor.sh
  # Please follow instructions [here](data_preparation/README.md).

Extract I3D features:

  $ sh data_preparation/vidor_i3d.sh

Training RelationNet and VR_Encoder

  $ python model/relnet.py
  # Follow model/config.py for different model settings

Inference

  $ python inference/FullModel_inf.py
  # Follow inference/config.py for inference settings

Evaluation

  $ sh eval/eval.sh
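
The steps from data preparation to evaluation can be chained in a small driver. The sketch below only wraps the commands listed in this README, in the order the sections present them; the driver itself (`PIPELINE`, `run_pipeline`, and the `start_at` and `dry_run` parameters) is hypothetical and not part of the repository. It assumes it is run from the repository root, with the datasets downloaded and the config files already edited.

```python
import shlex
import subprocess

# Commands copied from this README, in the order the sections present them.
PIPELINE = [
    ("split videos into frames", "python video_to_frame.py"),
    ("extract Faster R-CNN features", "sh data_preparation/vidor.sh"),
    ("extract I3D features", "sh data_preparation/vidor_i3d.sh"),
    ("train RelationNet and VR_Encoder", "python model/relnet.py"),
    ("run inference", "python inference/FullModel_inf.py"),
    ("evaluate", "sh eval/eval.sh"),
]


def run_pipeline(steps=PIPELINE, start_at=0, dry_run=True):
    """Run the steps in order, stopping at the first failing command.

    start_at skips steps that already finished; dry_run=True only prints
    the commands. The list of selected commands is returned.
    """
    selected = []
    for index, (label, command) in enumerate(steps):
        if index < start_at:
            continue
        print(f"[{index}] {label}: {command}")
        if not dry_run:
            # check=True raises CalledProcessError on a non-zero exit code.
            subprocess.run(shlex.split(command), check=True)
        selected.append(command)
    return selected


if __name__ == "__main__":
    run_pipeline()  # dry run: prints the six commands without executing them
```

For example, `run_pipeline(start_at=3, dry_run=False)` resumes from training once the features are extracted.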

Cite

If you find this work useful for your research, please consider citing.

@inproceedings{
fewshot_ref_rel,
title={Few-Shot Referring Relationships in Videos},
author={Yogesh Kumar and Anand Mishra},
booktitle={Conference on Computer Vision and Pattern Recognition 2023},
year={2023},
url={https://openreview.net/forum?id=dCbmHXhGtib}
}

