This repository contains the code for the CVPR 2021 paper "Delving Deep into Many-to-many Attention for Few-shot Video Object Segmentation".
The architecture of our Domain Agent Network:
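The core operation behind the network is many-to-many attention: every query-frame location attends to every support-frame location, rather than matching frames one-to-one. A toy, stdlib-only sketch of that idea (the feature vectors and helper names are made up for illustration; this is not the paper's implementation):

```python
import math

def softmax(xs):
    # numerically stable softmax over a plain list of scores
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

def many_to_many_attention(query_feats, support_feats, support_vals):
    """Each query location attends to ALL support locations (pooled across
    every support frame), so the attention map is many-to-many."""
    out = []
    for q in query_feats:
        # dot-product similarity of this query location to every support location
        scores = [sum(qi * ki for qi, ki in zip(q, k)) for k in support_feats]
        weights = softmax(scores)
        # weighted sum of the values carried by the support locations
        out.append(sum(w * v for w, v in zip(weights, support_vals)))
    return out

# toy example: 2 query locations, 3 support locations
queries = [[1.0, 0.0], [0.0, 1.0]]
supports = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
values = [0.0, 1.0, 0.5]  # e.g. a foreground score at each support location
print(many_to_many_attention(queries, supports, values))
```

Each output entry is a support-weighted aggregate for one query location; in the real network the same idea runs over dense feature maps instead of toy vectors.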
## Environment

```shell
conda create -n FSVOS python=3.6
conda activate FSVOS
conda install pytorch==1.6.0 torchvision==0.7.0 cudatoolkit=10.2 -c pytorch
conda install opencv cython
pip install easydict imgaug
```
## Data preparation

- Download the 2019 version of the Youtube-VIS dataset.
- Put the dataset in the `./data` folder.
```
data
└─ Youtube-VOS
    └─ train
        ├─ Annotations
        ├─ JPEGImages
        └─ train.json
```
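Before training, it can help to sanity-check that the dataset landed in the expected layout. A small stdlib sketch (the paths mirror the tree above; the function name is ours, not part of the repository):

```python
from pathlib import Path

def check_dataset_layout(root="data"):
    """Return a list of missing paths; an empty list means the
    Youtube-VOS training data sits where the loaders expect it."""
    train = Path(root) / "Youtube-VOS" / "train"
    required = [
        train / "Annotations",
        train / "JPEGImages",
        train / "train.json",
    ]
    return [str(p) for p in required if not p.exists()]
```

Run `check_dataset_layout()` from the repository root; a non-empty result lists exactly what is missing.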
- Install cocoapi for Youtube-VIS.
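The Youtube-VIS annotations use a video extension of the COCO format. One common route is the `youtubevos/cocoapi` fork (an assumption about which cocoapi is meant here; it provides a `pycocotools.ytvos` API for `train.json`):

```shell
git clone https://github.com/youtubevos/cocoapi.git
cd cocoapi/PythonAPI
python setup.py install
```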
- Download the ImageNet pretrained backbone and put it into the `pretrain_model` folder.

```
pretrain_model
└─ resnet50_v2.pth
```
- Update the `root_path` in `config/DAN_config.py`.
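For reference, a hypothetical sketch of how such a root path is typically consumed by the data loaders (the joins below are an assumption for illustration, not the repository's actual config code):

```python
from pathlib import Path

# hypothetical mirror of the root_path setting in config/DAN_config.py:
# point it at the directory that contains the data/ folder from above
root_path = "/home/user/FSVOS"

def dataset_paths(root):
    """Derive the dataset locations a training script would read from."""
    data = Path(root) / "data" / "Youtube-VOS" / "train"
    return {
        "annotations": data / "Annotations",
        "images": data / "JPEGImages",
        "ann_file": data / "train.json",
    }
```

If these derived paths do not exist on disk, `root_path` is the first thing to re-check.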
## Training

```shell
python train_DAN.py --group 1 --batch_size 4
```
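In few-shot segmentation, a `--group` flag usually selects one cross-validation fold of held-out categories. Assuming the common 4-fold split over the 40 Youtube-VIS categories (an assumption about this repository, sketched only to illustrate what the flag controls):

```python
def fold_categories(group, num_categories=40, num_folds=4):
    """Hypothetical 4-fold split: fold `group` holds the unseen test
    categories; the remaining categories are used for training."""
    assert 1 <= group <= num_folds
    per_fold = num_categories // num_folds
    test_ids = list(range((group - 1) * per_fold + 1, group * per_fold + 1))
    train_ids = [c for c in range(1, num_categories + 1) if c not in test_ids]
    return train_ids, test_ids

train_ids, test_ids = fold_categories(group=1)
print(test_ids)  # with --group 1, categories 1..10 are held out
```

Under this scheme, training with `--group 1` and testing with `--group 1` evaluates on categories the model never saw during training.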
## Testing

You can download our pretrained model to test.

```shell
python test_DAN.py --test_best --group 1
```
## Acknowledgements

Part of the code is based upon:
- PMMs: https://github.com/Yang-Bob/PMMs
- PFENet: https://github.com/Jia-Research-Lab/PFENet
- STM-Training: https://github.com/lyxok1/STM-Training