CenterMask with Open Images

This repo trains CenterMask (built on Detectron2) on the Open Images dataset. Credits to Youngwan Lee and Jongyoul Park (arXiv: https://arxiv.org/abs/1911.06667); original CenterMask repo: https://github.com/youngwanLEE/centermask2

Hyperparameter Testing

Comparing three sets of training hyperparameters (see the solver config sketch after the plot):

  • gradient clip value 0.5, lr 0.05, lr decay 0.2 x 2 (blue)
  • gradient clip value 0.5, lr 0.005 (orange)
  • lr 0.002, lr decay 0.2 x 2 (red)

(Plot: total loss curves for the three hyperparameter settings)
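For reference, a minimal sketch of how these hyperparameters map onto Detectron2's solver config, assuming a Detectron2 version that exposes SOLVER.CLIP_GRADIENTS; the decay iterations below are placeholders:

from detectron2.config import get_cfg

cfg = get_cfg()
# Clip each gradient element to [-0.5, 0.5] (the "gradient clip value 0.5" setting)
cfg.SOLVER.CLIP_GRADIENTS.ENABLED = True
cfg.SOLVER.CLIP_GRADIENTS.CLIP_TYPE = "value"
cfg.SOLVER.CLIP_GRADIENTS.CLIP_VALUE = 0.5
# Base learning rate; GAMMA is the decay factor applied at each iteration in STEPS,
# so "lr decay 0.2 x 2" means two decays of 0.2 each
cfg.SOLVER.BASE_LR = 0.05
cfg.SOLVER.GAMMA = 0.2
cfg.SOLVER.STEPS = (60000, 80000)  # hypothetical decay points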

Open Images Training and Results

Mask R-CNN and CenterMask were trained on Open Images V6, containing 24,591 images and 57,812 masks across 300 classes. Due to the imbalanced distribution among classes, repeat factor sampling was used to oversample tail classes, with a 2x learning rate schedule.
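In Detectron2, repeat factor sampling is enabled through the data loader config; a minimal sketch (the sampler name and threshold key are standard Detectron2 options, while the threshold value here is an assumption):

# Images containing categories whose frequency is below REPEAT_THRESHOLD
# are repeated within each epoch, oversampling the tail classes
cfg.DATALOADER.SAMPLER_TRAIN = "RepeatFactorTrainingSampler"
cfg.DATALOADER.REPEAT_THRESHOLD = 0.001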

*All results measured on an NVIDIA Quadro P1000

Method     | Backbone        | Inference Time (s) | mask AP | box AP | full metrics
-----------|-----------------|--------------------|---------|--------|-------------
CenterMask | VoVNetV2-19     | 0.16               | 15.513  | 14.594 | metrics
Mask R-CNN | ResNet-50       | 0.48               | 17.765  | 15.512 | metrics
CenterMask | SimpleNet       | 0.56               | 7.944   | 7.073  | metrics
CenterMask | EfficientNet-B0 | 0.29               | 3.753   | 3.345  | metrics

Side-by-Side Comparison

(Images: example predictions from CenterMask-VoVNet, Mask R-CNN, CenterMask-SimpleNet, and CenterMask-EfficientNetB0)

Image or Video Demo

To run inference on images or video, use CenterMask2/custom_demo.py for Mask R-CNN or CenterMask2/projects/CenterMask2/custom_demo.py for CenterMask:

python custom_demo.py --config_file configs/COCO-InstanceSegmentation/mask_rcnn_R_50_FPN_3x.yaml --input input1.jpg input2.jpg --output results/ --confidence_threshold 0.6 --weights model_final.pth

For video inference, replace the --input files with --video video.mp4; --output is then the name of the output video, saved in the current directory.
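For example (an illustrative command reusing the flags above):

python custom_demo.py --config_file configs/COCO-InstanceSegmentation/mask_rcnn_R_50_FPN_3x.yaml --video video.mp4 --output result.mp4 --confidence_threshold 0.6 --weights model_final.pth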

If no --output is given, detected instances are shown in a cv2 window.

Training on Open Images

The dataset has to be loaded as a JSON file in COCO format, together with a folder of training images. Every mask must have an annotation, with at least 6 polygon points in 'segmentation':

annotation{
  "id": int,
  "image_id": int,
  "category_id": int,
  "segmentation": RLE or [polygon],
  "area": float,
  "bbox": [x, y, width, height],
  "iscrowd": 0 or 1
}

categories[{
  "id": int,
  "name": str,
  "supercategory": str
}]
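A concrete (hypothetical) annotation in this format might look like:

{
  "id": 1,
  "image_id": 42,
  "category_id": 3,
  "segmentation": [[120.5, 80.0, 150.0, 85.5, 155.0, 110.0, 125.0, 115.0]],
  "area": 987.5,
  "bbox": [120.5, 80.0, 34.5, 35.0],
  "iscrowd": 0
}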

Change the JSON file and training image directory used by get_train_dicts() in openimages_utils/data_dicts.py to those of the dataset you want to train on.
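Detectron2 then needs the loader registered under a dataset name; a minimal sketch of what that registration typically looks like (the name "openimages_train" and the class list are assumptions, get_train_dicts() is the repo's loader):

from detectron2.data import DatasetCatalog, MetadataCatalog
from openimages_utils.data_dicts import get_train_dicts

# Register the split under a name that configs can reference
DatasetCatalog.register("openimages_train", get_train_dicts)
# Attach class names for visualization and evaluation (hypothetical subset)
MetadataCatalog.get("openimages_train").set(thing_classes=["Person", "Car", "Dog"])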

To train, edit cfg.merge_from_file('path/to/config/file') in train.py with the config file of your choice, and load model weights via cfg.MODEL.WEIGHTS = 'path/to/model/weights'. Then simply execute train.py.
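A minimal sketch of those edits, assuming train.py follows Detectron2's DefaultTrainer pattern (all paths and the dataset name are placeholders):

from detectron2.config import get_cfg
from detectron2.engine import DefaultTrainer

cfg = get_cfg()
cfg.merge_from_file("path/to/config/file")    # config file of choice
cfg.MODEL.WEIGHTS = "path/to/model/weights"   # pretrained or resumed weights
cfg.DATASETS.TRAIN = ("openimages_train",)    # name registered above

trainer = DefaultTrainer(cfg)
trainer.resume_or_load(resume=False)
trainer.train()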

Training with the EfficientNet backbone requires installing PyTorch EfficientNet from https://github.com/lukemelas/EfficientNet-PyTorch, with its files added to the 'centermask/modeling/backbone' folder. The pretrained model is loaded in efficientnet.py.
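Loading the pretrained model with that package looks like the sketch below (from_pretrained and extract_features are the package's documented API; how efficientnet.py wires the features into the backbone is specific to this repo):

import torch
from efficientnet_pytorch import EfficientNet

# Downloads ImageNet-pretrained weights on first use
model = EfficientNet.from_pretrained('efficientnet-b0')
feats = model.extract_features(torch.randn(1, 3, 224, 224))
print(feats.shape)  # torch.Size([1, 1280, 7, 7])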

Validation

Change the JSON file and validation image directory used by get_val_dicts() in openimages_utils/data_dicts.py to those of the validation dataset.

Run it with

python validate.py --configs configs/COCO-InstanceSegmentation/mask_rcnn_R_50_FPN_3x.yaml --model_pth model_final.pth --mode evaluate --threshold 0.5

where --mode selects either 'infer' or 'evaluate' on the validation images.
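In Detectron2 terms, the 'evaluate' mode typically boils down to something like this sketch (COCOEvaluator and inference_on_dataset are standard Detectron2 utilities; the dataset name "openimages_val" is an assumption):

from detectron2.data import build_detection_test_loader
from detectron2.engine import DefaultPredictor
from detectron2.evaluation import COCOEvaluator, inference_on_dataset

predictor = DefaultPredictor(cfg)  # cfg with MODEL.WEIGHTS pointing at model_final.pth
evaluator = COCOEvaluator("openimages_val", cfg, False, output_dir="./output/")
val_loader = build_detection_test_loader(cfg, "openimages_val")
# Runs the model over the loader and prints COCO-style AP metrics
print(inference_on_dataset(predictor.model, evaluator, val_loader))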


Requirements

  • Python >= 3.6 (Conda)
  • PyTorch 1.3
  • torchvision that matches the PyTorch installation. You can install them together at pytorch.org to make sure of this.
  • OpenCV, needed by the demo and visualization
  • fvcore: pip install git+https://github.com/facebookresearch/fvcore
  • pycocotools: pip install cython; pip install git+https://github.com/philferriere/cocoapi.git#subdirectory=PythonAPI
  • VS2019 / CUDA 10.1 (not tested on older versions)

Several files must be changed manually:

file1: 
  {your env path}\Lib\site-packages\torch\include\torch\csrc\jit\argument_spec.h
  example:
  {C:\Miniconda3\envs\py36}\Lib\site-packages\torch\include\torch\csrc\jit\argument_spec.h(190)
    static constexpr size_t DEPTH_LIMIT = 128;
      change to -->
    static const size_t DEPTH_LIMIT = 128;
file2: 
  {your env path}\Lib\site-packages\torch\include\pybind11\cast.h
  example:
  {C:\Miniconda3\envs\py36}\Lib\site-packages\torch\include\pybind11\cast.h(1449)
    explicit operator type&() { return *(this->value); }
      change to -->
    explicit operator type&() { return *((type*)this->value); }

Build detectron2

After having the above dependencies, run:

conda activate {your env}

"C:\Program Files (x86)\Microsoft Visual Studio\2019\Enterprise\VC\Auxiliary\Build\vcvars64.bat"

git clone https://github.com/conansherry/detectron2

cd detectron2

python setup.py build develop

Note: you may need to rebuild detectron2 after reinstalling a different build of PyTorch.

Detectron2 is Facebook AI Research's next generation software system that implements state-of-the-art object detection algorithms. It is a ground-up rewrite of the previous version, Detectron, and it originates from maskrcnn-benchmark.

What's New

  • It is powered by the PyTorch deep learning framework.
  • Includes more features such as panoptic segmentation, densepose, Cascade R-CNN, rotated bounding boxes, etc.
  • Can be used as a library to support different projects on top of it. We'll open source more research projects in this way.
  • It trains much faster.

See our blog post for more demos and to learn about detectron2.

Installation

See INSTALL.md.

Quick Start

See GETTING_STARTED.md, or the Colab Notebook.

Learn more at our documentation. And see projects/ for some projects that are built on top of detectron2.

Model Zoo and Baselines

We provide a large set of baseline results and trained models available for download in the Detectron2 Model Zoo.

License

Detectron2 is released under the Apache 2.0 license.

Citing Detectron2

If you use Detectron2 in your research or wish to refer to the baseline results published in the Model Zoo, please use the following BibTeX entry.

@misc{wu2019detectron2,
  author =       {Yuxin Wu and Alexander Kirillov and Francisco Massa and
                  Wan-Yen Lo and Ross Girshick},
  title =        {Detectron2},
  howpublished = {\url{https://github.com/facebookresearch/detectron2}},
  year =         {2019}
}
