Spherical image object detection is to detect objects in spherical images (360 degree panorams) which have spherical deformations.
As the spherical images have deformations, the traditional axis-aligned rectangles can not be used as the bounding boxes for objects. Here we use spherical rectangles as the bounding boxes for spherical objects.
Our Unbiased Spherical IoU first calculates the area of each spherical rectangle, then calculates the intersection area of the two spherical rectangles. Finally, we compute the spherical IoU.
First, the area of each spherical rectangle can be computed according to the following formula. (The derivation is given in the supplementary material of our paper.)
Second, the computation of intersection area contains 3 steps:
-
Step 1. Compute intersection points between boundaries of the two spherical rectangles.
-
Step 2. Remove unnecessary points by two sub-steps:
- Sub-step 1: Remove points outside the two spherical rectangles.
- Sub-step 2: Remove redundant Points. (This step is not required for most cases.)
-
Step 3. Compute all angles and the final intersection area.
Finally, the spherical IoU is computed by the following formula
The RBFoV is defined by
PANDORA (PANoramic Detection Dataset for Object with ORientAtion) is a dataset for object detection in panoramic images. It contains 3,000 real-world indoor panoramic images with high-resolution (1920 × 960) and 94,353 bounding boxes of 47 categories that locate from low latitude regions to high latitude regions. Note that the annotations of the bounding box in the PANDORA dataset are defined by RBFoV our proposed.
The dataset can be downloaded from [here](https://drive.google.com/file/d/1JAGReczN_h3F3mY-mlGTVSeDx-CCJigC/view?usp=sharing). The annotation is in COCO format.We propose two anchor-free object detection methods based on CenterNet. One is called Sph-CenterNet, the other one is called R-CenterNet. These networks take panoramic images as input, and predict heatmaps, offsets, sizes. R-CenterNet additionally predicts rotation angle for each object.
If you use our Unbiased IoU in your research or Sph-CenterNet, please cite our AAAI work as
@inproceedings{SphIoU,
title = {Unbiased IoU for Spherical Image Object Detection},
author = {Dai, Feng and Chen, Bin and Xu, Hang and Ma, Yike and Li, Xiaodong and Feng, Bailan and Yuan, Peng and Yan, Chenggang and Zhao, Qiang},
booktitle = {Proceedings of AAAI Conference on Artificial Intelligence},
year = {2022},
pages = {508-515},
}
If you use the PANDORA dataset, RBFoV or R-CenterNet, please cite our ECCV work as
@inproceedings{PANDORA,
title={PANDORA: A Panoramic Detection Dataset for Object with Orientation},
author={Xu, Hang and Zhao, Qiang and Ma, Yike and Li, Xiaodong and Yuan, Peng and Feng, Bailan and Yan, Chenggang and Dai, Feng},
booktitle={ECCV},
year={2022}
}