TY - GEN
T1 - A Multi-object Detection Sampling Algorithm For Large Scenes
AU - Jin, Liang
AU - Li, Xiaochuan
AU - Fan, Baoyu
AU - Guo, Zhenhua
AU - Li, Ruidong
AU - Wang, Li
AU - Wang, Yanwei
AU - Zhao, Yaqian
AU - Li, Rengang
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Multi-object detection in large scenes aims to find objects in images, which usually contain more than one billion pixels. Based on the concept of dividing and conquering, the state-of-the-art (SOTA) methods slice the super-resolution image into patches first and then lower the image solution to detect objects later. The advantage of this method is that it can adapt quickly to regular detection algorithms. However, a set of parameters needs to be set manually, such as the size of sliding windows and overlap, which is quite hard to fit all scenarios. It may result in a loss of samples located at the boundary of the sliding window and the oversampling of inefficient samples that appear within the overlap. In this paper, we propose a object-oriented image sampling algorithm based on anchor boxes during training and multi-scale pyramids during inference. Inspired by the mature object detection baseline Scale-YOLOv4, we present more tricks to fit large scenes. The accuracy can reach 66%, which is 24 points higher than the CascadeRCNN model of the official backbone network ResNet50. Finally, we have won first place in the PANDA object detection tracking using this method.
AB - Multi-object detection in large scenes aims to find objects in images, which usually contain more than one billion pixels. Based on the concept of dividing and conquering, the state-of-the-art (SOTA) methods slice the super-resolution image into patches first and then lower the image solution to detect objects later. The advantage of this method is that it can adapt quickly to regular detection algorithms. However, a set of parameters needs to be set manually, such as the size of sliding windows and overlap, which is quite hard to fit all scenarios. It may result in a loss of samples located at the boundary of the sliding window and the oversampling of inefficient samples that appear within the overlap. In this paper, we propose a object-oriented image sampling algorithm based on anchor boxes during training and multi-scale pyramids during inference. Inspired by the mature object detection baseline Scale-YOLOv4, we present more tricks to fit large scenes. The accuracy can reach 66%, which is 24 points higher than the CascadeRCNN model of the official backbone network ResNet50. Finally, we have won first place in the PANDA object detection tracking using this method.
KW - Object Detection
KW - PANDA
KW - Super Resolution
UR - http://www.scopus.com/inward/record.url?scp=85146576854&partnerID=8YFLogxK
U2 - 10.1109/PAAP56126.2022.10010614
DO - 10.1109/PAAP56126.2022.10010614
M3 - Conference contribution
AN - SCOPUS:85146576854
T3 - Proceedings - International Symposium on Parallel Architectures, Algorithms and Programming, PAAP
BT - Proceedings - 2022 IEEE 13th International Symposium on Parallel Architectures, Algorithms and Programming, PAAP 2022
PB - IEEE Computer Society
T2 - 13th IEEE International Symposium on Parallel Architectures, Algorithms and Programming, PAAP 2022
Y2 - 25 November 2022 through 27 November 2022
ER -