TY - JOUR
T1 - Deep Learning Models for Rotated Object Detection in Aerial Images
T2 - Survey and Performance Comparisons
AU - He, Jiaying
AU - Eddie Law, K. L.
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2024
Y1 - 2024
N2 - Rotated object detection in aerial images presents unique challenges due to the variability in object orientation and aspect ratios. In object detection, the common practice is to use boxes to enclose the spatial locations of discovered objects. Traditionally, these bounding boxes are rectangular in shape with edges parallel to the horizontal or vertical axes. However, with the ever-increasing content complexity of images, it would be desirable to rotate these bounding boxes properly to better fit the sizes and shapes of all semantic object locations in images with random orientations. Lately, through different deep learning model developments, there are noticeable improvements on the detection of rotated objects in images. Certainly, it always faces challenges with the most stringent expectations and accuracy requirements. Hence, the primary objective of this study is to analyze and compare state-of-the-art methodologies for rotated object detection in aerial imagery, and provide a comprehensive survey of the latest research status of it. From four different working principles, we systematically review recent advancements in deep learning architectures, including R-CNN variants, YOLO variants and Transformer variants adapted for rotated bounding boxes. Then we further carry out quantitative performance comparisons among the latest state-of-the-art rotated object detection algorithms using the benchmark dataset DOTA and computing facility. Among them, the RTMDeta outperforms all other modelsb. There are numerous important applications based on aerial images, for example, urban management, precision agriculture, environment monitoring, emergency rescue, and disaster relief, etc. This study contributes to advancing the understanding of rotated object detection in aerial imagery by providing insights into effective methodologies and highlighting areas for future research. The findings guide practitioners and researchers in selecting appropriate techniques for applications requiring robust rotated object detection in challenging aerial environments.
AB - Rotated object detection in aerial images presents unique challenges due to the variability in object orientation and aspect ratios. In object detection, the common practice is to use boxes to enclose the spatial locations of discovered objects. Traditionally, these bounding boxes are rectangular in shape with edges parallel to the horizontal or vertical axes. However, with the ever-increasing content complexity of images, it would be desirable to rotate these bounding boxes properly to better fit the sizes and shapes of all semantic object locations in images with random orientations. Lately, through different deep learning model developments, there are noticeable improvements on the detection of rotated objects in images. Certainly, it always faces challenges with the most stringent expectations and accuracy requirements. Hence, the primary objective of this study is to analyze and compare state-of-the-art methodologies for rotated object detection in aerial imagery, and provide a comprehensive survey of the latest research status of it. From four different working principles, we systematically review recent advancements in deep learning architectures, including R-CNN variants, YOLO variants and Transformer variants adapted for rotated bounding boxes. Then we further carry out quantitative performance comparisons among the latest state-of-the-art rotated object detection algorithms using the benchmark dataset DOTA and computing facility. Among them, the RTMDeta outperforms all other modelsb. There are numerous important applications based on aerial images, for example, urban management, precision agriculture, environment monitoring, emergency rescue, and disaster relief, etc. This study contributes to advancing the understanding of rotated object detection in aerial imagery by providing insights into effective methodologies and highlighting areas for future research. The findings guide practitioners and researchers in selecting appropriate techniques for applications requiring robust rotated object detection in challenging aerial environments.
KW - Aerial images
KW - deep learning
KW - oriented object detection
KW - rotated object detection
UR - http://www.scopus.com/inward/record.url?scp=85211454904&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2024.3509745
DO - 10.1109/ACCESS.2024.3509745
M3 - Article
AN - SCOPUS:85211454904
SN - 2169-3536
JO - IEEE Access
JF - IEEE Access
ER -