TY - JOUR
T1 - SFCANet
T2 - Channel Attention in Spatial-Frequency Domain for Infrared Small Target Detection
AU - Lin, Zijin
AU - Huang, Guoheng
AU - Li, Ming
AU - Yuan, Xiaochen
AU - Yue, Guanghui
AU - Pun, Chi Man
AU - Cheng, Lianglun
N1 - Publisher Copyright:
© 1965-2011 IEEE.
PY - 2025/10
Y1 - 2025/10
N2 - Recently, the field of infrared small target detection (IRSTD) in the spatial domain has seen rapid development. Nonetheless, distinguishing noise that closely mimics the target in the spatial domain remains a formidable task when relying solely on multiscale spatial features. Consequently, it is of great significance to explore the combination of frequency and spatial domain characteristics to aid in the discriminative process. Building on this, we propose the spatial-frequency channel attention network (SFCANet), which is composed of the spatial-frequency channel attention module (SFCA) and the deep supervised multitask ensemble learning module (SMTEL). By fusing multiscale spatial and frequency features, the SFCA refines the process of target feature extraction, thereby enhancing the capability for continuous modeling complex backgrounds. This aids in the discrimination between noise in the background and actual faint small targets. Furthermore, we introduce SMTEL to mitigate information loss in deep supervision multitask learning, particularly during extensive upsampling processes at low resolutions. Our SFCANet, by integrating multiscale spatial and frequency domain information, effectively directs the attention of network to the continuous modeling of backgrounds, while also compensating for the information loss caused by aggressive upsampling. This effectively enhances the detection accuracy for small infrared targets. Experiments conducted on three public datasets, IRSTD-1 K, NUDT-SIRST, and SIRST-V1 demonstrate the superiority of SFCANet in IRSTD.
AB - Recently, the field of infrared small target detection (IRSTD) in the spatial domain has seen rapid development. Nonetheless, distinguishing noise that closely mimics the target in the spatial domain remains a formidable task when relying solely on multiscale spatial features. Consequently, it is of great significance to explore the combination of frequency and spatial domain characteristics to aid in the discriminative process. Building on this, we propose the spatial-frequency channel attention network (SFCANet), which is composed of the spatial-frequency channel attention module (SFCA) and the deep supervised multitask ensemble learning module (SMTEL). By fusing multiscale spatial and frequency features, the SFCA refines the process of target feature extraction, thereby enhancing the capability for continuous modeling complex backgrounds. This aids in the discrimination between noise in the background and actual faint small targets. Furthermore, we introduce SMTEL to mitigate information loss in deep supervision multitask learning, particularly during extensive upsampling processes at low resolutions. Our SFCANet, by integrating multiscale spatial and frequency domain information, effectively directs the attention of network to the continuous modeling of backgrounds, while also compensating for the information loss caused by aggressive upsampling. This effectively enhances the detection accuracy for small infrared targets. Experiments conducted on three public datasets, IRSTD-1 K, NUDT-SIRST, and SIRST-V1 demonstrate the superiority of SFCANet in IRSTD.
KW - Deep supervision
KW - frequency attention
KW - infrared small target detection (IRSTD)
KW - spatial attention
KW - u-net
UR - https://www.scopus.com/pages/publications/105007892429
U2 - 10.1109/TAES.2025.3577586
DO - 10.1109/TAES.2025.3577586
M3 - Article
AN - SCOPUS:105007892429
SN - 0018-9251
VL - 61
SP - 13363
EP - 13379
JO - IEEE Transactions on Aerospace and Electronic Systems
JF - IEEE Transactions on Aerospace and Electronic Systems
IS - 5
ER -