Abstract
Recently, the field of infrared small target detection (IRSTD) in the spatial domain has seen rapid development. Nonetheless, distinguishing noise that closely mimics the target in the spatial domain remains a formidable task when relying solely on multiscale spatial features. Consequently, it is of great significance to explore the combination of frequency and spatial domain characteristics to aid in the discriminative process. Building on this, we propose the spatial-frequency channel attention network (SFCANet), which is composed of the spatial-frequency channel attention module (SFCA) and the deep supervised multitask ensemble learning module (SMTEL). By fusing multiscale spatial and frequency features, the SFCA refines the process of target feature extraction, thereby enhancing the capability for continuous modeling complex backgrounds. This aids in the discrimination between noise in the background and actual faint small targets. Furthermore, we introduce SMTEL to mitigate information loss in deep supervision multitask learning, particularly during extensive upsampling processes at low resolutions. Our SFCANet, by integrating multiscale spatial and frequency domain information, effectively directs the attention of network to the continuous modeling of backgrounds, while also compensating for the information loss caused by aggressive upsampling. This effectively enhances the detection accuracy for small infrared targets. Experiments conducted on three public datasets, IRSTD-1 K, NUDT-SIRST, and SIRST-V1 demonstrate the superiority of SFCANet in IRSTD.
| Original language | English |
|---|---|
| Pages (from-to) | 13363-13379 |
| Number of pages | 17 |
| Journal | IEEE Transactions on Aerospace and Electronic Systems |
| Volume | 61 |
| Issue number | 5 |
| DOIs | |
| Publication status | Published - Oct 2025 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- Deep supervision
- frequency attention
- infrared small target detection (IRSTD)
- spatial attention
- u-net
Fingerprint
Dive into the research topics of 'SFCANet: Channel Attention in Spatial-Frequency Domain for Infrared Small Target Detection'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver