TY - JOUR
T1 - An SAM Fine-Tuning Framework With Frequency-Domain Interactive LoRA for Remote Sensing Change Detection
AU - Huang, Junqing
AU - Ji, Shucheng
AU - Wang, Yapeng
AU - Xia, Min
AU - Yuan, Xiaochen
N1 - Publisher Copyright: © 1980-2012 IEEE.
PY - 2026
Y1 - 2026
AB - High-accuracy remote sensing change detection (RSCD) requires high-quality semantic feature extraction from remote sensing images (RSIs). Owing to its powerful general-purpose feature extraction capability, the segment anything model (SAM) has found wide application across diverse fields. However, SAM may not be optimally suited for RSIs. To address this limitation, we propose a frequency-domain interactive low-rank adaptation (LoRA) fine-tuning architecture (FILFArch) to enhance the performance of SAM in RSCD tasks. Based on FILFArch, we develop two task-specific algorithms: FILFBCD for binary change detection (BCD) and FILFSCD for semantic change detection (SCD). To improve the ability of SAM to capture feature relationships between bi-temporal RSIs, the bi-temporal interaction fusion LoRA (BIF-LoRA) is designed with a Siamese architecture. Within BIF-LoRA, frequency-domain feature interaction (FDFI) uses a fast Fourier transform block (FFTB) to fuse bi-temporal frequency-domain features. This enables cross-temporal frequency-domain interaction, effectively discriminating spatiotemporal feature differences. Additionally, a shared BCD Decoder serves as the binary change detector for both FILFBCD and FILFSCD. The BCD Decoder first applies coarse difference feature extraction (CDFE) to fuse deep semantic features, yielding a coarse-grained change feature map. Subsequently, frequency-domain feature enhancement (FDFE) refines these abstract features to generate a fine-grained change map. In FILFSCD, FDFE is further used to recover the semantic change information of each temporal RSI. Experimental results demonstrate that FILFBCD achieves the highest F1 scores of 83.53%, 66.75%, and 83.79% on the BCD datasets MLCD, S2Looking, and SYSU-CD, respectively. Meanwhile, FILFSCD achieves the highest F1 scores of 64.05% and 87.02% on the SCD datasets SECOND and DSCD, respectively. These results demonstrate the effectiveness and versatility of the proposed FILFArch for RSCD tasks.
KW - Binary change detection (BCD)
KW - frequency-domain interactive LoRA
KW - remote sensing change detection
KW - segment anything model (SAM)
KW - semantic change detection (SCD)
UR - https://www.scopus.com/pages/publications/105027991285
U2 - 10.1109/TGRS.2026.3650952
DO - 10.1109/TGRS.2026.3650952
M3 - Article
AN - SCOPUS:105027991285
SN - 0196-2892
VL - 64
JO - IEEE Transactions on Geoscience and Remote Sensing
JF - IEEE Transactions on Geoscience and Remote Sensing
M1 - 4500519
ER -