An SAM Fine-Tuning Framework With Frequency-Domain Interactive LoRA for Remote Sensing Change Detection

Research output: Contribution to journalArticlepeer-review

Abstract

Achieving high-accuracy remote sensing change detection (RSCD) algorithms requires high-quality semantic feature extraction from remote sensing images (RSIs). Due to its powerful general-purpose feature extraction capability, the segment anything model (SAM) has found wide application across diverse fields. However, SAM may not be optimally suited for RSIs. To address this limitation, we propose a frequency-domain interactive low-rank adaptation (LoRA) fine-tuning architecture (FILFArch) to enhance the performance of SAM in RSCD tasks. Based on FILFArch, we then develop two task-specific algorithms: the FILFBCD for binary change detection (BCD) and the FILFSCD for semantic change detection (SCD). To enhance the capability of SAM in capturing bi-temporal RSIs feature relationship, the bi-temporal interaction fusion LoRA (BIF-LoRA) is designed with a Siamese architecture. Within BIF-LoRA, frequency-domain feature interaction (FDFI) utilizes fast Fourier transform block (FFTB) to fuse bi-temporal frequency-domain features. This enables cross-temporal frequency-domain interaction, effectively discriminating spatiotemporal feature differences. Additionally, we use a shared BCD Decoder to serve as the binary change detector for both FILFBCD and FILFSCD. The BCD Decoder first applies a coarse difference feature extraction (CDFE) to coarsely fuse deep semantic features, yielding a coarse-grained change feature map. Subsequently, a frequency-domain feature enhancement (FDFE) refines these abstract features to generate a fine-grained change map. In FILFSCD, FDFE is further utilized to recover the semantic change information of each temporal RSI. Experimental results demonstrate that FILFBCD achieves the highest F1 scores of 83.53%, 66.75%, and 83.79% on BCD datasets MLCD, S2Looking, and SYSU-CD, respectively. Meanwhile, FILFSCD achieves the highest F1 scores of 64.05% and 87.02% on SCD datasets SECOND and DSCD, respectively. These results demonstrate the effectiveness and versatility of the proposed FILFArch for RSCD tasks.

Original languageEnglish
Article number4500519
JournalIEEE Transactions on Geoscience and Remote Sensing
Volume64
DOIs
Publication statusPublished - 2026

Keywords

  • Binary change detection (BCD)
  • frequency-domain interactive LoRA
  • remote sensing change detection
  • segment anything model (SAM)
  • semantic change detection (SCD)

Fingerprint

Dive into the research topics of 'An SAM Fine-Tuning Framework With Frequency-Domain Interactive LoRA for Remote Sensing Change Detection'. Together they form a unique fingerprint.

Cite this