跳至主導覽 跳至搜尋 跳過主要內容

3MOS: a multi-source, multi-resolution, and multi-scene optical-SAR dataset with insights for multi-modal image matching

  • Yibin Ye
  • , Xichao Teng
  • , Hongrui Yang
  • , Shuo Chen
  • , Yuli Sun
  • , Yijie Bian
  • , Tao Tan
  • , Zhang Li
  • , Qifeng Yu
  • National University of Defense Technology
  • Hunan Key Laboratory for Image Measurement and Vision Navigation
  • Hunan Institute of Advanced Technology

研究成果: Article同行評審

4 引文 斯高帕斯(Scopus)

摘要

Optical-SAR image matching is a fundamental task for remote sensing applications. While existing methods perform well on some popular datasets such as SEN1-2 and WHU-SEN-City, their generalizability across diverse data sources such as satellites, spatial resolutions, and scenes remains insufficiently investigated, hindering the practical implementation of optical-SAR matching in various downstream tasks. Thus, 3MOS, the first multi-source, multi-resolution, and multi-scene optical-SAR dataset, was proposed in our study to address this gap. This dataset consists of 113k optical-SAR image pairs, with the SAR data collected from five satellites and resolutions ranging from 3.5 m to 12.5 m, further categorized into eight scenes, such as urban, rural, and plains through a simple but practical classification strategy. Based on this dataset, the performance of optical-SAR matching methods was evaluated through the data with diverse characteristics. Additionally, extensive experiments were conducted, and the following two findings were obtained. 1) None of the state-of-the-art methods achieved consistently superior performance across different sources, resolutions, and scenes, specifying significant generalization challenges for diverse downstream task data. 2) Training data distribution significantly impacted the matching performance of deep-learning models, highlighting the domain adaptation challenge in optical-SAR image matching. Furthermore, the practical utility of the dataset was comprehensively validated through multimodal change detection experiments, demonstrating its substantial value for a wide range of downstream applications.

原文English
文章編號19
期刊Visual Intelligence
3
發行號1
DOIs
出版狀態Published - 12月 2025

指紋

深入研究「3MOS: a multi-source, multi-resolution, and multi-scene optical-SAR dataset with insights for multi-modal image matching」主題。共同形成了獨特的指紋。

引用此