Cross-View Geo-Localization via Learning Correspondence Semantic Similarity Knowledge

Guanli Chen, Guoheng Huang, Xiaochen Yuan, Xuhang Chen, Guo Zhong, Chi Man Pun

研究成果: Conference contribution同行評審

摘要

Cross-view geo-localization aims at retrieving and estimating accurate geographic locations from ground images in a geo-tagged aerial image database. Existing approaches focus on two independent two-branch models to learn fine-grained representations of perspectives, neglecting to learn more discriminative representations through interactions. In this paper, we propose the GeoSSK method, which adapts the learning process of the model by learning local semantic similarity information between aerial and ground pairs via a new interaction module. We then transfer the semantic similarity knowledge learned during the interaction process to the student model through knowledge distillation. Specifically, we design a Cross-fusion Interaction Module (CIM) based on cross-attention, which learns local semantic similarity information between perspectives to adjust the learning of the model. Meanwhile, considering the presence of visual distractions in complex environments, we adjust the degree of interaction between perspectives by the Contribution Factor (CF) of the local representation to the global representation. In addition, we introduce Semantic Similarity Knowledge Distillation (SSKD) between teachers and students for cross-view geo-localization. The interaction learning model serves as the teacher, transferring its semantic similarity knowledge to the student. At the same time, we designed an Incorrect Knowledge Filter (IKF) to filter incorrect knowledge of teachers. Experimental results demonstrate the effectiveness and competitive performance of GeoSSK.

原文English
主出版物標題MultiMedia Modeling - 31st International Conference on Multimedia Modeling, MMM 2025, Proceedings
編輯Ichiro Ide, Ioannis Kompatsiaris, Changsheng Xu, Keiji Yanai, Wei-Ta Chu, Naoko Nitta, Michael Riegler, Toshihiko Yamasaki
發行者Springer Science and Business Media Deutschland GmbH
頁面220-233
頁數14
ISBN(列印)9789819620531
DOIs
出版狀態Published - 2025
事件31st International Conference on Multimedia Modeling, MMM 2025 - Nara, Japan
持續時間: 8 1月 202510 1月 2025

出版系列

名字Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
15520 LNCS
ISSN(列印)0302-9743
ISSN(電子)1611-3349

Conference

Conference31st International Conference on Multimedia Modeling, MMM 2025
國家/地區Japan
城市Nara
期間8/01/2510/01/25

指紋

深入研究「Cross-View Geo-Localization via Learning Correspondence Semantic Similarity Knowledge」主題。共同形成了獨特的指紋。

引用此