跳至主導覽 跳至搜尋 跳過主要內容

QWNet: A quaternion wavelet network for spatial-frequency aware multi-modal image fusion

  • Jietao Yang
  • , Miaoshan Lin
  • , Guoheng Huang
  • , Xuhang Chen
  • , Xiaofeng Zhang
  • , Xiaochen Yuan
  • , Chi Man Pun
  • , Bingo Wing Kuen Ling

研究成果: Article同行評審

1 引文 斯高帕斯(Scopus)

摘要

Multi-modal Image Fusion (MMIF) enhances visual tasks by combining the strengths of different image modalities to improve object visibility and texture details. However, existing methods face two major challenges: First, a lack of intrinsic frequency-domain awareness, relying heavily on complex filters and fusion techniques that can be less adaptive. Second, simplistic channel combination that overlooks essential complex inter-channel relationships. To address these issues, we propose QWNet, a novel Quaternion Wavelet Network that harnesses both spatial and frequency information to enhance the network's inductive bias towards local features. By integrating wavelet transforms, we decompose input modalities into high- and low-frequency components, capturing global structures and fine details. These components are represented as quaternions, enabling the network to model complex inter-channel dependencies often missed by traditional real-valued networks. We also introduce a Bidirectional Adaptive Attention Module (BAAM) for effective multi-modal information interaction and difference enhancement, and a Quaternion Cross-modal Fusion Module (QCFM) to strengthen inter-channel relationships and effectively combine key features from different modalities. Extensive experiments confirm that our QWNet outperforms existing methods in fusion quality and downstream tasks like semantic segmentation, using only 4.27 K parameters and a computational cost of 0.30G FLOPs. The source code will be available at https://github.com/Mrzhans/QWNet.

原文English
文章編號108364
期刊Neural Networks
196
DOIs
出版狀態Published - 4月 2026

指紋

深入研究「QWNet: A quaternion wavelet network for spatial-frequency aware multi-modal image fusion」主題。共同形成了獨特的指紋。

引用此