跳至主導覽 跳至搜尋 跳過主要內容

MADAT: Missing-aware dynamic adaptive transformer model for medical prognosis prediction with incomplete multimodal data

  • Jianbin He
  • , Guoheng Huang
  • , Xiaochen Yuan
  • , Chi Man Pun
  • , Guo Zhong
  • , Qi Yang
  • , Ling Guo
  • , Siyu Zhu
  • , Baiying Lei
  • , Haojiang Li
  • Guangdong University of Technology
  • University of Macau
  • Guangdong University of Foreign Studies
  • Sun Yat-Sen University Cancer Center
  • Shenzhen University

研究成果: Article同行評審

摘要

Multimodal medical prognosis prediction has shown great potential in improving diagnostic accuracy by integrating various data types. However, incomplete multimodality, where certain modalities are missing, poses significant challenges to model performance. Current methods, including dynamic adaptation and modality completion, have limitations in handling incomplete multimodality comprehensively. Dynamic adaptation methods fail to fully utilize modality interactions as they only process available modalities. Modality completion methods address inter-modal relationships but risk generating unreliable data, especially when key modalities are missing, since existing modalities cannot replicate unique features of absent ones. This compromises fusion quality and degrades model performance. To address these challenges, we propose the Missing-aware Dynamic Adaptive Transformer (MADAT) model, which integrates two phases: the Decoupling Generalization Completion Phase (DGCP), the Adaptive Cross-Fusion Phase (ACFP). The DGCP reconstructs missing modalities by generating inter-modal and intra-modal shared information using Progressive Transformation Recursive Gated Convolutions (PTRGC) and Wavelet Alignment Domain Generalization (WADG). The ACFP, which incorporates Cross-Agent Attention (CAA) and Generation Quality Feedback Regulation (GQFR), adaptively fuses the original and generated modality features. CAA ensures thorough integration and alignment of the features, while GQFR dynamically adjusts the model's reliance on the generated features based on their quality, preventing over-dependence on low-quality data. Experiments on three private nasopharyngeal carcinoma datasets demonstrate that MADAT outperforms existing methods, achieving superior robustness in medical multimodal prediction under conditions of incomplete multimodality.

原文English
頁(從 - 到)103958
頁數1
期刊Medical Image Analysis
110
DOIs
出版狀態Published - 1 5月 2026

指紋

深入研究「MADAT: Missing-aware dynamic adaptive transformer model for medical prognosis prediction with incomplete multimodal data」主題。共同形成了獨特的指紋。

引用此