Skip to main navigation Skip to search Skip to main content

MADAT: Missing-aware dynamic adaptive transformer model for medical prognosis prediction with incomplete multimodal data

  • Jianbin He
  • , Guoheng Huang
  • , Xiaochen Yuan
  • , Chi Man Pun
  • , Guo Zhong
  • , Qi Yang
  • , Ling Guo
  • , Siyu Zhu
  • , Baiying Lei
  • , Haojiang Li
  • Guangdong University of Technology
  • University of Macau
  • Guangdong University of Foreign Studies
  • Sun Yat-Sen University Cancer Center
  • Shenzhen University

Research output: Contribution to journalArticlepeer-review

Abstract

Multimodal medical prognosis prediction has shown great potential in improving diagnostic accuracy by integrating various data types. However, incomplete multimodality, where certain modalities are missing, poses significant challenges to model performance. Current methods, including dynamic adaptation and modality completion, have limitations in handling incomplete multimodality comprehensively. Dynamic adaptation methods fail to fully utilize modality interactions as they only process available modalities. Modality completion methods address inter-modal relationships but risk generating unreliable data, especially when key modalities are missing, since existing modalities cannot replicate unique features of absent ones. This compromises fusion quality and degrades model performance. To address these challenges, we propose the Missing-aware Dynamic Adaptive Transformer (MADAT) model, which integrates two phases: the Decoupling Generalization Completion Phase (DGCP), the Adaptive Cross-Fusion Phase (ACFP). The DGCP reconstructs missing modalities by generating inter-modal and intra-modal shared information using Progressive Transformation Recursive Gated Convolutions (PTRGC) and Wavelet Alignment Domain Generalization (WADG). The ACFP, which incorporates Cross-Agent Attention (CAA) and Generation Quality Feedback Regulation (GQFR), adaptively fuses the original and generated modality features. CAA ensures thorough integration and alignment of the features, while GQFR dynamically adjusts the model's reliance on the generated features based on their quality, preventing over-dependence on low-quality data. Experiments on three private nasopharyngeal carcinoma datasets demonstrate that MADAT outperforms existing methods, achieving superior robustness in medical multimodal prediction under conditions of incomplete multimodality.

Original languageEnglish
Pages (from-to)103958
Number of pages1
JournalMedical Image Analysis
Volume110
DOIs
Publication statusPublished - 1 May 2026

Keywords

  • Dynamic adaptation
  • Medical prognosis prediction
  • Missing modality
  • Modality completion
  • Multimodal data

Fingerprint

Dive into the research topics of 'MADAT: Missing-aware dynamic adaptive transformer model for medical prognosis prediction with incomplete multimodal data'. Together they form a unique fingerprint.

Cite this