跳至主導覽 跳至搜尋 跳過主要內容

HDPL: Hypergraph-based Dynamic Prompting Learning for Incomplete Multimodal Medical Learning

  • Xiaomin Zhou
  • , Guoheng Huang
  • , Qin Zhao
  • , Jianbin He
  • , Xiaochen Yuan
  • , Ming Li
  • , Chi Man Pun
  • , Ling Guo
  • , Baiying Lei
  • , Qi Yang

研究成果: Article同行評審

摘要

Multimodal learning has garnered significant attention in the medical field due to its ability to provide a more comprehensive perspective utilizing various types of data, that aids in making more accurate decisions. However, the complexity of medical data, coupled with missing modalities, severely hinders predictive accuracy. Existing methods for multimodal learning with missing modalities still face considerable challenges. For instance, approaches that construct multimodal shared feature spaces often result in high computational costs, while methods that infer missing modalities based on complete ones may overly rely on the complete modalities, potentially skewing results. Pre-trained transformer methods address these issues but still have limitations, such as it can only process one missing modality at testing-stage. This is partly because structured data, unlike sequential data, lacks inherent minimum semantic units or natural order. Additionally, the positional encodings generated by this type of methods may introduce information interference when applied to structured data, leading to poor alignment with sequential data during modality fusion in transformer models. To tackle these challenges, we introduce HDPL: Hypergraph-based Dynamic Prompt Learning for Incomplete Multimodal Medical Learning, comprising three modules. The High-Order Hypergraph Embedding module can identify the minimal semantic units within structured data and utilizes hypergraph structures to extract high-dimensional features from clinical data. The Multimodal Medical Data Integrator module closes the distance of the embedding vectors corresponding in the shared space of modality-features, facilitating the integration of modalities in transformer. The Dynamic Network Structure Optimization module is a dynamic learning network by dynamically change the width and depth of network, improving the overall performance of the model, and it alleviates the shortcomings caused by incomplete modality to some extent. Through comprehensive experimentation, we demonstrate the efficiency and robustness of our model in dealing missing modalities and reducing training-burdens.

原文English
期刊IEEE Journal of Biomedical and Health Informatics
DOIs
出版狀態Accepted/In press - 2026

UN SDG

此研究成果有助於以下永續發展目標

  1. Good health and well being
    Good health and well being

指紋

深入研究「HDPL: Hypergraph-based Dynamic Prompting Learning for Incomplete Multimodal Medical Learning」主題。共同形成了獨特的指紋。

引用此