跳至主導覽 跳至搜尋 跳過主要內容

Multivariate Contrastive Predictive Coding with Sliding Windows for Disease Prediction from Electronic Health Records

  • Hongxu Yuan
  • , Xiaozhu Jing
  • , Yuzheng Yan
  • , Wuman Luo

研究成果: Article同行評審

摘要

Learning effective patient representations from electronic health records (EHRs) is crucial for improving disease prediction models. However, existing supervised learning methods are hindered by high labeling costs. Moreover, capturing complex temporal and multi-indicator relationships—as well as localized temporal pattern shifts in clinical settings—remains a significant challenge. To address these issues, the adaptive multi-indicator contrastive predictive coding (AMCPC) framework is proposed, a self-supervised pretraining approach tailored for EHR data. AMCPC utilizes an adaptive optimal window-size selection algorithm to segment patient visit sequences into temporal sub-windows, enabling the model to focus on localized, context-specific health patterns. Furthermore, by extending contrastive predictive coding (CPC) through a multi-indicator approach, AMCPC employs a 2D convolutional neural network to capture global correlations among medical indicators within each sub-window. Extensive experiments on three real-world clinical datasets demonstrate that AMCPC outperforms both fully supervised and existing self-supervised methods, particularly when trained with limited labeled data. AMCPC establishes an effective self-supervised pretraining framework for unlabeled EHR data, which can be fine-tuned with minimal labeled data—significantly enhancing downstream predictive performance and reducing reliance on large-scale labeled datasets.

原文English
文章編號e202500818
期刊Advanced Intelligent Systems
8
發行號3
DOIs
出版狀態Published - 3月 2026

UN SDG

此研究成果有助於以下永續發展目標

  1. Good health and well being
    Good health and well being

指紋

深入研究「Multivariate Contrastive Predictive Coding with Sliding Windows for Disease Prediction from Electronic Health Records」主題。共同形成了獨特的指紋。

引用此