TY - GEN
T1 - PatSimBoosting
T2 - 10th International Conference on Internet of Things, Big Data and Security, IoTBDS 2025
AU - Yan, Yuzheng
AU - Yu, Ziyue
AU - Luo, Wuman
N1 - Publisher Copyright:
Copyright © 2025 by SCITEPRESS - Science and Technology Publications, Lda.
PY - 2025
Y1 - 2025
N2 - Patient representation learning based on electronic health records (EHR) is crucial for disease prediction. So far, various deep learning-based methods have been proposed and have made great progress. In particular, recent research has shown that trends and variations of dynamic features are of great importance in patient representation learning. However, these methods ignored the similarity between the patients. Although a number of similarity-based methods have been proposed for patient representation learning, they regarded each dynamic feature as a whole in similarity detection and failed to utilize the important fine-grained characteristics of each feature. To address this issue, we propose a Patient Similarity-Based Representation Boosting framework (PatSimBoost) to enhance patient representation for disease prediction based on EHR. Our proposed framework consists of four modules: Frequency Extraction Module (FEM), Similarity Calculation Module (SCM), Patient Representation Learning Module (PRLM), and Prediction Module (PM). FEM extracts trends and variations of dynamic features, while SCM employs Dynamic Time Warping (DTW) to assess the similarity between patients. PRLM learns patient representations, and the PM utilizes the representation of the most similar patient, along with the current patient’s representation, to perform disease prediction. Experimental results on two real-world public datasets demonstrate that PatSimBoost outperforms existing state-of-the-art methods in terms of F1-score, AUROC, and AUPRC.
AB - Patient representation learning based on electronic health records (EHR) is crucial for disease prediction. So far, various deep learning-based methods have been proposed and have made great progress. In particular, recent research has shown that trends and variations of dynamic features are of great importance in patient representation learning. However, these methods ignored the similarity between the patients. Although a number of similarity-based methods have been proposed for patient representation learning, they regarded each dynamic feature as a whole in similarity detection and failed to utilize the important fine-grained characteristics of each feature. To address this issue, we propose a Patient Similarity-Based Representation Boosting framework (PatSimBoost) to enhance patient representation for disease prediction based on EHR. Our proposed framework consists of four modules: Frequency Extraction Module (FEM), Similarity Calculation Module (SCM), Patient Representation Learning Module (PRLM), and Prediction Module (PM). FEM extracts trends and variations of dynamic features, while SCM employs Dynamic Time Warping (DTW) to assess the similarity between patients. PRLM learns patient representations, and the PM utilizes the representation of the most similar patient, along with the current patient’s representation, to perform disease prediction. Experimental results on two real-world public datasets demonstrate that PatSimBoost outperforms existing state-of-the-art methods in terms of F1-score, AUROC, and AUPRC.
KW - Disease Prediction
KW - Electronic Health Records
KW - Patient Representation Learning
KW - Similarity
UR - http://www.scopus.com/inward/record.url?scp=105003708282&partnerID=8YFLogxK
U2 - 10.5220/0013224300003944
DO - 10.5220/0013224300003944
M3 - Conference contribution
AN - SCOPUS:105003708282
T3 - International Conference on Internet of Things, Big Data and Security, IoTBDS - Proceedings
SP - 48
EP - 55
BT - Proceedings of the 10th International Conference on Internet of Things, Big Data and Security, IoTBDS 2025
A2 - Emrouznejad, Ali
A2 - Hung, Patrick
A2 - Jacobsson, Andreas
PB - Science and Technology Publications, Lda
Y2 - 6 April 2025 through 8 April 2025
ER -