Quick-MIMIC: A Multimodal Data Extraction Pipeline for MIMIC with Parallelization

  • Yutao Dou
  • Wei Li
  • Yangtao Zheng
  • Xiaojun Yao
  • Huanxiang Liu
  • Albert Y. Zomaya
  • Shaoliang Peng
  • Hunan University
  • University of Sydney
  • Macao Polytechnic University

Research output: Article, peer-reviewed

4 citations (Scopus)

Abstract

Medical big data with artificial intelligence are vital in advancing digital medicine. However, the opaque and non-standardised nature embedded in most medical data extraction is prone to batch effects and has become a significant obstacle to reproducing previous works. This paper aims to develop an easy-to-use time-series multimodal data extraction pipeline, Quick-MIMIC, for standardised data extraction from MIMIC datasets. Our method can fully integrate different data structures into a time-series table, including structured, semi-structured, and unstructured data. We also introduce two additional modules to Quick-MIMIC, a pipeline parallelization method and data analysis methods, for reducing the data extraction time and presenting the characteristics of the extracted data intuitively. The extensive experimental results show that our pipeline can efficiently extract the needed data from the MIMIC dataset and convert it into the correct format for further analytic tasks.
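The idea of folding heterogeneous records into one time-series table and extracting patients in parallel can be illustrated with a minimal sketch. This is not the Quick-MIMIC implementation: the function names (`build_patient_table`, `extract_all`), the event tuple layout, the hourly binning, the "last value in a bin wins" rule, and the use of a thread pool are all assumptions chosen for illustration, and the sample records are made up.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def build_patient_table(events, bin_hours=1.0):
    """Merge one patient's events into rows keyed by time bin.

    events: iterable of (hours_since_admission, feature_name, value).
    Values may be numeric (structured) or free text (unstructured).
    Assumption: when a feature repeats within a bin, the latest value wins.
    """
    bins = defaultdict(dict)
    # Sort by time so later events overwrite earlier ones in the same bin.
    for hours, feature, value in sorted(events, key=lambda e: e[0]):
        bins[int(hours // bin_hours)][feature] = value
    return [{"bin": b, **bins[b]} for b in sorted(bins)]


def extract_all(events_by_patient, max_workers=4):
    """Build every patient's table concurrently (hypothetical helper)."""
    patient_ids = list(events_by_patient)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        tables = pool.map(
            build_patient_table,
            (events_by_patient[p] for p in patient_ids),
        )
        # Consume the result iterator while the pool is still open.
        return dict(zip(patient_ids, tables))


# Made-up sample records, not taken from MIMIC.
sample = {
    "p1": [
        (0.2, "heart_rate", 88),
        (0.7, "heart_rate", 92),
        (1.5, "note", "stable overnight"),
    ],
    "p2": [(3.0, "lactate", 2.1)],
}
tables = extract_all(sample)
```

Because each patient is processed independently, the per-patient function can be handed to any executor; for CPU-bound extraction a process pool may be a better fit than the thread pool assumed here.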

Original language: English
Pages (from-to): 1333-1346
Number of pages: 14
Journal: Big Data Mining and Analytics
Volume: 7
Issue number: 4
DOIs
Publication status: Published - Dec 2024
