BBPM: A Study of Information Pre-retrieval Models Based on Medical BERT Model

Shun Guo, Yaofei Duan, Jingzhi Huang, Dashun Zheng, Patrick Cheong Iao Pang, Henry H.Y. Tong

研究成果: Conference contribution同行評審

摘要

This study focuses on the preprocessing of medical paper retrieval, aiming to address the challenge of extracting information from lengthy medical texts. This process is crucial in facilitating effective medical research. We propose BioBERT Preprocessing Model (BBPM). By employing the cutting-edge medical BERT model, we predict the subject words of the paper's abstract. Subsequently, based on these subject words, we calculate secondary similarity using the corresponding medical model. Finally, we incorporate the most strongly associated words to augment the associative vocabulary of the article, thereby enhancing its effectiveness in subsequent text retrieval. Experimental results show that BioBERT shows superior performance in predicting subject words in long medical texts, with a basic text similarity of about 90%. The primary contribution of this paper lies in the integration of the latest BioBERT model for preprocessing the abstracts of medical papers. This yields additional subject terms necessary for retrieval. These terms are then integrated with the title keywords into the final retrieval system, leading to more efficient retrieval outcomes. This approach promises to provide fresh insights into medical information retrieval.

原文English
主出版物標題2023 9th International Conference on Computer and Communications, ICCC 2023
發行者Institute of Electrical and Electronics Engineers Inc.
頁面2389-2393
頁數5
ISBN(電子)9798350317251
DOIs
出版狀態Published - 2023
事件9th International Conference on Computer and Communications, ICCC 2023 - Hybrid, Chengdu, China
持續時間: 8 12月 202311 12月 2023

出版系列

名字2023 9th International Conference on Computer and Communications, ICCC 2023

Conference

Conference9th International Conference on Computer and Communications, ICCC 2023
國家/地區China
城市Hybrid, Chengdu
期間8/12/2311/12/23

指紋

深入研究「BBPM: A Study of Information Pre-retrieval Models Based on Medical BERT Model」主題。共同形成了獨特的指紋。

引用此