
Parallel Multimodal Language Model: Enhanced Breast Nodule Diagnosis through Parallel Multimodal Representations and Large Language Models

Research output: Article › Peer-reviewed

Abstract

Large language models (LLMs) have emerged in medical image analysis and can provide accurate, personalized medical services for doctors and patients. However, by relying solely on textual information and ignoring other modalities such as images, LLMs fail to achieve high accuracy in the early diagnosis of breast cancer and thus have not yet been seamlessly integrated into the clinical practice of breast cancer diagnosis. Therefore, this study proposes the Parallel Multimodal Language Model (PMLM), which combines images and text, integrating visual information with textual semantic information for the early screening and diagnosis of breast cancer; it improves screening and diagnostic accuracy while also enhancing access to health services. In addition, existing multimodal diagnostic methods are evaluated for comparison. The experimental results show that PMLM achieves an F1 score of 0.87 (95% CI: 0.85–0.89) and an area under the curve (AUC) of 0.90 (95% CI: 0.89–0.92) in the early diagnosis of breast cancer, both exceeding the existing baseline models.
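
The abstract does not disclose PMLM's internal architecture. As a purely illustrative aid, the sketch below shows one common way to realize parallel multimodal representations: two modality-specific encoders run in parallel, their embeddings are projected to a shared width, concatenated, and passed to a classification head. Every name here (ParallelFusionClassifier, img_dim, txt_dim, hidden) is a hypothetical placeholder, not the authors' design.

    # Minimal sketch of parallel multimodal fusion; NOT the authors' PMLM.
    # Assumes image and text features come from two parallel encoders
    # (e.g., a CNN and an LLM text encoder, both outside this snippet).
    import torch
    import torch.nn as nn

    class ParallelFusionClassifier(nn.Module):  # hypothetical name
        def __init__(self, img_dim=2048, txt_dim=768, hidden=256, n_classes=2):
            super().__init__()
            # Parallel projection branches, one per modality.
            self.img_proj = nn.Sequential(nn.Linear(img_dim, hidden), nn.ReLU())
            self.txt_proj = nn.Sequential(nn.Linear(txt_dim, hidden), nn.ReLU())
            # Fusion head over the concatenated joint representation.
            self.head = nn.Linear(2 * hidden, n_classes)

        def forward(self, img_feat, txt_feat):
            z = torch.cat([self.img_proj(img_feat), self.txt_proj(txt_feat)], dim=-1)
            return self.head(z)

    # Usage with random stand-ins for precomputed encoder outputs.
    model = ParallelFusionClassifier()
    logits = model(torch.randn(4, 2048), torch.randn(4, 768))
    print(logits.shape)  # torch.Size([4, 2])

Concatenation is only one fusion choice; cross-attention or gated fusion are equally plausible readings of "parallel multimodal representations" given the information in the abstract.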

Original language: English
Article number: 2500085
Journal: Advanced Intelligent Systems
Volume: 8
Issue number: 1
DOIs
Publication status: Published - January 2026

UN SDG

This research output contributes to the following Sustainable Development Goals

  1. Good health and well-being
