Contrastive Knowledge-Guided Large Language Models for Medical Report Generation

  • Yuyang Sha
  • Hongxin Pan
  • Weiyu Meng
  • Kefeng Li

Research output: Conference contribution, peer-reviewed

1 citation (Scopus)

Abstract

Automatic medical report generation (MRG) holds considerable research value and has the potential to significantly alleviate the workload of radiologists. Recently, the rapid development of large language models (LLMs) has improved the performance of MRG. However, numerous challenges still need to be addressed to achieve highly accurate medical reports. For instance, most existing methods struggle to interpret image details, lack relevant medical knowledge, and overlook fine-grained cross-modality alignment. To overcome these limitations, we propose a knowledge-guided vision-language alignment framework with contrastive learning and LLMs for medical report generation. The designed method leverages visual representations, relevant medical knowledge, and enhanced features to generate accurate reports via an LLM-based decoder. To improve the integration of medical-related information, we introduce the Knowledge Injection Module, which enhances the model’s feature representation capabilities while unlocking medical domain knowledge in LLMs. Inspired by the contrastive learning scheme, we introduce the Contrastive Alignment Module to align visual features and textual information effectively. Additionally, the Cross-Modality Enhancement Module retrieves similar reports for the input images to boost diagnostic accuracy. We conduct extensive experiments on two popular benchmark datasets, IU X-Ray and MIMIC-CXR. The results demonstrate that our proposed method achieves promising performance compared with state-of-the-art frameworks.
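
The abstract does not give the loss used by the Contrastive Alignment Module. As a rough illustration of what contrastive image-text alignment usually means, the sketch below computes a generic symmetric InfoNCE-style loss over a batch of paired image and report embeddings. This is an assumption for illustration, not the authors' formulation: the function names, the temperature value of 0.07, and the toy embeddings are all hypothetical.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length (assumes the vector is non-zero)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def diagonal_cross_entropy(logits):
    """Mean cross-entropy where the correct class for row i is column i."""
    total = 0.0
    for i, row in enumerate(logits):
        row_max = max(row)  # subtract the max for numerical stability
        log_sum_exp = row_max + math.log(sum(math.exp(x - row_max) for x in row))
        total += log_sum_exp - row[i]
    return total / len(logits)


def symmetric_contrastive_loss(image_feats, text_feats, temperature=0.07):
    """Generic symmetric InfoNCE-style loss over paired embeddings.

    image_feats[i] and text_feats[i] are assumed to come from the same
    study, so matching pairs lie on the diagonal of the similarity matrix.
    The temperature is a hypothetical default, not a value from the paper.
    """
    img = [l2_normalize(v) for v in image_feats]
    txt = [l2_normalize(v) for v in text_feats]
    # Cosine similarity between every image and every report, scaled.
    logits = [
        [sum(a * b for a, b in zip(i_vec, t_vec)) / temperature for t_vec in txt]
        for i_vec in img
    ]
    logits_transposed = [list(col) for col in zip(*logits)]
    image_to_text = diagonal_cross_entropy(logits)
    text_to_image = diagonal_cross_entropy(logits_transposed)
    return 0.5 * (image_to_text + text_to_image)


# Toy embeddings: the second call pairs each image with the wrong report.
images = [[1.0, 0.0], [0.0, 1.0]]
matched_reports = [[1.0, 0.0], [0.0, 1.0]]
swapped_reports = [[0.0, 1.0], [1.0, 0.0]]

loss_matched = symmetric_contrastive_loss(images, matched_reports)
loss_swapped = symmetric_contrastive_loss(images, swapped_reports)
```

With these toy inputs the loss for correctly paired embeddings is smaller than the loss for swapped pairs, which is the behaviour a contrastive alignment objective is meant to reward.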

Original language: English
Title of host publication: Medical Image Computing and Computer Assisted Intervention, MICCAI 2025 - 28th International Conference, Proceedings
Editors: James C. Gee, Jaesung Hong, Carole H. Sudre, Polina Golland, Daniel C. Alexander, Juan Eugenio Iglesias, Archana Venkataraman, Jong Hyo Kim
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 111-120
Number of pages: 10
ISBN (Print): 9783032049773
DOIs
Publication status: Published - 2026
Event: 28th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2025 - Daejeon, Korea, Republic of
Duration: 23 Sep 2025 to 27 Sep 2025

Publication series

Name: Lecture Notes in Computer Science
Volume: 15965 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 28th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2025
Country/Territory: Korea, Republic of
City: Daejeon
Period: 23/09/25 to 27/09/25
