跳至主導覽 跳至搜尋 跳過主要內容

NLUS-VQA-VG: Enhancing Interpretability in Domain-Specific Med-VQA for Neonatal Lung Ultrasound Through Visual Grounding

  • Xuming Tong
  • , Yiran Wang
  • , Liyan Liu
  • , Ziyi Wang
  • , Yang Liu
  • , Xiaoyan Li
  • , Hongwei Gui
  • , Xiaoqian Cui
  • , Yingjin Zhao
  • , Lele Tian
  • , Sio Kei Im
  • , Yapeng Wang
  • , Jiangang Chen
  • Macao Polytechnic University
  • Hebei North University
  • East China Normal University

研究成果: Conference contribution同行評審

摘要

Neonatal lung diseases present significant diagnostic challenges during the perinatal period. Lung ultrasound, a safe and bedside imaging technique, has garnered increasing attention, yet its interpretation remains dependent on the operator's expertise. We introduce NLUS-VQA-VG, a domain-specific Med-VQA model designed for neonatal lung ultrasound, which enhances diagnostic accuracy and interpretability through visual grounding methods. A dedicated image-text dataset was developed, incorporating color-coded bounding boxes for visual grounding annotations, and a three-stage reasoning framework was implemented to ensure precise image-text alignment. Built on the Qwen2.5-VL-7B foundation, the model is optimized via efficient parameter fine-tuning using LoRA. Qualitative and quantitative evaluations demonstrate that NLUS-VQA-VG outperforms baseline models in sign recognition, localization accuracy, and reduction of hallucinations. This study addresses a critical gap in the Med-VQA domain for neonatal lung ultrasound, aiding clinical decision-making and advancing precision medicine in neonatology.

原文English
主出版物標題2025 11th International Conference on Computer and Communications, ICCC 2025
發行者Institute of Electrical and Electronics Engineers Inc.
頁面1261-1265
頁數5
ISBN(電子)9798331545581
DOIs
出版狀態Published - 2025
事件2025 11th International Conference on Computer and Communications, ICCC 2025 - Chengdu, China
持續時間: 12 12月 202515 12月 2025

Conference

Conference2025 11th International Conference on Computer and Communications, ICCC 2025
國家/地區China
城市Chengdu
期間12/12/2515/12/25

指紋

深入研究「NLUS-VQA-VG: Enhancing Interpretability in Domain-Specific Med-VQA for Neonatal Lung Ultrasound Through Visual Grounding」主題。共同形成了獨特的指紋。

引用此