
NLUS-VQA-VG: Enhancing Interpretability in Domain-Specific Med-VQA for Neonatal Lung Ultrasound Through Visual Grounding

  • Xuming Tong
  • Yiran Wang
  • Liyan Liu
  • Ziyi Wang
  • Yang Liu
  • Xiaoyan Li
  • Hongwei Gui
  • Xiaoqian Cui
  • Yingjin Zhao
  • Lele Tian
  • Sio Kei Im
  • Yapeng Wang
  • Jiangang Chen

  • Macao Polytechnic University
  • Hebei North University
  • East China Normal University

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-review

Abstract

Neonatal lung diseases present significant diagnostic challenges during the perinatal period. Lung ultrasound, a safe bedside imaging technique, has garnered increasing attention, yet its interpretation remains dependent on the operator's expertise. We introduce NLUS-VQA-VG, a domain-specific Med-VQA model for neonatal lung ultrasound that improves diagnostic accuracy and interpretability through visual grounding. A dedicated image-text dataset was developed, with color-coded bounding boxes as visual grounding annotations, and a three-stage reasoning framework was implemented to ensure precise image-text alignment. The model is built on the Qwen2.5-VL-7B foundation model and optimized via parameter-efficient fine-tuning with LoRA. Qualitative and quantitative evaluations show that NLUS-VQA-VG outperforms baseline models in sign recognition and localization accuracy and produces fewer hallucinations. This study addresses a critical gap in Med-VQA for neonatal lung ultrasound, supporting clinical decision-making and advancing precision medicine in neonatology.
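As a rough illustration of why LoRA counts as parameter-efficient, the sketch below applies the standard low-rank update W' = W + (alpha / r) · B · A to a small matrix and compares trainable-parameter counts. It is not the authors' training code: the function names, the 4096 × 4096 layer size, and the rank and alpha values are hypothetical and are not taken from the paper or from Qwen2.5-VL-7B.

```python
def lora_param_counts(d_out, d_in, rank):
    """Return (full, lora) trainable-parameter counts for one weight matrix.

    Full fine-tuning trains every entry of the d_out x d_in matrix.
    LoRA trains only B (d_out x rank) and A (rank x d_in).
    """
    full = d_out * d_in
    lora = rank * (d_out + d_in)
    return full, lora


def matmul(a, b):
    """Multiply two matrices given as nested lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def lora_effective_weight(w, b, a, alpha, rank):
    """Compute W' = W + (alpha / rank) * (B @ A) with W kept frozen."""
    scale = alpha / rank
    delta = matmul(b, a)
    return [[w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
            for w_row, d_row in zip(w, delta)]


# Hypothetical layer size and rank, chosen only for illustration.
full, lora = lora_param_counts(4096, 4096, 8)
print(full, lora)  # 16777216 65536

# Tiny worked example: 2 x 2 frozen weight with a rank-1 update.
w = [[1.0, 0.0], [0.0, 1.0]]
b = [[1.0], [2.0]]   # d_out x rank
a = [[0.5, 0.0]]     # rank x d_in
print(lora_effective_weight(w, b, a, alpha=2, rank=1))
# [[2.0, 0.0], [2.0, 1.0]]
```

With these illustrative numbers, the low-rank adapter trains 65,536 values in place of 16,777,216 for the full matrix, while the frozen weight W is left unchanged.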

Original language: English
Title of host publication: 2025 11th International Conference on Computer and Communications, ICCC 2025
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 1261-1265
Number of pages: 5
ISBN (Electronic): 9798331545581
Publication status: Published - 2025
Event: 2025 11th International Conference on Computer and Communications, ICCC 2025 - Chengdu, China
Duration: 12 Dec 2025 to 15 Dec 2025

Conference

Conference: 2025 11th International Conference on Computer and Communications, ICCC 2025
Country/Territory: China
City: Chengdu
Period: 12/12/25 to 15/12/25

Keywords

  • Clinical Decision-Making
  • Med-VQA
  • Neonatal Lung Ultrasound
  • Precision Medicine
  • Visual Grounding
