Abstract
Neonatal lung diseases present significant diagnostic challenges during the perinatal period. Lung ultrasound, a safe bedside imaging technique, has garnered increasing attention, yet its interpretation remains dependent on operator expertise. We introduce NLUS-VQA-VG, a domain-specific Med-VQA model for neonatal lung ultrasound that improves diagnostic accuracy and interpretability through visual grounding. We developed a dedicated image-text dataset with color-coded bounding boxes as visual grounding annotations and implemented a three-stage reasoning framework to ensure precise image-text alignment. The model is built on the Qwen2.5-VL-7B foundation model and adapted with parameter-efficient fine-tuning using LoRA. Qualitative and quantitative evaluations show that NLUS-VQA-VG outperforms baseline models in sign recognition and localization accuracy while reducing hallucinations. This study addresses a critical gap in the Med-VQA domain for neonatal lung ultrasound, aiding clinical decision-making and advancing precision medicine in neonatology.
| Original language | English |
|---|---|
| Title of host publication | 2025 11th International Conference on Computer and Communications, ICCC 2025 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 1261-1265 |
| Number of pages | 5 |
| ISBN (Electronic) | 9798331545581 |
| DOIs | |
| Publication status | Published - 2025 |
| Event | 2025 11th International Conference on Computer and Communications, ICCC 2025 - Chengdu, China. Duration: 12 Dec 2025 → 15 Dec 2025 |
Conference
| Conference | 2025 11th International Conference on Computer and Communications, ICCC 2025 |
|---|---|
| Country/Territory | China |
| City | Chengdu |
| Period | 12/12/25 → 15/12/25 |