跳至主導覽 跳至搜尋 跳過主要內容

Region-Based Text-Consistent Augmentation for Multimodal Medical Segmentation

  • Kunyan Cai
  • , Chenggang Yan
  • , Min He
  • , Liangqiong Qu
  • , Shuai Wang
  • , Tao Tan

研究成果: Conference contribution同行評審

摘要

Medical image segmentation is crucial for various clinical applications, and deep learning has significantly advanced this field. To further enhance performance, recent research explores multimodal data integration, combining medical images and textual reports. However, a critical challenge lies in image data augmentation for multimodal medical data, specifically in maintaining text-image consistency. Traditional augmentation techniques, designed for unimodal images, can introduce mismatches between augmented images and text, hindering effective multimodal learning. To address this, we introduce Region-Based Text-Consistent Augmentation (RBTCA), a novel framework for coherent multimodal augmentation. Our approach performs region-based image augmentation by first identifying image regions described in associated text reports and then extracting textual cues grounded in these regions. These cues are integrated into the image, and augmentation is subsequently performed on this modality-aware representation, ensuring inherent text-cue consistency. Notably, the RBTCA’s plug-and-play design allows for straightforward integration into existing medical image analysis pipelines, enhancing its practical utility. We demonstrate the efficacy of our framework on the QaTa-Covid19 and our in-house Lung Tumor CT Segmentation (LTCT) datasets, achieving substantial gains, with a Dice coefficient improvement of up to 7.24% when integrated into baseline segmentation models. Our code will be released on https://github.com/KunyanCAI/RBTCA.

原文English
主出版物標題Medical Image Computing and Computer Assisted Intervention , MICCAI 2025 - 28th International Conference, 2025, Proceedings
編輯James C. Gee, Jaesung Hong, Carole H. Sudre, Polina Golland, Daniel C. Alexander, Juan Eugenio Iglesias, Archana Venkataraman, Jong Hyo Kim
發行者Springer Science and Business Media Deutschland GmbH
頁面533-543
頁數11
ISBN(列印)9783032049469
DOIs
出版狀態Published - 2026
事件28th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2025 - Daejeon, Korea, Republic of
持續時間: 23 9月 202527 9月 2025

出版系列

名字Lecture Notes in Computer Science
15962 LNCS
ISSN(列印)0302-9743
ISSN(電子)1611-3349

Conference

Conference28th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2025
國家/地區Korea, Republic of
城市Daejeon
期間23/09/2527/09/25

指紋

深入研究「Region-Based Text-Consistent Augmentation for Multimodal Medical Segmentation」主題。共同形成了獨特的指紋。

引用此