跳至主導覽 跳至搜尋 跳過主要內容

RST-UNet: Medical Image Segmentation Transformer Effectively Combining Superpixel

  • Songhang Ye
  • , Zhoule Feng
  • , Guoheng Huang
  • , Jinghong Ke
  • , Xuhang Chen
  • , Chi Man Pun
  • , Guo Zhong
  • , Xiaochen Yuan

研究成果: Conference contribution同行評審

摘要

Medical image segmentation has advanced with models like UCTransNet, TransUNet, and TransClaw U-Net, which integrate Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs). However, these models face limitations due to the locality of convolutions and the computational demands of Transformers. To overcome these challenges, we introduce RST-UNet, an innovative encoder-decoder network that balances effectiveness with computational efficiency. RST-UNet features two groundbreaking innovations: the Compact Representation Block (CRB) and the Compact Dependency Modeling Block (CDMB). The CRB utilizes superpixel pooling to capture long-range dependencies while minimizing parameters and computation time. The CDMB integrates superpixel unpooling with attention mechanisms and Rotary Position Embedding (RoPE) to enhance long-range dependency modeling. This approach emphasizes critical regions and leverages RoPE to capture extensive image dependencies effectively. Our experimental results on publicly available synapse datasets highlight RST-UNet’s exceptional performance, particularly in segmenting small organs such as the gallbladder, right kidney, and pancreas. Remarkably, RST-UNet achieves superior results without pre-training, showcasing its high adaptability for diverse medical image segmentation tasks. This work represents a significant advancement in developing efficient and effective algorithms for medical image analysis.

原文English
主出版物標題Neural Information Processing - 31st International Conference, ICONIP 2024, Proceedings
編輯Mufti Mahmud, Maryam Doborjeh, Kevin Wong, Andrew Chi Sing Leung, Zohreh Doborjeh, M. Tanveer
發行者Springer Science and Business Media Deutschland GmbH
頁面336-350
頁數15
ISBN(列印)9789819669684
DOIs
出版狀態Published - 2025
事件31st International Conference on Neural Information Processing, ICONIP 2024 - Auckland, New Zealand
持續時間: 2 12月 20246 12月 2024

出版系列

名字Communications in Computer and Information Science
2289 CCIS
ISSN(列印)1865-0929
ISSN(電子)1865-0937

Conference

Conference31st International Conference on Neural Information Processing, ICONIP 2024
國家/地區New Zealand
城市Auckland
期間2/12/246/12/24

指紋

深入研究「RST-UNet: Medical Image Segmentation Transformer Effectively Combining Superpixel」主題。共同形成了獨特的指紋。

引用此