跳至主導覽 跳至搜尋 跳過主要內容

BioVFM-21M: Benchmarking and Scaling Self-supervised Vision Foundation Models for Biomedical Image Analysis

  • Jiarun Liu
  • , Hong Yu Zhou
  • , Weijian Huang
  • , Hao Yang
  • , Dongning Song
  • , Tao Tan
  • , Yong Liang
  • , Shanshan Wang

研究成果: Conference contribution同行評審

摘要

Scaling up model and data size have demonstrated impressive improvement over a wide range of tasks. Despite extensive studies on scaling behaviors for general-purpose tasks, medical images exhibit substantial differences from natural data. It remains unclear the key factors in developing medical vision foundation models at scale. In this paper, we explored the scaling behavior across model sizes, training algorithms, data sizes, and imaging modalities in developing scalable medical vision foundation models by self-supervised learning. To support scalable pretraining, we introduce BioVFM-21M, a large-scale biomedical image dataset encompassing a wide range of biomedical image modalities and anatomies. We observed that scaling up does provide benefits but varies across tasks. Additional analysis reveals several factors correlated with scaling benefits. Finally, we propose BioVFM, a large-scale medical vision foundation model pretrained on 21 million biomedical images, which outperforms the previous state-of-the-art foundation models across 12 medical benchmarks. Our results highlight that while scaling up is beneficial for pursuing better performance, task characteristics, data diversity, pretraining methods, and computational efficiency remain critical considerations for developing scalable medical foundation models. We will open the dataset, model, and algorithms of this study at GitHub.

原文English
主出版物標題Foundation Models for General Medical AI - 3rd International Workshop, MedAGI 2025, Held in Conjunction with MICCAI 2025, Proceedings
編輯Won-Ki Jeong, Hyunwoo J. Kim, Zhongying Deng, Yiqing Shen, Angelica I Aviles-Rivero, Shaoting Zhang
發行者Springer Science and Business Media Deutschland GmbH
頁面23-33
頁數11
ISBN(列印)9783032078445
DOIs
出版狀態Published - 2026
事件3rd International Workshop on Foundation Models for Medical Artificial General Intelligence, MedAGI 2025, Held in Conjunction with the 28th International conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2025 - Daejeon, Korea, Republic of
持續時間: 27 9月 202527 9月 2025

出版系列

名字Lecture Notes in Computer Science
16112 LNCS
ISSN(列印)0302-9743
ISSN(電子)1611-3349

Conference

Conference3rd International Workshop on Foundation Models for Medical Artificial General Intelligence, MedAGI 2025, Held in Conjunction with the 28th International conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2025
國家/地區Korea, Republic of
城市Daejeon
期間27/09/2527/09/25

指紋

深入研究「BioVFM-21M: Benchmarking and Scaling Self-supervised Vision Foundation Models for Biomedical Image Analysis」主題。共同形成了獨特的指紋。

引用此