Skip to main navigation Skip to search Skip to main content

Adaptive multi-teacher knowledge distillation framework with foundation models for medical image analysis

  • Dudu Liu
  • , Yuan Gao
  • , Ningyi Zhang
  • , Xin Wang
  • , Tianyu Zhang
  • , Ming Fan
  • , Yue Sun
  • , Shuo Li
  • , Tao Tan
  • Macao Polytechnic University
  • Netherlands Cancer Institute
  • Radboud University Nijmegen
  • Hangzhou Dianzi University
  • Case Western Reserve University

Research output: Contribution to journalArticlepeer-review

Abstract

Foundation models (FMs) in medical imaging are increasingly specialized across vertical domains, yet their substantial computational demands and large parameter scales hinder deployment on resource-limited edge devices. Retraining these models for new tasks requires scarce high-quality data and significant computational resources, while cross-model knowledge transfer often introduces notable information loss. To address model coordination, capability migration, knowledge preservation, and practical edge deployment, we introduce MultiMedDistill, an adaptive multi-teacher distillation framework that integrates multiple heterogeneous FMs into a single lightweight student model. A dual-level gating mechanism enables dynamic teacher coordination, and a return decoder preserves semantic fidelity during feature projection. Across six benchmark datasets spanning ultrasound, endoscopy, fundus imaging, CT, and MRI, MultiMedDistill achieves 94.77% and 97.06% Dice on BUSI and Kvasir-SEG — improvements of 25.76% and 13.04% over baselines — while compressing the student model to 8.8M parameters (18× reduction). Ablation studies show that adaptive gating and reconstruction-based knowledge preservation contribute gains of 3.2% and 1.4%, respectively. These results demonstrate the framework’s effectiveness in transferring FM capabilities with minimal computational cost, enabling practical deployment on clinical edge devices.

Original languageEnglish
Article number102739
JournalComputerized Medical Imaging and Graphics
Volume130
DOIs
Publication statusPublished - Apr 2026

Keywords

  • Adaptive gating mechanism
  • Deep learning
  • Foundation models
  • Medical image analysis
  • Model compression
  • Multi-teacher knowledge distillation

Fingerprint

Dive into the research topics of 'Adaptive multi-teacher knowledge distillation framework with foundation models for medical image analysis'. Together they form a unique fingerprint.

Cite this