跳至主導覽 跳至搜尋 跳過主要內容

ClinCoCoOp: An Interpretable Prompt Learning Framework with Clinical Concept Guidance for Context Optimization

研究成果: Conference contribution同行評審

摘要

Large Vision-Language Models (VLMs) demonstrate significant potential in representation learning and exhibit strong performance across diverse downstream tasks. Soft prompt learning has emerged as an effective technique for adapting VLMs like CLIP to image classification. However, prevailing prompt learning methods typically generate non-interpretable text tokens, failing to satisfy the stringent interpretability requirements of eXplainable AI (XAI) in high-risk domains such as healthcare. To address this limitation, we introduce a novel interpretable prompt learning framework. Our approach enhances interpretability by incorporating clinical concepts and aligns image semantics with learnable prompts at multiple granularities. Departing from existing methods that apply uniform clinical concept weights across all prompts, we propose two key modules: (1) a Soft-Prompt Clinical Concept Alignment module, which computes image-concept similarity scores to weight clinical concepts before aligning them with the soft prompt (a set of learnable vectors), and (2) a Global-Local Image Soft-Prompt Alignment module, which processes local image regions by incorporating positional encodings and calculating significance weights, complementing the global alignment. Extensive experiments on three medical image datasets (Derm7pt, ISIC2018, Pneumonia) demonstrate the superior classification performance of our method name Clinical Concept CoOp(ClinCoCoOp). Notably, ClinCoCoOp also achieves outstanding zero-shot transfer results on the MED-NODE and ISIC2019 datasets.

原文English
主出版物標題Pattern Recognition and Computer Vision - 8th Chinese Conference, PRCV 2025, Proceedings
編輯Josef Kittler, Hongkai Xiong, Jian Yang, Xilin Chen, Jiwen Lu, Weiyao Lin, Jingyi Yu, Weishi Zheng
發行者Springer Science and Business Media Deutschland GmbH
頁面106-119
頁數14
ISBN(列印)9789819556786
DOIs
出版狀態Published - 2026
事件8th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2025 - Shanghai, China
持續時間: 15 10月 202518 10月 2025

出版系列

名字Lecture Notes in Computer Science
16277 LNCS
ISSN(列印)0302-9743
ISSN(電子)1611-3349

Conference

Conference8th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2025
國家/地區China
城市Shanghai
期間15/10/2518/10/25

指紋

深入研究「ClinCoCoOp: An Interpretable Prompt Learning Framework with Clinical Concept Guidance for Context Optimization」主題。共同形成了獨特的指紋。

引用此