FaceShield: Explainable Face Anti-Spoofing with Multimodal Large Language Models

  • Hongyang Wang
  • Yichen Shi
  • Zhuofu Tao
  • Yuhao Gao
  • Liepiao Zhang
  • Xun Lin
  • Jun Feng
  • Xiaochen Yuan
  • Zitong Yu
  • Xiaochun Cao
  • Shijiazhuang Tiedao University
  • Shijiazhuang Key Laboratory of Artificial Intelligence
  • Shanghai Jiao Tong University
  • Eastern Institute of Technology
  • University of California at Los Angeles
  • GRGBanking
  • Great Bay University
  • Shenzhen University
  • Dongguan Key Laboratory for Intelligence and Information Technology
  • Sun Yat-Sen University

Research output: Conference article, peer-reviewed

Abstract

Face anti-spoofing (FAS) is crucial for protecting facial recognition systems from presentation attacks. Previous methods approached this task as a classification problem and therefore lacked interpretability and reasoning behind the predicted results. Recently, multimodal large language models (MLLMs) have shown strong capabilities in perception, reasoning, and decision-making in visual tasks. However, there is currently no universal and comprehensive MLLM or dataset specifically designed for the FAS task. To address this gap, we propose FaceShield, an MLLM for FAS, along with the corresponding pre-training and supervised fine-tuning (SFT) datasets, FaceShield-pre10K and FaceShield-sft45K. FaceShield is capable of determining the authenticity of faces, identifying types of spoofing attacks, providing reasoning for its judgments, and detecting attack areas. Specifically, we employ spoof-aware vision perception (SAVP), which incorporates both the original image and auxiliary information based on prior knowledge. We then use a prompt-guided vision token masking (PVTM) strategy to randomly mask vision tokens, thereby improving the model’s generalization ability. We conducted extensive experiments on three benchmark datasets, demonstrating that FaceShield significantly outperforms previous deep learning models and general MLLMs on four FAS tasks, i.e., coarse-grained classification, fine-grained classification, reasoning, and attack localization.

Original language: English
Pages (from-to): 9811-9819
Number of pages: 9
Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Volume: 40
Issue number: 12
DOIs
Publication status: Published - 2026
Event: 40th AAAI Conference on Artificial Intelligence, AAAI 2026 - Singapore, Singapore
Duration: 20 Jan 2026 to 27 Jan 2026
