Ifme: Influence function based model explanation for black box decision systems

Benbo Zha, Hong Shen

研究成果: Conference article同行評審

摘要

Due to the high precision and the huge prediction needs, machine learning models based decision systems has been widely adopted in all works of life. They were usually constructed as a black box based on sophisticated, opaque learning models. The lack of human understandable explanation to the inner logic of these models and the reason behind the predictions from such systems causes a serious trust issue. Interpretable Machine Learning methods can be used to relieve this problem by providing an explanation for the models or predictions. In this work, we focus on the model explanation problem, which study how to explain the black box prediction model globally through human understandable explanation. We propose the Influence Function based Model Explanation (IFME) method to provide interpretable model explanation based on key training points selected through influence function. First, our method introduces a novel local prediction interpreter, which also utilizes the key training points for local prediction. Then it finds the key training points to the learning models via influence function globally. Finally, we provide the influence function based model agnostic explanation to the model used. We also show the efficiency of our method through both theoretical analysis and simulated experiments.

原文English
頁(從 - 到)299-310
頁數12
期刊Communications in Computer and Information Science
1163
DOIs
出版狀態Published - 2020
對外發佈
事件10th International Symposium on Parallel Architectures, Algorithms and Programming, PAAP 2019 - Guangzhou, China
持續時間: 12 12月 201914 12月 2019

指紋

深入研究「Ifme: Influence function based model explanation for black box decision systems」主題。共同形成了獨特的指紋。

引用此