Authors
Yi-Shan Lin, Wen-Chuan Lee, Z. Berkay Celik
Publication date
2021/6/22
Conference
ACM SIGKDD Conference on Knowledge Discovery & Data Mining (KDD)
Description
EXplainable AI (XAI) methods have been proposed to interpret how a deep neural network predicts inputs through model saliency explanations that highlight the input parts deemed important to arrive at a decision for a specific target. However, it remains challenging to quantify the correctness of their interpretability as current evaluation approaches either require subjective input from humans or incur high computation cost with automated evaluation. In this paper, we propose backdoor trigger patterns--hidden malicious functionalities that cause misclassification--to automate the evaluation of saliency explanations. Our key observation is that triggers provide ground truth for inputs to evaluate whether the regions identified by an XAI method are truly relevant to its output. Since backdoor triggers are the most important features that cause deliberate misclassification, a robust XAI method should reveal their presence at …
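A minimal sketch of the evaluation idea from the abstract: because the backdoor trigger's location in a poisoned input is known, a saliency map can be scored by how much its top-importance region overlaps the trigger mask. This is illustrative only, not the paper's exact metric; the function name trigger_recovery_iou, the IoU criterion, and all array shapes are assumptions.

```python
import numpy as np

def trigger_recovery_iou(saliency, trigger_mask, k=None):
    """Score how well a saliency map localizes a known backdoor trigger.

    saliency: 2-D array of per-pixel importance from an XAI method.
    trigger_mask: boolean 2-D array, True where the trigger was stamped.
    k: number of top-saliency pixels to keep; defaults to the trigger size.
    (Hypothetical helper for illustration, not the paper's metric.)
    """
    if k is None:
        k = int(trigger_mask.sum())
    flat = saliency.ravel()
    # Keep the k pixels the explanation deems most important.
    top_idx = np.argpartition(flat, -k)[-k:]
    top_mask = np.zeros(flat.shape, dtype=bool)
    top_mask[top_idx] = True
    top_mask = top_mask.reshape(saliency.shape)
    # IoU between the highlighted region and the ground-truth trigger.
    inter = np.logical_and(top_mask, trigger_mask).sum()
    union = np.logical_or(top_mask, trigger_mask).sum()
    return inter / union  # 1.0 = perfect trigger localization

# Example: a 4x4 trigger patch in the corner of a 32x32 input.
rng = np.random.default_rng(0)
sal = rng.random((32, 32))
mask = np.zeros((32, 32), dtype=bool)
mask[:4, :4] = True
sal[:4, :4] += 1.0  # an explanation that (mostly) finds the trigger
print(trigger_recovery_iou(sal, mask))
```

Under this framing, a robust XAI method should yield a high overlap score on triggered inputs, which gives an automated, human-free correctness check.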
Total citations
[Citations-per-year histogram, 2020–2024]