Explainable ai: A review of machine learning interpretability methods

P Linardatos, V Papastefanopoulos, S Kotsiantis - Entropy, 2020 - mdpi.com
Recent advances in artificial intelligence (AI) have led to its widespread industrial adoption,
with machine learning systems demonstrating superhuman performance in a significant …

Post-hoc interpretability for neural nlp: A survey

A Madsen, S Reddy, S Chandar - ACM Computing Surveys, 2022 - dl.acm.org
Neural networks for NLP are becoming increasingly complex and widespread, and there is a
growing concern if these models are responsible to use. Explaining models helps to address …

Complexity-based prompting for multi-step reasoning

Y Fu, H Peng, A Sabharwal, P Clark… - … Conference on Learning …, 2022 - openreview.net
We study the task of prompting large-scale language models to perform multi-step
reasoning. Existing work shows that when prompted with a chain of thoughts (CoT) …

Model cards for model reporting

M Mitchell, S Wu, A Zaldivar, P Barnes… - Proceedings of the …, 2019 - dl.acm.org
Trained machine learning models are increasingly used to perform high-impact tasks in
areas such as law enforcement, medicine, education, and employment. In order to clarify the …

Techniques for interpretable machine learning

M Du, N Liu, X Hu - Communications of the ACM, 2019 - dl.acm.org
Techniques for interpretable machine learning Page 1 68 COMMUNICATIONS OF THE
ACM | JANUARY 2020 | VOL. 63 | NO. 1 review articles MACHINE LEARNING IS …

Semantics-aware BERT for language understanding

Z Zhang, Y Wu, H Zhao, Z Li, S Zhang, X Zhou… - Proceedings of the …, 2020 - ojs.aaai.org
The latest work on language representations carefully integrates contextualized features into
language model training, which enables a series of success especially in various machine …

Analysis methods in neural language processing: A survey

Y Belinkov, J Glass - … of the Association for Computational Linguistics, 2019 - direct.mit.edu
The field of natural language processing has seen impressive progress in recent years, with
neural network models replacing many of the traditional systems. A plethora of new models …

Explaining the black-box model: A survey of local interpretation methods for deep neural networks

Y Liang, S Li, C Yan, M Li, C Jiang - Neurocomputing, 2021 - Elsevier
Recently, a significant amount of research has been investigated on interpretation of deep
neural networks (DNNs) which are normally processed as black box models. Among the …

SG-Net: Syntax-guided machine reading comprehension

Z Zhang, Y Wu, J Zhou, S Duan, H Zhao… - Proceedings of the AAAI …, 2020 - aaai.org
For machine reading comprehension, the capacity of effectively modeling the linguistic
knowledge from the detail-riddled and lengthy passages and getting ride of the noises is …

Counterfactual fairness in text classification through robustness

S Garg, V Perot, N Limtiaco, A Taly, EH Chi… - Proceedings of the 2019 …, 2019 - dl.acm.org
In this paper, we study counterfactual fairness in text classification, which asks the question:
How would the prediction change if the sensitive attribute referenced in the example were …