Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI

AB Arrieta, N Díaz-Rodríguez, J Del Ser, A Bennetot… - Information fusion, 2020 - Elsevier
In the last few years, Artificial Intelligence (AI) has achieved a notable momentum that, if
harnessed appropriately, may deliver the best of expectations over many application sectors …

[HTML][HTML] Analyzing biological and artificial neural networks: challenges with opportunities for synergy?

DGT Barrett, AS Morcos, JH Macke - Current opinion in neurobiology, 2019 - Elsevier
Highlights•Artificial and biological neural networks can be analyzed using similar
methods.•Neural analysis has revealed similarities between the representations in artificial …

React: Out-of-distribution detection with rectified activations

Y Sun, C Guo, Y Li - Advances in Neural Information …, 2021 - proceedings.neurips.cc
Abstract Out-of-distribution (OOD) detection has received much attention lately due to its
practical importance in enhancing the safe deployment of neural networks. One of the …

Ties-merging: Resolving interference when merging models

P Yadav, D Tam, L Choshen… - Advances in Neural …, 2024 - proceedings.neurips.cc
Transfer learning–ie, further fine-tuning a pre-trained model on a downstream task–can
confer significant advantages, including improved downstream performance, faster …

A toy model of universality: Reverse engineering how networks learn group operations

B Chughtai, L Chan, N Nanda - International Conference on …, 2023 - proceedings.mlr.press
Universality is a key hypothesis in mechanistic interpretability–that different models learn
similar features and circuits when trained on similar tasks. In this work, we study the …

Toward transparent ai: A survey on interpreting the inner structures of deep neural networks

T Räuker, A Ho, S Casper… - 2023 ieee conference …, 2023 - ieeexplore.ieee.org
The last decade of machine learning has seen drastic increases in scale and capabilities.
Deep neural networks (DNNs) are increasingly being deployed in the real world. However …

Rapid learning or feature reuse? towards understanding the effectiveness of maml

A Raghu, M Raghu, S Bengio, O Vinyals - arXiv preprint arXiv:1909.09157, 2019 - arxiv.org
An important research direction in machine learning has centered around developing meta-
learning algorithms to tackle few-shot learning. An especially successful algorithm has been …

Similarity of neural network representations revisited

S Kornblith, M Norouzi, H Lee… - … conference on machine …, 2019 - proceedings.mlr.press
Recent work has sought to understand the behavior of neural networks by comparing
representations between layers and between different trained models. We examine methods …

The quantization model of neural scaling

E Michaud, Z Liu, U Girit… - Advances in Neural …, 2024 - proceedings.neurips.cc
Abstract We propose the Quantization Model of neural scaling laws, explaining both the
observed power law dropoff of loss with model and data size, and also the sudden …

Explainability methods for graph convolutional neural networks

PE Pope, S Kolouri, M Rostami… - Proceedings of the …, 2019 - openaccess.thecvf.com
With the growing use of graph convolutional neural networks (GCNNs) comes the need for
explainability. In this paper, we introduce explainability methods for GCNNs. We develop the …