Convergent learning: Do different neural networks learn the same representations?

Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI

AB Arrieta, N Díaz-Rodríguez, J Del Ser, A Bennetot… - Information fusion, 2020 - Elsevier

In the last few years, Artificial Intelligence (AI) has achieved a notable momentum that, if
harnessed appropriately, may deliver the best of expectations over many application sectors …

被引用次数：7220 相关文章所有 21 个版本

[HTML] sciencedirect.com

[HTML][HTML] Analyzing biological and artificial neural networks: challenges with opportunities for synergy?

DGT Barrett, AS Morcos, JH Macke - Current opinion in neurobiology, 2019 - Elsevier

Highlights•Artificial and biological neural networks can be analyzed using similar
methods.•Neural analysis has revealed similarities between the representations in artificial …

被引用次数：127 相关文章所有 13 个版本

[PDF] neurips.cc

React: Out-of-distribution detection with rectified activations

Y Sun, C Guo, Y Li - Advances in Neural Information …, 2021 - proceedings.neurips.cc

Abstract Out-of-distribution (OOD) detection has received much attention lately due to its
practical importance in enhancing the safe deployment of neural networks. One of the …

被引用次数：353 相关文章所有 8 个版本

[PDF] neurips.cc

Ties-merging: Resolving interference when merging models

P Yadav, D Tam, L Choshen… - Advances in Neural …, 2024 - proceedings.neurips.cc

Transfer learning–ie, further fine-tuning a pre-trained model on a downstream task–can
confer significant advantages, including improved downstream performance, faster …

被引用次数：71 相关文章所有 7 个版本

[PDF] mlr.press

A toy model of universality: Reverse engineering how networks learn group operations

B Chughtai, L Chan, N Nanda - International Conference on …, 2023 - proceedings.mlr.press

Universality is a key hypothesis in mechanistic interpretability–that different models learn
similar features and circuits when trained on similar tasks. In this work, we study the …

被引用次数：57 相关文章所有 6 个版本

[PDF] arxiv.org

Toward transparent ai: A survey on interpreting the inner structures of deep neural networks

T Räuker, A Ho, S Casper… - 2023 ieee conference …, 2023 - ieeexplore.ieee.org

The last decade of machine learning has seen drastic increases in scale and capabilities.
Deep neural networks (DNNs) are increasingly being deployed in the real world. However …

被引用次数：122 相关文章所有 5 个版本

[PDF] arxiv.org

Rapid learning or feature reuse? towards understanding the effectiveness of maml

A Raghu, M Raghu, S Bengio, O Vinyals - arXiv preprint arXiv:1909.09157, 2019 - arxiv.org

An important research direction in machine learning has centered around developing meta-
learning algorithms to tackle few-shot learning. An especially successful algorithm has been …

被引用次数：691 相关文章所有 11 个版本

[PDF] mlr.press

Similarity of neural network representations revisited

S Kornblith, M Norouzi, H Lee… - … conference on machine …, 2019 - proceedings.mlr.press

Recent work has sought to understand the behavior of neural networks by comparing
representations between layers and between different trained models. We examine methods …

被引用次数：1204 相关文章所有 14 个版本

[PDF] neurips.cc

The quantization model of neural scaling

E Michaud, Z Liu, U Girit… - Advances in Neural …, 2024 - proceedings.neurips.cc

Abstract We propose the Quantization Model of neural scaling laws, explaining both the
observed power law dropoff of loss with model and data size, and also the sudden …

被引用次数：37 相关文章所有 6 个版本

[PDF] thecvf.com

Explainability methods for graph convolutional neural networks

PE Pope, S Kolouri, M Rostami… - Proceedings of the …, 2019 - openaccess.thecvf.com

With the growing use of graph convolutional neural networks (GCNNs) comes the need for
explainability. In this paper, we introduce explainability methods for GCNNs. We develop the …

被引用次数：537 相关文章所有 9 个版本