A review on the attention mechanism of deep learning

Z Niu, G Zhong, H Yu - Neurocomputing, 2021 - Elsevier
Attention has arguably become one of the most important concepts in the deep learning
field. It is inspired by the biological systems of humans that tend to focus on the distinctive …

Post-hoc interpretability for neural nlp: A survey

A Madsen, S Reddy, S Chandar - ACM Computing Surveys, 2022 - dl.acm.org
Neural networks for NLP are becoming increasingly complex and widespread, and there is a
growing concern if these models are responsible to use. Explaining models helps to address …

Prottrans: Toward understanding the language of life through self-supervised learning

A Elnaggar, M Heinzinger, C Dallago… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
Computational biology and bioinformatics provide vast data gold-mines from protein
sequences, ideal for Language Models (LMs) taken from Natural Language Processing …

Quantifying attention flow in transformers

S Abnar, W Zuidema - arXiv preprint arXiv:2005.00928, 2020 - arxiv.org
In the Transformer model," self-attention" combines information from attended embeddings
into the representation of the focal embedding in the next layer. Thus, across layers of the …

Ai alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …

Towards faithfully interpretable NLP systems: How should we define and evaluate faithfulness?

A Jacovi, Y Goldberg - arXiv preprint arXiv:2004.03685, 2020 - arxiv.org
With the growing popularity of deep-learning based NLP models, comes a need for
interpretable systems. But what is interpretability, and what constitutes a high-quality …

Toward transparent ai: A survey on interpreting the inner structures of deep neural networks

T Räuker, A Ho, S Casper… - 2023 ieee conference …, 2023 - ieeexplore.ieee.org
The last decade of machine learning has seen drastic increases in scale and capabilities.
Deep neural networks (DNNs) are increasingly being deployed in the real world. However …

Explainable deep learning: A field guide for the uninitiated

G Ras, N Xie, M Van Gerven, D Doran - Journal of Artificial Intelligence …, 2022 - jair.org
Deep neural networks (DNNs) are an indispensable machine learning tool despite the
difficulty of diagnosing what aspects of a model's input drive its decisions. In countless real …

ERASER: A benchmark to evaluate rationalized NLP models

J DeYoung, S Jain, NF Rajani, E Lehman… - arXiv preprint arXiv …, 2019 - arxiv.org
State-of-the-art models in NLP are now predominantly based on deep neural networks that
are opaque in terms of how they come to make predictions. This limitation has increased …

Attention mechanism in intelligent fault diagnosis of machinery: A review of technique and application

H Lv, J Chen, T Pan, T Zhang, Y Feng, S Liu - Measurement, 2022 - Elsevier
Attention Mechanism has become very popular in the field of mechanical fault diagnosis in
recent years and has become an important technique for scholars to study and apply. The …