A survey of safety and trustworthiness of deep neural networks: Verification, testing, adversarial attack and defence, and interpretability

X Huang, D Kroening, W Ruan, J Sharp, Y Sun… - Computer Science …, 2020 - Elsevier
In the past few years, significant progress has been made on deep neural networks (DNNs)
in achieving human-level performance on several long-standing tasks. With the broader …

Linguistically inspired roadmap for building biologically reliable protein language models

MH Vu, R Akbar, PA Robert, B Swiatczak… - Nature Machine …, 2023 - nature.com
Deep neural-network-based language models (LMs) are increasingly applied to large-scale
protein sequence data to predict protein function. However, being largely black-box models …

Compositionality decomposed: How do neural networks generalise?

D Hupkes, V Dankers, M Mul, E Bruni - Journal of Artificial Intelligence …, 2020 - jair.org
Despite a multitude of empirical studies, little consensus exists on whether neural networks
are able to generalise compositionally, a controversy that, in part, stems from a lack of …

The logical expressiveness of graph neural networks

P Barceló, EV Kostylev, M Monet, J Pérez… - 8th International …, 2020 - hal.science
The ability of graph neural networks (GNNs) for distinguishing nodes in graphs has been
recently characterized in terms of the Weisfeiler-Lehman (WL) test for checking graph …

Learning transformer programs

D Friedman, A Wettig, D Chen - Advances in Neural …, 2024 - proceedings.neurips.cc
Recent research in mechanistic interpretability has attempted to reverse-engineer
Transformer models by carefully inspecting network weights and activations. However, these …

Deepstellar: Model-based quantitative analysis of stateful deep learning systems

X Du, X Xie, Y Li, L Ma, Y Liu, J Zhao - … of the 2019 27th ACM Joint …, 2019 - dl.acm.org
Deep Learning (DL) has achieved tremendous success in many cutting-edge applications.
However, the state-of-the-art DL systems still suffer from quality issues. While some recent …

Neuro-symbolic language modeling with automaton-augmented retrieval

U Alon, F Xu, J He, S Sengupta… - International …, 2022 - proceedings.mlr.press
Retrieval-based language models (R-LM) model the probability of natural language text by
combining a standard language model (LM) with examples retrieved from an external …

Thinking like transformers

G Weiss, Y Goldberg, E Yahav - International Conference on …, 2021 - proceedings.mlr.press
What is the computational model behind a Transformer? Where recurrent neural networks
have direct parallels in finite state machines, allowing clear discussion and thought around …

Sok: Explainable machine learning for computer security applications

A Nadeem, D Vos, C Cao, L Pajola… - 2023 IEEE 8th …, 2023 - ieeexplore.ieee.org
Explainable Artificial Intelligence (XAI) aims to improve the transparency of machine
learning (ML) pipelines. We systematize the increasingly growing (but fragmented) …

Interpretability illusions in the generalization of simplified models

D Friedman, AK Lampinen, L Dixon… - … on Machine Learning, 2023 - openreview.net
A common method to study deep learning systems is to use simplified model
representations—for example, using singular value decomposition to visualize the model's …