A survey of safety and trustworthiness of deep neural networks: Verification, testing, adversarial attack and defence, and interpretability
In the past few years, significant progress has been made on deep neural networks (DNNs)
in achieving human-level performance on several long-standing tasks. With the broader …
in achieving human-level performance on several long-standing tasks. With the broader …
Linguistically inspired roadmap for building biologically reliable protein language models
Deep neural-network-based language models (LMs) are increasingly applied to large-scale
protein sequence data to predict protein function. However, being largely black-box models …
protein sequence data to predict protein function. However, being largely black-box models …
Compositionality decomposed: How do neural networks generalise?
Despite a multitude of empirical studies, little consensus exists on whether neural networks
are able to generalise compositionally, a controversy that, in part, stems from a lack of …
are able to generalise compositionally, a controversy that, in part, stems from a lack of …
The logical expressiveness of graph neural networks
The ability of graph neural networks (GNNs) for distinguishing nodes in graphs has been
recently characterized in terms of the Weisfeiler-Lehman (WL) test for checking graph …
recently characterized in terms of the Weisfeiler-Lehman (WL) test for checking graph …
Learning transformer programs
Recent research in mechanistic interpretability has attempted to reverse-engineer
Transformer models by carefully inspecting network weights and activations. However, these …
Transformer models by carefully inspecting network weights and activations. However, these …
Deepstellar: Model-based quantitative analysis of stateful deep learning systems
Deep Learning (DL) has achieved tremendous success in many cutting-edge applications.
However, the state-of-the-art DL systems still suffer from quality issues. While some recent …
However, the state-of-the-art DL systems still suffer from quality issues. While some recent …
Neuro-symbolic language modeling with automaton-augmented retrieval
Retrieval-based language models (R-LM) model the probability of natural language text by
combining a standard language model (LM) with examples retrieved from an external …
combining a standard language model (LM) with examples retrieved from an external …
Thinking like transformers
What is the computational model behind a Transformer? Where recurrent neural networks
have direct parallels in finite state machines, allowing clear discussion and thought around …
have direct parallels in finite state machines, allowing clear discussion and thought around …
Sok: Explainable machine learning for computer security applications
Explainable Artificial Intelligence (XAI) aims to improve the transparency of machine
learning (ML) pipelines. We systematize the increasingly growing (but fragmented) …
learning (ML) pipelines. We systematize the increasingly growing (but fragmented) …
Interpretability illusions in the generalization of simplified models
A common method to study deep learning systems is to use simplified model
representations—for example, using singular value decomposition to visualize the model's …
representations—for example, using singular value decomposition to visualize the model's …