Extracting automata from recurrent neural networks using queries and counterexamples

A survey of safety and trustworthiness of deep neural networks: Verification, testing, adversarial attack and defence, and interpretability

X Huang, D Kroening, W Ruan, J Sharp, Y Sun… - Computer Science …, 2020 - Elsevier

In the past few years, significant progress has been made on deep neural networks (DNNs)
in achieving human-level performance on several long-standing tasks. With the broader …

被引用次数：531 相关文章所有 12 个版本

[PDF] arxiv.org

Linguistically inspired roadmap for building biologically reliable protein language models

MH Vu, R Akbar, PA Robert, B Swiatczak… - Nature Machine …, 2023 - nature.com

Deep neural-network-based language models (LMs) are increasingly applied to large-scale
protein sequence data to predict protein function. However, being largely black-box models …

被引用次数：29 相关文章所有 4 个版本

[PDF] jair.org

Compositionality decomposed: How do neural networks generalise?

D Hupkes, V Dankers, M Mul, E Bruni - Journal of Artificial Intelligence …, 2020 - jair.org

Despite a multitude of empirical studies, little consensus exists on whether neural networks
are able to generalise compositionally, a controversy that, in part, stems from a lack of …

被引用次数：314 相关文章所有 10 个版本

[PDF] hal.science

The logical expressiveness of graph neural networks

P Barceló, EV Kostylev, M Monet, J Pérez… - 8th International …, 2020 - hal.science

The ability of graph neural networks (GNNs) for distinguishing nodes in graphs has been
recently characterized in terms of the Weisfeiler-Lehman (WL) test for checking graph …

被引用次数：261 相关文章所有 9 个版本

[PDF] neurips.cc

Learning transformer programs

D Friedman, A Wettig, D Chen - Advances in Neural …, 2024 - proceedings.neurips.cc

Recent research in mechanistic interpretability has attempted to reverse-engineer
Transformer models by carefully inspecting network weights and activations. However, these …

被引用次数：23 相关文章所有 5 个版本

[PDF] smu.edu.sg

Deepstellar: Model-based quantitative analysis of stateful deep learning systems

X Du, X Xie, Y Li, L Ma, Y Liu, J Zhao - … of the 2019 27th ACM Joint …, 2019 - dl.acm.org

Deep Learning (DL) has achieved tremendous success in many cutting-edge applications.
However, the state-of-the-art DL systems still suffer from quality issues. While some recent …

被引用次数：175 相关文章所有 8 个版本

[PDF] mlr.press

Neuro-symbolic language modeling with automaton-augmented retrieval

U Alon, F Xu, J He, S Sengupta… - International …, 2022 - proceedings.mlr.press

Retrieval-based language models (R-LM) model the probability of natural language text by
combining a standard language model (LM) with examples retrieved from an external …

被引用次数：52 相关文章所有 5 个版本

[PDF] mlr.press

Thinking like transformers

G Weiss, Y Goldberg, E Yahav - International Conference on …, 2021 - proceedings.mlr.press

What is the computational model behind a Transformer? Where recurrent neural networks
have direct parallels in finite state machines, allowing clear discussion and thought around …

被引用次数：112 相关文章所有 9 个版本

[PDF] arxiv.org

Sok: Explainable machine learning for computer security applications

A Nadeem, D Vos, C Cao, L Pajola… - 2023 IEEE 8th …, 2023 - ieeexplore.ieee.org

Explainable Artificial Intelligence (XAI) aims to improve the transparency of machine
learning (ML) pipelines. We systematize the increasingly growing (but fragmented) …

被引用次数：52 相关文章所有 8 个版本

[PDF] openreview.net

Interpretability illusions in the generalization of simplified models

D Friedman, AK Lampinen, L Dixon… - … on Machine Learning, 2023 - openreview.net

A common method to study deep learning systems is to use simplified model
representations—for example, using singular value decomposition to visualize the model's …

被引用次数：7 相关文章所有 4 个版本