A survey of safety and trustworthiness of deep neural networks: Verification, testing, adversarial attack and defence, and interpretability

X Huang, D Kroening, W Ruan, J Sharp, Y Sun… - Computer Science …, 2020 - Elsevier
In the past few years, significant progress has been made on deep neural networks (DNNs)
in achieving human-level performance on several long-standing tasks. With the broader …

Testing machine learning based systems: a systematic mapping

V Riccio, G Jahangirova, A Stocco… - Empirical Software …, 2020 - Springer
Abstract Context: A Machine Learning based System (MLS) is a software system including
one or more components that learn how to perform a task from a given data set. The …

Machine learning testing: Survey, landscapes and horizons

JM Zhang, M Harman, L Ma… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
This paper provides a comprehensive survey of techniques for testing machine learning
systems; Machine Learning Testing (ML testing) research. It covers 144 papers on testing …

Software engineering for AI-based systems: a survey

S Martínez-Fernández, J Bogner, X Franch… - ACM Transactions on …, 2022 - dl.acm.org
AI-based systems are software systems with functionalities enabled by at least one AI
component (eg, for image-, speech-recognition, and autonomous driving). AI-based systems …

Deephunter: a coverage-guided fuzz testing framework for deep neural networks

X Xie, L Ma, F Juefei-Xu, M Xue, H Chen, Y Liu… - Proceedings of the 28th …, 2019 - dl.acm.org
The past decade has seen the great potential of applying deep neural network (DNN) based
software to safety-critical scenarios, such as autonomous driving. Similar to traditional …

Deepgauge: Multi-granularity testing criteria for deep learning systems

L Ma, F Juefei-Xu, F Zhang, J Sun, M Xue, B Li… - Proceedings of the 33rd …, 2018 - dl.acm.org
Deep learning (DL) defines a new data-driven programming paradigm that constructs the
internal system logic of a crafted neuron network through a set of training data. We have …

Guiding deep learning system testing using surprise adequacy

J Kim, R Feldt, S Yoo - 2019 IEEE/ACM 41st International …, 2019 - ieeexplore.ieee.org
Deep Learning (DL) systems are rapidly being adopted in safety and security critical
domains, urgently calling for ways to test their correctness and robustness. Testing of DL …

Fakespotter: A simple yet robust baseline for spotting ai-synthesized fake faces

R Wang, F Juefei-Xu, L Ma, X Xie, Y Huang… - arXiv preprint arXiv …, 2019 - arxiv.org
In recent years, generative adversarial networks (GANs) and its variants have achieved
unprecedented success in image synthesis. They are widely adopted in synthesizing facial …

Assuring the machine learning lifecycle: Desiderata, methods, and challenges

R Ashmore, R Calinescu, C Paterson - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
Machine learning has evolved into an enabling technology for a wide range of highly
successful applications. The potential for this success to continue and accelerate has placed …

A software engineering perspective on engineering machine learning systems: State of the art and challenges

G Giray - Journal of Systems and Software, 2021 - Elsevier
Context: Advancements in machine learning (ML) lead to a shift from the traditional view of
software development, where algorithms are hard-coded by humans, to ML systems …