On the experiences of adopting automated data validation in an industrial machine learning project

LE Lwakatare, E Rånge, I Crnkovic… - 2021 IEEE/ACM 43rd …, 2021 - ieeexplore.ieee.org
Data errors are a common challenge in machine learning (ML) projects and generally cause
significant performance degradation in ML-enabled software systems. To ensure early …

Risk-based data validation in machine learning-based software systems

H Foidl, M Felderer - proceedings of the 3rd ACM SIGSOFT international …, 2019 - dl.acm.org
Data validation is an essential requirement to ensure the reliability and quality of Machine
Learning-based Software Systems. However, an exhaustive validation of all data fed to …

Systematic training and testing for machine learning using combinatorial interaction testing

T Cody, E Lanus, DD Doyle… - 2022 IEEE International …, 2022 - ieeexplore.ieee.org
This paper demonstrates the systematic use of combinatorial coverage for selecting and
characterizing test and training sets for machine learning models. The presented work …

A systematic mapping of quality models for AI systems, software and components

MA Ali, NK Yap, AAA Ghani, H Zulzalil… - Applied Sciences, 2022 - mdpi.com
Recently, there has been a significant increase in the number of Artificial Intelligence (AI)
systems, software, and components. As a result, it is crucial to evaluate their quality. Quality …

Simple prediction of an ecosystem-specific water quality index and the water quality classification of a highly polluted river through supervised machine learning

A Fernández del Castillo, C Yebra-Montes… - Water, 2022 - mdpi.com
Water quality indices (WQIs) are used for the simple assessment and classification of the
water quality of surface water sources. However, considerable time, financial resources, and …

Automatic fault detection for deep learning programs using graph transformations

A Nikanjam, HB Braiek, MM Morovati… - ACM Transactions on …, 2021 - dl.acm.org
Nowadays, we are witnessing an increasing demand in both corporates and academia for
exploiting Deep Learning (DL) to solve complex real-world problems. A DL program …

Peatmoss: A dataset and initial analysis of pre-trained models in open-source software

W Jiang, J Yasmin, J Jones, N Synovic… - 2024 IEEE/ACM 21st …, 2024 - ieeexplore.ieee.org
The development and training of deep learning models have become increasingly costly
and complex. Consequently, software engineers are adopting pre-trained models (PTMs) for …

[PDF][PDF] Compositional automata learning of synchronous systems

T Neele, M Sammartino - International Conference on …, 2023 - library.oapen.org
Automata learning is a technique to infer an automaton model of a black-box system via
queries to the system. In recent years it has found widespread use both in industry and …

Rethinking software engineering in the era of foundation models: A curated catalogue of challenges in the development of trustworthy fmware

AE Hassan, D Lin, GK Rajbahadur, K Gallaba… - arXiv preprint arXiv …, 2024 - arxiv.org
Foundation models (FMs), such as Large Language Models (LLMs), have revolutionized
software development by enabling new use cases and business models. We refer to …

AGORA: automated generation of test oracles for REST APIs

JC Alonso, S Segura, A Ruiz-Cortés - Proceedings of the 32nd ACM …, 2023 - dl.acm.org
Test case generation tools for REST APIs have grown in number and complexity in recent
years. However, their advanced capabilities for automated input generation contrast with the …