Advances, challenges and opportunities in creating data for trustworthy AI

W Liang, GA Tadesse, D Ho, L Fei-Fei… - Nature Machine …, 2022 - nature.com
As artificial intelligence (AI) transitions from research to deployment, creating the appropriate
datasets and data pipelines to develop and evaluate AI models is increasingly the biggest …

Sustainability in wood products: a new perspective for handling natural diversity

M Schubert, G Panzarasa, I Burgert - Chemical Reviews, 2022 - ACS Publications
Wood is a renewable resource with excellent qualities and the potential to become a key
element of a future bioeconomy. The increasing environmental awareness and drive to …

Deep neural networks and tabular data: A survey

V Borisov, T Leemann, K Seßler, J Haug… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Heterogeneous tabular data are the most commonly used form of data and are essential for
numerous critical and computationally demanding applications. On homogeneous datasets …

Data collection and quality challenges in deep learning: A data-centric ai perspective

SE Whang, Y Roh, H Song, JG Lee - The VLDB Journal, 2023 - Springer
Data-centric AI is at the center of a fundamental shift in software engineering where machine
learning becomes the new software, powered by big data and computing infrastructure …

Towards accountability for machine learning datasets: Practices from software engineering and infrastructure

B Hutchinson, A Smart, A Hanna, E Denton… - Proceedings of the …, 2021 - dl.acm.org
Datasets that power machine learning are often used, shared, and reused with little visibility
into the processes of deliberation that led to their creation. As artificial intelligence systems …

Machine learning testing: Survey, landscapes and horizons

JM Zhang, M Harman, L Ma… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
This paper provides a comprehensive survey of techniques for testing machine learning
systems; Machine Learning Testing (ML testing) research. It covers 144 papers on testing …

The what-if tool: Interactive probing of machine learning models

J Wexler, M Pushkarna, T Bolukbasi… - IEEE transactions on …, 2019 - ieeexplore.ieee.org
A key challenge in developing and deploying Machine Learning (ML) systems is
understanding their performance across a wide range of inputs. To address this challenge …

Software engineering for AI-based systems: a survey

S Martínez-Fernández, J Bogner, X Franch… - ACM Transactions on …, 2022 - dl.acm.org
AI-based systems are software systems with functionalities enabled by at least one AI
component (eg, for image-, speech-recognition, and autonomous driving). AI-based systems …

A survey on data collection for machine learning: a big data-ai integration perspective

Y Roh, G Heo, SE Whang - IEEE Transactions on Knowledge …, 2019 - ieeexplore.ieee.org
Data collection is a major bottleneck in machine learning and an active research topic in
multiple communities. There are largely two reasons data collection has recently become a …

Software engineering for machine learning: A case study

S Amershi, A Begel, C Bird, R DeLine… - 2019 IEEE/ACM 41st …, 2019 - ieeexplore.ieee.org
Recent advances in machine learning have stimulated widespread interest within the
Information Technology sector on integrating AI capabilities into software and services. This …