[HTML][HTML] A survey on dataset quality in machine learning

Y Gong, G Liu, Y Xue, R Li, L Meng - Information and Software Technology, 2023 - Elsevier
With the rise of big data, the quality of datasets has become a crucial factor affecting the
performance of machine learning models. High-quality datasets are essential for the …

A survey of data quality measurement and monitoring tools

L Ehrlinger, W Wöß - Frontiers in big data, 2022 - frontiersin.org
High-quality data is key to interpretable and trustworthy data analytics and the basis for
meaningful data-driven decisions. In practical scenarios, data quality is typically associated …

Interactive correction of mislabeled training data

S Xiang, X Ye, J Xia, J Wu, Y Chen… - 2019 IEEE Conference …, 2019 - ieeexplore.ieee.org
In this paper, we develop a visual analysis method for interactively improving the quality of
labeled data, which is essential to the success of supervised and semi-supervised learning …

An advanced big data quality framework based on weighted metrics

W Elouataoui, I El Alaoui, S El Mendili… - Big Data and Cognitive …, 2022 - mdpi.com
While big data benefits are numerous, the use of big data requires, however, addressing
new challenges related to data processing, data security, and especially degradation of data …

Capturing and visualizing provenance from data wrangling

C Bors, T Gschwandtner… - IEEE computer graphics …, 2019 - ieeexplore.ieee.org
Data quality management and assessment play a vital role for ensuring the trust in the data
and its fitness-of-use for subsequent analysis. The transformation history of a data wrangling …

A visual analysis approach to understand and explore quality problems of AIS data

W He, J Lei, X Chu, S Xie, C Zhong, Z Li - Journal of Marine Science and …, 2021 - mdpi.com
Low quality automatic identification system (AIS) data often mislead analysts to a
misunderstanding of ship behavior analysis and to making incorrect navigation risk …

Exploring the Impact of Data Quality on Business Performance in CRM Systems for Home Appliance Business

Y Suh - IEEE Access, 2023 - ieeexplore.ieee.org
In customer relationship management (CRM), high-quality customer data is at the heart of
reliable data analysis and is the foundation for data-driven decisions that impact business …

Use of context in data quality management: a systematic literature review

F Serra, V Peralta, A Marotta, P Marcel - ACM Journal of Data and …, 2022 - dl.acm.org
The importance of context in data quality (DQ) was shown many years ago and nowadays is
widely accepted. Early approaches and surveys defined DQ as fitness for use and showed …

AI-Driven Frameworks for Enhancing Data Quality in Big Data Ecosystems: Error_Detection, Correction, and Metadata Integration

W Elouataoui - arXiv preprint arXiv:2405.03870, 2024 - arxiv.org
The widespread adoption of big data has ushered in a new era of data-driven decision-
making, transforming numerous industries and sectors. However, the efficacy of these …

Data Guards: Challenges and Solutions for Fostering Trust in Data

N Sultanum, D Bromley, M Correll - arXiv preprint arXiv:2407.14042, 2024 - arxiv.org
From dirty data to intentional deception, there are many threats to the validity of data-driven
decisions. Making use of data, especially new or unfamiliar data, therefore requires a …