Data sets and data quality in software engineering: Eight years on

G Liebchen, M Shepperd - Proceedings of the the 12th international …, 2016 - dl.acm.org
Context: We revisit our review of data quality within the context of empirical software
engineering eight years on from our PROMISE 2008 article. Objective: To assess the extent …

Experience: Quality benchmarking of datasets used in software effort estimation

MF Bosu, SG Macdonell - Journal of Data and Information Quality (JDIQ), 2019 - dl.acm.org
Data is a cornerstone of empirical software engineering (ESE) research and practice. Data
underpin numerous process and project management activities, including the estimation of …

Towards a model and methodology for evaluating data quality in software engineering experiments

C Valverde, A Marotta, JI Panach, D Vallespir - Information and Software …, 2022 - Elsevier
Context Data collected during software engineering experiments might contain quality
problems, leading to wrong experimental conclusions. Objective We present a data quality …

Evaluating the quality of datasets in software engineering

MM Rosli, E Tempero… - Advanced Science …, 2018 - ingentaconnect.com
Research based on datasets needs to determine the quality of the data that form the basis of
the results. To facilitate this type of research, the meaning of the data needs to be interpreted …

Business process and organizational data quality model (BPODQM) for integrated process and data mining

F Betancor, F Pérez, A Marotta, A Delgado - Quality of Information and …, 2021 - Springer
Data Quality (DQ) is a key element in any Data Science project to guarantee that its results
provide consistent and reliable information. Both process mining and data mining, as part of …

[PDF][PDF] A Methodology for Integrated Process and Data Mining and Analysis towards Evidence-based Process Improvement.

A Delgado, D Calegari, A Marotta, L González… - …, 2021 - pdfs.semanticscholar.org
The socio-technical system supporting an organization's daily operations is becoming more
complex, with distributed infrastructures integrating heterogeneous technologies enacting …

A methodology for organizational data science towards evidence-based process improvement

A Delgado, D Calegari, A Marotta, L González… - … Conference on Software …, 2021 - Springer
Organizational data science projects provide organizations with evidence-based business
intelligence to improve their business processes (BPs). They require methodological …

Modeling the combined influence of complexity and quality in supervised learning

R de Ávila Mendes, LA da Silva - Intelligent Data Analysis, 2022 - content.iospress.com
Data classification is a data mining task that consists of an algorithm adjusted by a training
dataset that is used to predict an object's class (unclassified) on analysis. A significant part of …

[PDF][PDF] Data quality in empirical software engineering: An investigation of time-aware models in software effort estimation

MF Bosu - 2016 - ourarchive.otago.ac.nz
Since its inception as a recognized sub-discipline, empirical software engineering (ESE) has
been plagued with data quality issues, and in recent years this has led to an increasing …

[PDF][PDF] Data Flow Quality Monitoring in Data Infrastructures.

A Mannocci, M Avvenuti, P Manghi, A Rauber… - 2017 - iris.cnr.it
IN the last decade, a lot of attention worldwide has been brought by researchers,
organizations, and funders on the realization of Data Infrastructures (DIs), namely systems …