Data preparation for software vulnerability prediction: A systematic literature review

R Croft, Y Xie, MA Babar - IEEE Transactions on Software …, 2022 - ieeexplore.ieee.org
Software Vulnerability Prediction (SVP) is a data-driven technique for software quality
assurance that has recently gained considerable attention in the Software Engineering …

Data quality issues in software fault prediction: a systematic literature review

K Bhandari, K Kumar, AL Sangal - Artificial Intelligence Review, 2023 - Springer
Software fault prediction (SFP) aims to improve software quality with a possible minimum
cost and time. Various machine learning models have been proposed in the past for …

Data quality for software vulnerability datasets

R Croft, MA Babar, MM Kholoosi - 2023 IEEE/ACM 45th …, 2023 - ieeexplore.ieee.org
The use of learning-based techniques to achieve automated software vulnerability detection
has been of longstanding interest within the software security domain. These data-driven …

How to address the data quality issues in regression models: A guided process for data cleaning

DC Corrales, JC Corrales, A Ledezma - Symmetry, 2018 - mdpi.com
Today, data availability has gone from scarce to superabundant. Technologies like IoT,
trends in social media and the capabilities of smart-phones are producing and digitizing lots …

[图书][B] Social data analytics

A Beheshti, S Ghodratnama, M Elahi, H Farhood - 2022 - taylorfrancis.com
This book is an introduction to social data analytics along with its challenges and
opportunities in the age of Big Data and Artificial Intelligence. It focuses primarily on …

A random forest model for early-stage software effort estimation for the SEERA dataset

EI Mustafa, R Osman - Information and Software Technology, 2024 - Elsevier
Context Publicly available software cost estimation datasets are outdated and may not
represent current industrial environments. Thus most research has concentrated on the …

Experience: Quality benchmarking of datasets used in software effort estimation

MF Bosu, SG Macdonell - Journal of Data and Information Quality (JDIQ), 2019 - dl.acm.org
Data is a cornerstone of empirical software engineering (ESE) research and practice. Data
underpin numerous process and project management activities, including the estimation of …

Filter Methods for Feature Selection in Supervised Machine Learning Applications--Review and Benchmark

K Hopf, S Reifenrath - arXiv preprint arXiv:2111.12140, 2021 - arxiv.org
The amount of data for machine learning (ML) applications is constantly growing. Not only
the number of observations, especially the number of measured variables (features) …

Data cleaning and machine learning: a systematic literature review

PO Côté, A Nikanjam, N Ahmed, D Humeniuk… - Automated Software …, 2024 - Springer
Abstract Machine Learning (ML) is integrated into a growing number of systems for various
applications. Because the performance of an ML model is highly dependent on the quality of …

A functional taxonomy of data quality tools: Insights from science and practice

M Altendeitering, M Tomczyk - 2022 - aisel.aisnet.org
For organizations data quality is a prerequisite for automated decision making and agility. To
provide high quality data, numerous tools have emerged that support the different steps of …