Data preparation for software vulnerability prediction: A systematic literature review
Software Vulnerability Prediction (SVP) is a data-driven technique for software quality
assurance that has recently gained considerable attention in the Software Engineering …
assurance that has recently gained considerable attention in the Software Engineering …
Data quality issues in software fault prediction: a systematic literature review
Software fault prediction (SFP) aims to improve software quality with a possible minimum
cost and time. Various machine learning models have been proposed in the past for …
cost and time. Various machine learning models have been proposed in the past for …
Data quality for software vulnerability datasets
The use of learning-based techniques to achieve automated software vulnerability detection
has been of longstanding interest within the software security domain. These data-driven …
has been of longstanding interest within the software security domain. These data-driven …
How to address the data quality issues in regression models: A guided process for data cleaning
Today, data availability has gone from scarce to superabundant. Technologies like IoT,
trends in social media and the capabilities of smart-phones are producing and digitizing lots …
trends in social media and the capabilities of smart-phones are producing and digitizing lots …
[图书][B] Social data analytics
This book is an introduction to social data analytics along with its challenges and
opportunities in the age of Big Data and Artificial Intelligence. It focuses primarily on …
opportunities in the age of Big Data and Artificial Intelligence. It focuses primarily on …
A random forest model for early-stage software effort estimation for the SEERA dataset
EI Mustafa, R Osman - Information and Software Technology, 2024 - Elsevier
Context Publicly available software cost estimation datasets are outdated and may not
represent current industrial environments. Thus most research has concentrated on the …
represent current industrial environments. Thus most research has concentrated on the …
Experience: Quality benchmarking of datasets used in software effort estimation
MF Bosu, SG Macdonell - Journal of Data and Information Quality (JDIQ), 2019 - dl.acm.org
Data is a cornerstone of empirical software engineering (ESE) research and practice. Data
underpin numerous process and project management activities, including the estimation of …
underpin numerous process and project management activities, including the estimation of …
Filter Methods for Feature Selection in Supervised Machine Learning Applications--Review and Benchmark
K Hopf, S Reifenrath - arXiv preprint arXiv:2111.12140, 2021 - arxiv.org
The amount of data for machine learning (ML) applications is constantly growing. Not only
the number of observations, especially the number of measured variables (features) …
the number of observations, especially the number of measured variables (features) …
Data cleaning and machine learning: a systematic literature review
Abstract Machine Learning (ML) is integrated into a growing number of systems for various
applications. Because the performance of an ML model is highly dependent on the quality of …
applications. Because the performance of an ML model is highly dependent on the quality of …
A functional taxonomy of data quality tools: Insights from science and practice
M Altendeitering, M Tomczyk - 2022 - aisel.aisnet.org
For organizations data quality is a prerequisite for automated decision making and agility. To
provide high quality data, numerous tools have emerged that support the different steps of …
provide high quality data, numerous tools have emerged that support the different steps of …