Data-centric artificial intelligence: A survey

D Zha, ZP Bhat, KH Lai, F Yang, Z Jiang… - arXiv preprint arXiv …, 2023 - arxiv.org
Artificial Intelligence (AI) is making a profound impact in almost every domain. A vital enabler
of its great success is the availability of abundant and high-quality data for building machine …

Fuzzy clustering decomposition of genetic algorithm-based instance selection for regression problems

M Kordos, M Blachnik, R Scherer - Information Sciences, 2022 - Elsevier
Data selection, which includes feature and instance selection, is often an important step in
building prediction systems. Genetic algorithms (GA) frequently allow finding better solutions …

Abnormal transactions detection in the ethereum network using semi-supervised generative adversarial networks

YK Sanjalawe, SR Al-E'mari - IEEE Access, 2023 - ieeexplore.ieee.org
Numerous abnormal transactions have been exposed as a result of targeted attacks on
Ethereum, such as the Ethereum Decentralized Autonomous Organization attack. Exploiting …

How to improve customer engagement in social networks: a study of Spanish brands in the automotive industry

L Matosas-López, A Romero-Ania - Journal of theoretical and applied …, 2021 - mdpi.com
The objective of this research is to identify to what extent volumes, components, time slots,
and publication topics improve customer engagement with Spanish automotive brands …

Long-horizon predictions of credit default with inconsistent customers

G Chi, B Dong, Y Zhou, P Jin - Technological Forecasting and Social …, 2024 - Elsevier
We developed a decision support framework for default predictions that addresses two
common issues: inconsistent customers and predictions of future defaults. We developed a …

Graph reduction techniques for instance selection: comparative and empirical study

Z Rustamov, N Zaki, J Rustamov, A Zaitouny… - Artificial Intelligence …, 2024 - Springer
The surge in data generation has prompted a shift to big data, challenging the notion that
“more data equals better performance” due to processing and time constraints. In this …

Logistic model tree forest for steel plates faults prediction

B Ghasemkhani, R Yilmaz, D Birant, RA Kut - Machines, 2023 - mdpi.com
Fault prediction is a vital task to decrease the costs of equipment maintenance and repair, as
well as to improve the quality level of products and production efficiency. Steel plates fault …

The efficiency of social network services management in organizations. An in-depth analysis applying machine learning algorithms and multiple linear regressions

L Matosas-López, A Romero-Ania - Applied Sciences, 2020 - mdpi.com
The objective of this work is to detect the variables that allow organizations to manage their
social network services efficiently. The study, applying machine learning algorithms and …

Novel hippocampus-centered methodology for informative instance selection in Alzheimer's disease data

JA Castro-Silva, MN Moreno-García, L Guachi-Guachi… - Heliyon, 2024 - cell.com
The quantity and quality of a dataset play a crucial role in the performance of prediction
models. Increasing the amount of data increases the computational requirements and can …

Maximum a posteriori estimation and filtering algorithm for numerical label noise

G Jiang, Z Li, W Wang - Applied Intelligence, 2024 - Springer
Data quality, especially label quality, may have a significant impact on the prediction
accuracy in supervised learning. Training on datasets with label noise causes a degradation …