Feature subset selection for data and feature streams: a review

C Villa-Blanco, C Bielza, P Larrañaga - Artificial Intelligence Review, 2023 - Springer
Real-world problems are commonly characterized by a high feature dimensionality, which
hinders the modelling and descriptive analysis of the data. However, some of these data …

An improved random forest based on the classification accuracy and correlation measurement of decision trees

Z Sun, G Wang, P Li, H Wang, M Zhang… - Expert Systems with …, 2024 - Elsevier
Random forest is one of the most widely used machine learning algorithms. Decision trees
used to construct the random forest may have low classification accuracies or high …

Performance supervised plant-wide process monitoring in industry 4.0: A roadmap

Y Jiang, S Yin, O Kaynak - IEEE Open Journal of the Industrial …, 2020 - ieeexplore.ieee.org
The intensive research and development efforts directed towards large-scale complex
industrial systems in the context of Industry 4.0 indicate that safety and reliability issues pose …

Multi-objective PSO based online feature selection for multi-label classification

D Paul, A Jain, S Saha, J Mathew - Knowledge-Based Systems, 2021 - Elsevier
Feature selection approaches aim to select a set of prominent features that best describe the
data to improve the efficiency without degrading the performance of the model. In many real …

Feature interaction for streaming feature selection

P Zhou, P Li, S Zhao, X Wu - IEEE Transactions on Neural …, 2020 - ieeexplore.ieee.org
Traditional feature selection methods assume that all data instances and features are known
before learning. However, it is not the case in many real-world applications that we are more …

A-Stacking and A-Bagging: Adaptive versions of ensemble learning algorithms for spoof fingerprint detection

S Agarwal, CR Chowdary - Expert Systems with Applications, 2020 - Elsevier
Stacking and bagging are widely used ensemble learning approaches that make use of
multiple classifier systems. Stacking focuses on building an ensemble of heterogeneous …

Stable bagging feature selection on medical data

S Alelyani - Journal of Big Data, 2021 - Springer
In the medical field, distinguishing genes that are relevant to a specific disease, let's say
colon cancer, is crucial to finding a cure and understanding its causes and subsequent …

Predictive model for the 5-year survival status of osteosarcoma patients based on the SEER database and XGBoost algorithm

J Jiang, H Pan, M Li, B Qian, X Lin, S Fan - Scientific reports, 2021 - nature.com
Osteosarcoma is the most common bone malignancy, with the highest incidence in children
and adolescents. Survival rate prediction is important for improving prognosis and planning …

Machine learning for detecting anomalies and intrusions in communication networks

Z Li, ALG Rios, L Trajković - IEEE Journal on Selected Areas in …, 2021 - ieeexplore.ieee.org
Cyber attacks are becoming more sophisticated and, hence, more difficult to detect. Using
efficient and effective machine learning techniques to detect network anomalies and …

An alternative approach to dimension reduction for pareto distributed data: a case study

M Roccetti, G Delnevo, L Casini, S Mirri - Journal of big Data, 2021 - Springer
Deep learning models are tools for data analysis suitable for approximating (non-linear)
relationships among variables for the best prediction of an outcome. While these models can …