A systematic literature review and future perspectives for handling big data analytics in COVID-19 diagnosis

N Tenali, GRM Babu - New Generation Computing, 2023 - Springer
In today's digital world, information is growing along with the expansion of Internet usage
worldwide. As a consequence, bulk of data is generated constantly which is known to be …

Virtual sample generation for small sample learning: a survey, recent developments and future prospects

J Wen, A Su, X Wang, H Xu, J Ma, K Chen, X Ge, Z Xu… - Neurocomputing, 2024 - Elsevier
Virtual sample generation (VSG) technology aims to generate virtual samples based on real
samples, in order to expand the size of the datasets and improve model performance …

Efficient hybrid oversampling and intelligent undersampling for imbalanced big data classification

C Vairetti, JL Assadi, S Maldonado - Expert Systems with Applications, 2024 - Elsevier
Imbalanced classification is a well-known challenge faced by many real-world applications.
This issue occurs when the distribution of the target variable is skewed, leading to a …

Adaptive SV-Borderline SMOTE-SVM algorithm for imbalanced data classification

J Guo, H Wu, X Chen, W Lin - Applied Soft Computing, 2024 - Elsevier
In recent years, imbalanced data classification has emerged as a challenging task. To
address this issue, we propose an adaptive SV-Borderline SMOTE-SVM (Synthetic Minority …

Magnetic force classifier: a Novel Method for Big Data classification

AB Hassanat, HN Ali, AS Tarawneh, M Alrashidi… - IEEE …, 2022 - ieeexplore.ieee.org
There are a plethora of invented classifiers in Machine learning literature, however, there is
no optimal classifier in terms of accuracy and time taken to build the trained model …

The accuracy of Random Forest performance can be improved by conducting a feature selection with a balancing strategy

MI Prasetiyowati, NU Maulidevi, K Surendro - PeerJ Computer Science, 2022 - peerj.com
One of the significant purposes of building a model is to increase its accuracy within a
shorter timeframe through the feature selection process. It is carried out by determining the …

CANTO: An actor model-based distributed fog framework supporting neural networks training in IoT applications

SN Srirama, D Vemuri - Computer Communications, 2023 - Elsevier
The large volumes of Internet of Things (IoT) data transmission to and from the cloud leads
to one of cloud-centric processing's major drawbacks: latency. Fog computing gives a …

Review of Methods for Handling Class Imbalance in Classification Problems

SS Rawat, AK Mishra - International Conference on Data, Engineering and …, 2022 - Springer
Learning classifiers using skewed or imbalanced datasets can occasionally lead to
classification issues; this is a serious issue. In some cases, one class contains the majority of …

Dealing with Class Imbalance in Uplift Modeling-Efficient Data Preprocessing via Oversampling and Matching

C Vairetti, MJ Marfán, S Maldonado - IEEE Access, 2024 - ieeexplore.ieee.org
Uplift modeling is a widely recognized predictive approach used to identify individuals who
are more likely to respond positively to an intervention or treatment, such as a marketing …

Machine learning methods to estimate productivity of harvesters: mechanized timber harvesting in Brazil

RA Munis, RO Almeida, DA Camargo, RBG da Silva… - Forests, 2022 - mdpi.com
The correct capture of forest operations information carried out in forest plantations can help
in the management of mechanized harvesting timber. Proper management must be able to …