Two density-based sampling approaches for imbalanced and overlapping data

S Mayabadi, H Saadatfar - Knowledge-Based Systems, 2022 - Elsevier
An imbalanced dataset consists of a majority class and a minority class, where the former's
sample size is substantially larger than other classes. This difference disrupts the data …

Ensemble Synthesized Minority Oversampling based Generative Adversarial Networks and Random Forest Algorithm for Credit Card Fraud Detection

FA Ghaleb, F Saeed, M Al-Sarem, SN Qasem… - IEEE …, 2023 - ieeexplore.ieee.org
The recent increase in credit card fraud is rapidly has caused huge monetary losses for
individuals and financial institutions. Most credit card frauds are conducted online by …

Fraud detection using large-scale imbalance dataset

ZS Rubaidi, BB Ammar, MB Aouicha - International Journal on …, 2022 - World Scientific
In the context of machine learning, an imbalanced classification problem states to a dataset
in which the classes are not evenly distributed. This problem commonly occurs when …

An improved SMOTE based on center offset factor and synthesis strategy for imbalanced data classification

Y Zhang, L Deng, H Huang, B Wei - The Journal of Supercomputing, 2024 - Springer
It is an enormous challenge for imbalanced data learning in the field of machine learning. To
construct balanced datasets, oversampling techniques have been studied extensively …

An adaptive synthetic sampling and batch generation-oriented hybrid approach for addressing class imbalance problem in software defect prediction

A Taskeen, SUR Khan, A Mashkoor - Soft Computing, 2024 - Springer
Learning classifiers with uneven class distribution datasets poses a significant challenge in
software defect prediction. This problem arises when the number of samples representing …

Imbalanced Data Classification Based on Improved Random-SMOTE and Feature Standard Deviation

Y Zhang, L Deng, B Wei - Mathematics, 2024 - mdpi.com
Oversampling techniques are widely used to rebalance imbalanced datasets. However,
most of the oversampling methods may introduce noise and fuzzy boundaries for dataset …

Real-time video surveillance on highways using combination of extended Kalman Filter and deep reinforcement learning

L Fu, Q Zhang, S Tian - Heliyon, 2024 - cell.com
Highways, as one of the main arteries of transit and transportation in today's world, play a
fundamental role in accelerating transportation, and for this reason, continuous monitoring of …

[PDF][PDF] End2end unstructured data processing, confidential data structuring & storage using image processing, nlp, machine learning, and blockchain

CD STRUCTURING - Journal of Theoretical and Applied Information …, 2022 - jatit.org
The expediting magnification of automating the manual jobs into automated is incrementing
day by day, as there are approximately 2.5 quintillion bytes of data exchanged over the …

Learning from Highly Imbalanced Big Data with Label Noise.

JM Johnson, RKL Kennedy… - … Journal on Artificial …, 2023 - search.ebscohost.com
This study explores the effects of class label noise on detecting fraud within three highly
imbalanced healthcare fraud data sets containing millions of claims and minority class sizes …

[PDF][PDF] Perbandingan Algoritma Klasifikasi Random Forest, Gaussian Naive Bayes, dan K-Nearest Neighbor untuk Data Tidak Seimbang dan Data yang …

AP Monika, FEP Risti, I Binanto… - Jurnal Seminar Nasional …, 2023 - researchgate.net
Tujuan dari penelitian ini adalah membandingkan efektivitas penggunaan teknik data
seimbang (balance) menggunakan Adaptive Synthetic (ADASYN) dengan metode …