Addressing data complexity for imbalanced data sets: analysis of SMOTE-based oversampling and evolutionary undersampling

J Luengo, A Fernández, S García, F Herrera - Soft Computing, 2011 - Springer
In the classification framework there are problems in which the number of examples per
class is not equitably distributed, formerly known as imbalanced data sets. This situation is a …

The proposal of undersampling method for learning from imbalanced datasets

M Bach, A Werner, M Palt - Procedia Computer Science, 2019 - Elsevier
Highly imbalanced data, which occurs in many real-world applications, often makes
machine-based processing difficult or even impossible. The over-and under-sampling …

Imbalance: Oversampling algorithms for imbalanced classification in R

I Cordón, S García, A Fernández, F Herrera - Knowledge-Based Systems, 2018 - Elsevier
Addressing imbalanced datasets in classification tasks is a relevant topic in research
studies. The main reason is that for standard classification algorithms, the success rate when …

An empirical comparison and evaluation of minority oversampling techniques on a large number of imbalanced datasets

G Kovács - Applied Soft Computing, 2019 - Elsevier
Learning and mining from imbalanced datasets gained increased interest in recent years.
One simple but efficient way to increase the performance of standard machine learning …

SMOTE–IPF: Addressing the noisy and borderline examples problem in imbalanced classification by a re-sampling method with filtering

JA Sáez, J Luengo, J Stefanowski, F Herrera - Information Sciences, 2015 - Elsevier
Classification datasets often have an unequal class distribution among their examples. This
problem is known as imbalanced classification. The Synthetic Minority Over-sampling …

Influence of minority class instance types on SMOTE imbalanced data oversampling

P Skryjomski, B Krawczyk - first international workshop on …, 2017 - proceedings.mlr.press
Despite more than two decades of intense research, learning from imbalanced data still
remains as one of the major difficulties posed for computational intelligence systems. Among …

[PDF][PDF] Using information on class interrelations to improve classification of multiclass imbalanced data: a new resampling algorithm

M Janicka, M Lango… - International Journal of …, 2019 - intapi.sciendo.com
The relations between multiple imbalanced classes can be handled with a specialized
approach which evaluates types of examples' difficulty based on an analysis of the class …

Evolutionary undersampling for classification with imbalanced datasets: Proposals and taxonomy

S García, F Herrera - Evolutionary computation, 2009 - direct.mit.edu
Learning with imbalanced data is one of the recent challenges in machine learning. Various
solutions have been proposed in order to find a treatment for this problem, such as …

Evaluation of sampling methods for learning from imbalanced data

G Goel, L Maguire, Y Li, S McLoone - … , ICIC 2013, Nanning, China, July 28 …, 2013 - Springer
The problem of learning from imbalanced data is of critical importance in a large number of
application domains and can be a bottleneck in the performance of various conventional …

A self‐adaptive synthetic over‐sampling technique for imbalanced classification

X Gu, PP Angelov, EA Soares - International Journal of …, 2020 - Wiley Online Library
Traditionally, in supervised machine learning,(a significant) part of the available data
(usually 50%‐80%) is used for training and the rest—for validation. In many problems …