SMOTE for learning from imbalanced data: progress and challenges, marking the 15-year anniversary

A Fernández, S Garcia, F Herrera, NV Chawla - Journal of artificial …, 2018 - jair.org
The Synthetic Minority Oversampling Technique (SMOTE) preprocessing algorithm is
considered" de facto" standard in the framework of learning from imbalanced data. This is …

[HTML][HTML] Learning from imbalanced data: open challenges and future directions

B Krawczyk - Progress in artificial intelligence, 2016 - Springer
Despite more than two decades of continuous development learning from imbalanced data
is still a focus of intense research. Starting as a problem of skewed distributions of binary …

Data imbalance in classification: Experimental evaluation

F Thabtah, S Hammoud, F Kamalov, A Gonsalves - Information Sciences, 2020 - Elsevier
Abstract The advent of Big Data has ushered a new era of scientific breakthroughs. One of
the common issues that affects raw data is class imbalance problem which refers to …

DKDFN: Domain knowledge-guided deep collaborative fusion network for multimodal unitemporal remote sensing land cover classification

Y Li, Y Zhou, Y Zhang, L Zhong, J Wang… - ISPRS Journal of …, 2022 - Elsevier
Land use and land cover maps provide fundamental information that has been used in
different types of studies, ranging from public health to carbon cycling. However, the existing …

A survey of predictive modeling on imbalanced domains

P Branco, L Torgo, RP Ribeiro - ACM computing surveys (CSUR), 2016 - dl.acm.org
Many real-world data-mining applications involve obtaining predictive models using
datasets with strongly imbalanced distributions of the target variable. Frequently, the least …

A review on classification of imbalanced data for wireless sensor networks

H Patel, D Singh Rajput… - International …, 2020 - journals.sagepub.com
Classification of imbalanced data is a vastly explored issue of the last and present decade
and still keeps the same importance because data are an essential term today and it …

Tutorial on practical tips of the most influential data preprocessing algorithms in data mining

S García, J Luengo, F Herrera - Knowledge-Based Systems, 2016 - Elsevier
Data preprocessing is a major and essential stage whose main goal is to obtain final data
sets that can be considered correct and useful for further data mining algorithms. This paper …

[HTML][HTML] 不平衡数据分类方法综述

李艳霞, 柴毅, 胡友强, 尹宏鹏 - 控制与决策, 2019 - kzyjc.alljournals.cn
随着信息技术的快速发展, 各领域的数据正以前所未有的速度产生并被广泛收集和存储,
如何实现数据的智能化处理从而利用数据中蕴含的有价值信息已成为理论和应用的研究热点 …

Adaptive semi-unsupervised weighted oversampling (A-SUWO) for imbalanced datasets

I Nekooeimehr, SK Lai-Yuen - Expert Systems with Applications, 2016 - Elsevier
In many applications, the dataset for classification may be highly imbalanced where most of
the instances in the training set may belong to one of the classes (majority class), while only …

Analyzing the oversampling of different classes and types of examples in multi-class imbalanced datasets

JA Sáez, B Krawczyk, M Woźniak - Pattern Recognition, 2016 - Elsevier
Canonical machine learning algorithms assume that the number of objects in the considered
classes are roughly similar. However, in many real-life situations the distribution of examples …