The balancing trick: Optimized sampling of imbalanced datasets—A brief survey of the recent State of the Art

S Susan, A Kumar - Engineering Reports, 2021 - Wiley Online Library
This survey paper focuses on one of the current primary issues challenging data mining
researchers experimenting on real‐world datasets. The problem is that of imbalanced class …

Review of resampling techniques for the treatment of imbalanced industrial data classification in equipment condition monitoring

Y Yuan, J Wei, H Huang, W Jiao, J Wang… - … Applications of Artificial …, 2023 - Elsevier
In an actual industrial scenario, machines typically operate normally for the majority of the
time, with malfunctions occurring only occasionally. As a result, there is very little recorded …

Tabular and latent space synthetic data generation: a literature review

J Fonseca, F Bacao - Journal of Big Data, 2023 - Springer
The generation of synthetic data can be used for anonymization, regularization,
oversampling, semi-supervised learning, self-supervised learning, and several other tasks …

COSTE: Complexity-based OverSampling TEchnique to alleviate the class imbalance problem in software defect prediction

S Feng, J Keung, X Yu, Y Xiao, KE Bennin… - Information and …, 2021 - Elsevier
Context: Generally, there are more non-defective instances than defective instances in the
datasets used for software defect prediction (SDP), which is referred to as the class …

Incremental weighted ensemble broad learning system for imbalanced data

K Yang, Z Yu, CLP Chen, W Cao… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Broad learning system (BLS) is a novel and efficient model, which facilitates representation
learning and classification by concatenating feature nodes and enhancement nodes. In spite …

Gaussian distribution based oversampling for imbalanced data classification

Y Xie, M Qiu, H Zhang, L Peng… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
The imbalanced data classification problem widely exists in many real-world applications.
Data resampling is a promising technique to deal with imbalanced data through either …

A GAN-based hybrid sampling method for imbalanced customer classification

B Zhu, X Pan, S vanden Broucke, J Xiao - Information Sciences, 2022 - Elsevier
Class imbalance is a critical issue in customer classification, for which a plethora of
techniques have been proposed in the current body of literature. In particular, generative …

Subspace-based minority oversampling for imbalance classification

T Li, Y Wang, L Liu, L Chen, CLP Chen - Information Sciences, 2023 - Elsevier
In pattern classification, the class imbalance problem always occurs when the number of
observations in some classes is significantly different from that of other categories, which …

Hybrid neural network with cost-sensitive support vector machine for class-imbalanced multimodal data

KH Kim, SY Sohn - Neural Networks, 2020 - Elsevier
Although deep learning exhibits advantages in various applications involving multimodal
data, it cannot effectively solve the class-imbalance problem. Herein, we propose a hybrid …

Direct-sense brain–computer interfaces and wearable computers

CT Lin, TTN Do - IEEE Transactions on Systems, Man, and …, 2020 - ieeexplore.ieee.org
Brain-computer interfaces (BCIs) allow users to communicate directly with external devices
via their brain signals. Recently, BCIs, and wearable computers in particular, have been …