Learning from class-imbalanced data: Review of methods and applications

G Haixiang, L Yijing, J Shang, G Mingyun… - Expert systems with …, 2017 - Elsevier
Rare events, especially those that could potentially negatively impact society, often require
humans' decision-making responses. Detecting rare events can be viewed as a prediction …

Systematic literature review of preprocessing techniques for imbalanced data

EA Felix, SP Lee - Iet Software, 2019 - Wiley Online Library
Data preprocessing remains an important step in machine learning studies. This is because
proper preprocessing of imbalanced data can enable researchers to reduce defects as …

GHOST: adjusting the decision threshold to handle imbalanced data in machine learning

C Esposito, GA Landrum, N Schneider… - Journal of Chemical …, 2021 - ACS Publications
Machine learning classifiers trained on class imbalanced data are prone to overpredict the
majority class. This leads to a larger misclassification rate for the minority class, which in …

Consensus clustering‐based undersampling approach to imbalanced learning

A Onan - Scientific Programming, 2019 - Wiley Online Library
Class imbalance is an important problem, encountered in machine learning applications,
where one class (named as, the minority class) has extremely small number of instances …

Spatio-temporal modeling of PM2. 5 risk mapping using three machine learning algorithms

SZ Shogrkhodaei, SV Razavi-Termeh, A Fathnia - Environmental Pollution, 2021 - Elsevier
Urban air pollution is one of the most critical issues that affect the environment, community
health, economy, and management of urban areas. From a public health perspective, PM …

Integrating TANBN with cost sensitive classification algorithm for imbalanced data in medical diagnosis

D Gan, J Shen, B An, M Xu, N Liu - Computers & Industrial Engineering, 2020 - Elsevier
For the imbalanced classification problems, most traditional classification models only focus
on searching for an excellent classifier to maximize classification accuracy with the fixed …

Equalization ensemble for large scale highly imbalanced data classification

J Ren, Y Wang, M Mao, Y Cheung - Knowledge-Based Systems, 2022 - Elsevier
The class-imbalance problem has been widely distributed in various research fields. The
larger the data scale and the higher the data imbalance, the more difficult the proper …

Creating the 2011 area classification for output areas (2011 OAC)

CG Gale, A Singleton, AG Bates… - Journal of Spatial …, 2016 - discovery.ucl.ac.uk
This paper presents the methodology that has been used to create the 2011 Area
Classification for Output Areas (2011 OAC). This extends a lineage of widely used public …

[HTML][HTML] Influence of resampling techniques on Bayesian network performance in predicting increased algal activity

MZ Rezaabad, H Lacey, L Marshall, F Johnson - Water Research, 2023 - Elsevier
Early warning of increased algal activity is important to mitigate potential impacts on aquatic
life and human health. While many methods have been developed to predict increased algal …

Machine-learning approach for predicting the occurrence and timing of mid-winter ice breakups on canadian rivers

M De Coste, Z Li, Y Dibike - Environmental Modelling & Software, 2022 - Elsevier
The increasingly common occurrence of Mid-Winter Breakups (MWBs) in Canadian rivers,
consisting of the early breakup of ice cover outside of the typical spring season, is a cause …