Handling data irregularities in classification: Foundations, trends, and future challenges

S Das, S Datta, BB Chaudhuri - Pattern Recognition, 2018 - Elsevier
Most of the traditional pattern classifiers assume their input data to be well-behaved in terms
of similar underlying class distributions, balanced size of classes, the presence of a full set of …

Fuzzy-based information decomposition for incomplete and imbalanced data learning

S Liu, J Zhang, Y Xiang, W Zhou - IEEE Transactions on Fuzzy …, 2017 - ieeexplore.ieee.org
Class imbalance and missing values are two critical problems in pattern classification.
Researchers have proposed a number of techniques to address each of the problems …

Fuzzy rule-based oversampling technique for imbalanced and incomplete data learning

G Liu, Y Yang, B Li - Knowledge-Based Systems, 2018 - Elsevier
Datasets that have skewed class distributions pose a difficulty to learning algorithms in
pattern classification. A number of different methods to deal with this problem have been …

[HTML][HTML] Estimating summary statistics for electronic health record laboratory data for use in high-throughput phenotyping algorithms

DJ Albers, N Elhadad, J Claassen, R Perotte… - Journal of biomedical …, 2018 - Elsevier
We study the question of how to represent or summarize raw laboratory data taken from an
electronic health record (EHR) using parametric model selection to reduce or cope with …

Ontology-supported database marketing

FM Pinto, A Marques, MF Santos - Journal of Database Marketing & …, 2009 - Springer
Database marketing (DBM) provides in-depth analysis of marketing databases. Knowledge
discovery in database techniques is one of the most prominent approaches to supporting …

Using social network activity data to identify and target job seekers

P Ebbes, O Netzer - Management Science, 2022 - pubsonline.informs.org
An important challenge for many firms is to identify the life transitions of its customers, such
as job searching, expecting a child, or purchasing a home. Inferring such transitions, which …

Addressing Class Imbalance in Electronic Health Records Data Imputation

L Qian, Z Ibrahim, A Zhang… - CEUR Workshop …, 2023 - discovery.ucl.ac.uk
Imputing missing values in imbalanced datasets remains an open challenge. Most methods
assume data are missing at random or follow a standard distribution, lacking robustness for …

[PDF][PDF] Regression model approach to predict missing values in the Excel sheet databases

ZM Kumar, R Manjula - International Journal of Computer Science & …, 2012 - ijcset.com
The most important stage of data mining is pre-processing, where we prepare the data for
mining. Real-world data tends to be incomplete, noisy, and inconsistent and an important …

Empirical analysis of machine learning techniques under class imbalance and incomplete datasets

A Puri, MK Gupta - … Systems and Applications in Computer Vision, 2023 - taylorfrancis.com
Class imbalance and missing value are significant problems in datasets. Various
researchers study both issues under a different scenario, but very few consider them …

Data migration challenges: The impact of data quality—Case study of University Putra Malaysia UPM

IF Zamzami, HAA Fatani… - … on Research and …, 2011 - ieeexplore.ieee.org
Data migration is a critical process that directly influenced the quality of data management.
Data migration had affected on the quality of the data, such as, accuracy, data elements, and …