Handling data irregularities in classification: Foundations, trends, and future challenges
Most of the traditional pattern classifiers assume their input data to be well-behaved in terms
of similar underlying class distributions, balanced size of classes, the presence of a full set of …
of similar underlying class distributions, balanced size of classes, the presence of a full set of …
Fuzzy-based information decomposition for incomplete and imbalanced data learning
Class imbalance and missing values are two critical problems in pattern classification.
Researchers have proposed a number of techniques to address each of the problems …
Researchers have proposed a number of techniques to address each of the problems …
Fuzzy rule-based oversampling technique for imbalanced and incomplete data learning
G Liu, Y Yang, B Li - Knowledge-Based Systems, 2018 - Elsevier
Datasets that have skewed class distributions pose a difficulty to learning algorithms in
pattern classification. A number of different methods to deal with this problem have been …
pattern classification. A number of different methods to deal with this problem have been …
[HTML][HTML] Estimating summary statistics for electronic health record laboratory data for use in high-throughput phenotyping algorithms
We study the question of how to represent or summarize raw laboratory data taken from an
electronic health record (EHR) using parametric model selection to reduce or cope with …
electronic health record (EHR) using parametric model selection to reduce or cope with …
Ontology-supported database marketing
Database marketing (DBM) provides in-depth analysis of marketing databases. Knowledge
discovery in database techniques is one of the most prominent approaches to supporting …
discovery in database techniques is one of the most prominent approaches to supporting …
Using social network activity data to identify and target job seekers
An important challenge for many firms is to identify the life transitions of its customers, such
as job searching, expecting a child, or purchasing a home. Inferring such transitions, which …
as job searching, expecting a child, or purchasing a home. Inferring such transitions, which …
Addressing Class Imbalance in Electronic Health Records Data Imputation
Imputing missing values in imbalanced datasets remains an open challenge. Most methods
assume data are missing at random or follow a standard distribution, lacking robustness for …
assume data are missing at random or follow a standard distribution, lacking robustness for …
[PDF][PDF] Regression model approach to predict missing values in the Excel sheet databases
ZM Kumar, R Manjula - International Journal of Computer Science & …, 2012 - ijcset.com
The most important stage of data mining is pre-processing, where we prepare the data for
mining. Real-world data tends to be incomplete, noisy, and inconsistent and an important …
mining. Real-world data tends to be incomplete, noisy, and inconsistent and an important …
Empirical analysis of machine learning techniques under class imbalance and incomplete datasets
Class imbalance and missing value are significant problems in datasets. Various
researchers study both issues under a different scenario, but very few consider them …
researchers study both issues under a different scenario, but very few consider them …
Data migration challenges: The impact of data quality—Case study of University Putra Malaysia UPM
IF Zamzami, HAA Fatani… - … on Research and …, 2011 - ieeexplore.ieee.org
Data migration is a critical process that directly influenced the quality of data management.
Data migration had affected on the quality of the data, such as, accuracy, data elements, and …
Data migration had affected on the quality of the data, such as, accuracy, data elements, and …