Data discretization: taxonomy and big data challenge

S Ramírez‐Gallego, S García… - … : Data Mining and …, 2016 - Wiley Online Library
Discretization of numerical data is one of the most influential data preprocessing tasks in
knowledge discovery and data mining. The purpose of attribute discretization is to find …

[图书][B] Multiple attribute decision making: methods and applications

GH Tzeng, JJ Huang - 2011 - books.google.com
Decision makers are often faced with several conflicting alternatives. How do they evaluate
trade-offs when there are more than three criteria? To help people make optimal decisions …

Tutorial on practical tips of the most influential data preprocessing algorithms in data mining

S García, J Luengo, F Herrera - Knowledge-Based Systems, 2016 - Elsevier
Data preprocessing is a major and essential stage whose main goal is to obtain final data
sets that can be considered correct and useful for further data mining algorithms. This paper …

A survey of discretization techniques: Taxonomy and empirical analysis in supervised learning

S Garcia, J Luengo, JA Sáez, V Lopez… - IEEE transactions on …, 2012 - ieeexplore.ieee.org
Discretization is an essential preprocessing technique used in many knowledge discovery
and data mining tasks. Its main goal is to transform a set of continuous attributes into discrete …

Consistency measures for feature selection

A Arauzo-Azofra, JM Benitez, JL Castro - Journal of Intelligent Information …, 2008 - Springer
The use of feature selection can improve accuracy, efficiency, applicability and
understandability of a learning process. For this reason, many methods of automatic feature …

A new representation in PSO for discretization-based feature selection

B Tran, B Xue, M Zhang - IEEE Transactions on Cybernetics, 2017 - ieeexplore.ieee.org
In machine learning, discretization and feature selection (FS) are important techniques for
preprocessing data to improve the performance of an algorithm on high-dimensional data …

Machine learning for knowledge transfer across multiple metals additive manufacturing printers

S Liu, AP Stebner, BB Kappes, X Zhang - Additive Manufacturing, 2021 - Elsevier
Adopting new metals 3D printers introduces time and cost obstacles to printing parts with the
same quality as was attained on existing printers. A large number of trial-and-error …

Over-sampling algorithm for imbalanced data classification

XU Xiaolong, C Wen, SUN Yanfei - Journal of Systems …, 2019 - ieeexplore.ieee.org
For imbalanced datasets, the focus of classification is to identify samples of the minority
class. The performance of current data mining algorithms is not good enough for processing …

A discretization algorithm based on class-attribute contingency coefficient

CJ Tsai, CI Lee, WP Yang - Information Sciences, 2008 - Elsevier
Discretization algorithms have played an important role in data mining and knowledge
discovery. They not only produce a concise summarization of continuous attributes to help …

An extended chi2 algorithm for discretization of real value attributes

CT Su, JH Hsu - IEEE transactions on knowledge and data …, 2005 - ieeexplore.ieee.org
The variable precision rough sets (VPRS) model is a powerful tool for data mining, as it has
been widely applied to acquire knowledge. Despite its diverse applications in many …