A comparative study of logistic model tree, random forest, and classification and regression tree models for spatial prediction of landslide susceptibility

W Chen, X Xie, J Wang, B Pradhan, H Hong, DT Bui… - Catena, 2017 - Elsevier
The main purpose of the present study is to use three state-of-the-art data mining
techniques, namely, logistic model tree (LMT), random forest (RF), and classification and …

A feature selection model based on genetic rank aggregation for text sentiment classification

A Onan, S Korukoğlu - Journal of Information Science, 2017 - journals.sagepub.com
Sentiment analysis is an important research direction of natural language processing, text
mining and web mining which aims to extract subjective information in source materials. The …

A k-mean clustering algorithm for mixed numeric and categorical data

A Ahmad, L Dey - Data & Knowledge Engineering, 2007 - Elsevier
Use of traditional k-mean type algorithm is limited to numeric data. This paper presents a
clustering algorithm based on k-mean paradigm that works well for data with mixed numeric …

Clear cell renal cell carcinoma: machine learning-based quantitative computed tomography texture analysis for prediction of fuhrman nuclear grade

CT Bektas, B Kocak, AH Yardimci, MH Turkcanoglu… - European …, 2019 - Springer
Objective To evaluate the performance of quantitative computed tomography (CT) texture
analysis using different machine learning (ML) classifiers for discriminating low and high …

A methodology for energy multivariate time series forecasting in smart buildings based on feature selection

A Gonzalez-Vidal, F Jimenez, AF Gomez-Skarmeta - Energy and Buildings, 2019 - Elsevier
The massive collection of data via emerging technologies like the Internet of Things (IoT)
requires finding optimal ways to reduce the created features that have a potential impact on …

Reducing features to improve code change-based bug prediction

S Shivaji, EJ Whitehead, R Akella… - IEEE Transactions on …, 2012 - ieeexplore.ieee.org
Machine learning classifiers have recently emerged as a way to predict the introduction of
bugs in changes made to source code files. The classifier is first trained on software history …

A hybrid genetic algorithm for feature selection wrapper based on mutual information

J Huang, Y Cai, X Xu - Pattern recognition letters, 2007 - Elsevier
In this study, a hybrid genetic algorithm is adopted to find a subset of features that are most
relevant to the classification task. Two stages of optimization are involved. The outer …

Textural differences between renal cell carcinoma subtypes: Machine learning-based quantitative computed tomography texture analysis with independent external …

B Kocak, AH Yardimci, CT Bektas… - European Journal of …, 2018 - Elsevier
Objective To develop externally validated, reproducible, and generalizable models for
distinguishing three major subtypes of renal cell carcinomas (RCCs) using machine learning …

The impact of feature selection on defect prediction performance: An empirical comparison

Z Xu, J Liu, Z Yang, G An, X Jia - 2016 IEEE 27th international …, 2016 - ieeexplore.ieee.org
Software defect prediction aims to determine whether a software module is defect-prone by
constructing prediction models. The performance of such models is susceptible to the high …

Feature selection in multimedia: the state-of-the-art review

PY Lee, WP Loh, JF Chin - Image and vision computing, 2017 - Elsevier
Multimedia data mining, particularly feature selection (FS), has been successfully applied in
recent classification and recognition works. However, only a few studies in the contemporary …