A review of the stability of feature selection techniques for bioinformatics data

W Awada, TM Khoshgoftaar, D Dittman… - 2012 IEEE 13th …, 2012 - ieeexplore.ieee.org
Feature selection is an important step in data mining and is used in various domains
including genetics, medicine, and bioinformatics. Choosing the important features (genes) is …

Exploiting the ensemble paradigm for stable feature selection: a case study on high-dimensional genomic data

B Pes, N Dessì, M Angioni - Information fusion, 2017 - Elsevier
Ensemble classification is a well-established approach that involves fusing the decisions of
multiple predictive models. A similar “ensemble logic” has been recently applied to …

An extensive comparison of feature ranking aggregation techniques in bioinformatics

R Wald, TM Khoshgoftaar, D Dittman… - 2012 IEEE 13th …, 2012 - ieeexplore.ieee.org
Univariate feature rankers have been frequently used to order genes (features) in terms of
their importance to a given bioinformatics challenge. Unfortunately, the resulting feature …

First order statistics based feature selection: A diverse and powerful family of feature seleciton techniques

T Khoshgoftaar, D Dittman, R Wald… - … on Machine Learning …, 2012 - ieeexplore.ieee.org
Dimensionality reduction techniques have become a required step when working with
bioinformatics datasets. Techniques such as feature selection have been known to not only …

Impact of noise and data sampling on stability of feature ranking techniques for biological datasets

AA Shanab, TM Khoshgoftaar, R Wald… - 2012 IEEE 13th …, 2012 - ieeexplore.ieee.org
Feature selection is an important preprocessing step when learning from bioinformatics
datasets. Since these datasets often have high dimensionality (a large number of features) …

A Multicriteria Approach to Find Predictive and Sparse Models with Stable Feature Selection for High‐Dimensional Data

A Bommert, J Rahnenführer… - … mathematical methods in …, 2017 - Wiley Online Library
Finding a good predictive model for a high‐dimensional data set can be challenging. For
genetic data, it is not only important to find a model with high predictive accuracy, but it is …

Mean aggregation versus robust rank aggregation for ensemble gene selection

R Wald, TM Khoshgoftaar… - 2012 11th international …, 2012 - ieeexplore.ieee.org
Feature (gene) selection is an important preprocessing step for performing data mining on
large-scale bioinformatics datasets. However, one known concern is that feature selection …

Tracking vigilance fluctuations in real-time: a sliding-window heart rate variability-based machine-learning approach

T Xie, N Ma - Sleep, 2024 - academic.oup.com
Abstract Study Objectives Heart rate variability (HRV)-based machine learning models hold
promise for real-world vigilance evaluation, yet their real-time applicability is limited by …

Selecting the appropriate data sampling approach for imbalanced and high-dimensional bioinformatics datasets

DJ Dittman, TM Khoshgoftaar… - 2014 IEEE International …, 2014 - ieeexplore.ieee.org
One of the more prevalent problems when working with bioinformatics datasets is class
imbalance, when there are more instances in one class compared to the other class (es) …

Comparing two new gene selection ensemble approaches with the commonly-used approach

DJ Dittman, TM Khoshgoftaar, R Wald… - … on Machine Learning …, 2012 - ieeexplore.ieee.org
Ensemble feature selection has recently become a topic of interest for researchers,
especially in the area of bioinformatics. The benefits of ensemble feature selection include …