Reflections on univariate and multivariate analysis of metabolomics data

E Saccenti, HCJ Hoefsloot, AK Smilde, JA Westerhuis… - Metabolomics, 2014 - Springer
Metabolomics experiments usually result in a large quantity of data. Univariate and
multivariate analysis techniques are routinely used to extract relevant information from the …

Novel ensemble machine learning models in flood susceptibility mapping

P Prasad, VJ Loveson, B Das, M Kotha - Geocarto International, 2022 - Taylor & Francis
The research aims to propose the new ensemble models by combining the machine
learning techniques, such as rotation forest (RF), nearest shrunken centroids (NSC), k …

A cancer biologist's primer on machine learning applications in high‐dimensional cytometry

TJ Keyes, P Domizi, YC Lo, GP Nolan… - Cytometry Part …, 2020 - Wiley Online Library
The application of machine learning and artificial intelligence to high‐dimensional cytometry
data sets has increasingly become a staple of bioinformatic data analysis over the past …

A direct approach to sparse discriminant analysis in ultra-high dimensions

Q Mai, H Zou, M Yuan - Biometrika, 2012 - academic.oup.com
Sparse discriminant methods based on independence rules, such as the nearest shrunken
centroids classifier (Tibshirani et al., 2002) and features annealed independence rules (Fan …

A road to classification in high dimensional space: the regularized optimal affine discriminant

J Fan, Y Feng, X Tong - Journal of the Royal Statistical Society …, 2012 - academic.oup.com
For high dimensional classification, it is well known that naively performing the Fisher
discriminant rule leads to poor results due to diverging spectra and accumulation of noise …

Sensitivity and specificity based multiobjective approach for feature selection: Application to cancer diagnosis

J García-Nieto, E Alba, L Jourdan, E Talbi - Information Processing Letters, 2009 - Elsevier
The study of the sensitivity and the specificity of a classification test constitute a powerful
kind of analysis since it provides specialists with very detailed information useful for cancer …

A high performance centroid-based classification approach for language identification

H Takçı, T Güngör - Pattern Recognition Letters, 2012 - Elsevier
Centroid-based classification is a machine learning approach used in the text classification
domain. The main advantage of centroid-based classifiers is their high performance during …

[PDF][PDF] CODA: High dimensional copula discriminant analysis

F Han, T Zhao, H Liu - Journal of Machine Learning Research, 2013 - jmlr.org
We propose a high dimensional classification method, named the Copula Discriminant
Analysis (CODA). The CODA generalizes the normal-based linear discriminant analysis to …

Incorporating prior knowledge of gene functional groups into regularized discriminant analysis of microarray data

F Tai, W Pan - Bioinformatics, 2007 - academic.oup.com
Motivation: Discriminant analysis for high-dimensional and low-sample-sized data has
become a hot research topic in bioinformatics, mainly motivated by its importance and …

Nonnegative principal component analysis for cancer molecular pattern discovery

X Han - IEEE/ACM Transactions on Computational Biology and …, 2009 - ieeexplore.ieee.org
As a well-established feature selection algorithm, principal component analysis (PCA) is
often combined with the state-of-the-art classification algorithms to identify cancer molecular …