Reflections on univariate and multivariate analysis of metabolomics data
Metabolomics experiments usually result in a large quantity of data. Univariate and
multivariate analysis techniques are routinely used to extract relevant information from the …
multivariate analysis techniques are routinely used to extract relevant information from the …
Novel ensemble machine learning models in flood susceptibility mapping
The research aims to propose the new ensemble models by combining the machine
learning techniques, such as rotation forest (RF), nearest shrunken centroids (NSC), k …
learning techniques, such as rotation forest (RF), nearest shrunken centroids (NSC), k …
A cancer biologist's primer on machine learning applications in high‐dimensional cytometry
The application of machine learning and artificial intelligence to high‐dimensional cytometry
data sets has increasingly become a staple of bioinformatic data analysis over the past …
data sets has increasingly become a staple of bioinformatic data analysis over the past …
A direct approach to sparse discriminant analysis in ultra-high dimensions
Sparse discriminant methods based on independence rules, such as the nearest shrunken
centroids classifier (Tibshirani et al., 2002) and features annealed independence rules (Fan …
centroids classifier (Tibshirani et al., 2002) and features annealed independence rules (Fan …
A road to classification in high dimensional space: the regularized optimal affine discriminant
For high dimensional classification, it is well known that naively performing the Fisher
discriminant rule leads to poor results due to diverging spectra and accumulation of noise …
discriminant rule leads to poor results due to diverging spectra and accumulation of noise …
Sensitivity and specificity based multiobjective approach for feature selection: Application to cancer diagnosis
The study of the sensitivity and the specificity of a classification test constitute a powerful
kind of analysis since it provides specialists with very detailed information useful for cancer …
kind of analysis since it provides specialists with very detailed information useful for cancer …
A high performance centroid-based classification approach for language identification
Centroid-based classification is a machine learning approach used in the text classification
domain. The main advantage of centroid-based classifiers is their high performance during …
domain. The main advantage of centroid-based classifiers is their high performance during …
[PDF][PDF] CODA: High dimensional copula discriminant analysis
We propose a high dimensional classification method, named the Copula Discriminant
Analysis (CODA). The CODA generalizes the normal-based linear discriminant analysis to …
Analysis (CODA). The CODA generalizes the normal-based linear discriminant analysis to …
Incorporating prior knowledge of gene functional groups into regularized discriminant analysis of microarray data
F Tai, W Pan - Bioinformatics, 2007 - academic.oup.com
Motivation: Discriminant analysis for high-dimensional and low-sample-sized data has
become a hot research topic in bioinformatics, mainly motivated by its importance and …
become a hot research topic in bioinformatics, mainly motivated by its importance and …
Nonnegative principal component analysis for cancer molecular pattern discovery
X Han - IEEE/ACM Transactions on Computational Biology and …, 2009 - ieeexplore.ieee.org
As a well-established feature selection algorithm, principal component analysis (PCA) is
often combined with the state-of-the-art classification algorithms to identify cancer molecular …
often combined with the state-of-the-art classification algorithms to identify cancer molecular …