[HTML][HTML] Random forests for genomic data analysis

X Chen, H Ishwaran - Genomics, 2012 - Elsevier
Random forests (RF) is a popular tree-based ensemble machine learning tool that is highly
data adaptive, applies to “large p, small n” problems, and is able to account for correlation as …

A review of ensemble methods in bioinformatics

P Yang, Y Hwa Yang, BB Zhou… - Current …, 2010 - ingentaconnect.com
Ensemble learning is an intensively studied technique in machine learning and pattern
recognition. Recent work in computational biology has seen an increasing use of ensemble …

MRMD2. 0: a python tool for machine learning with feature ranking and reduction

S He, F Guo, Q Zou - Current Bioinformatics, 2020 - ingentaconnect.com
Aims: The study aims to find a way to reduce the dimensionality of the dataset. Background:
Dimensionality reduction is the key issue of the machine learning process. It does not only …

Robustness and applicability of transcription factor and pathway analysis tools on single-cell RNA-seq data

CH Holland, J Tanevski, J Perales-Patón, J Gleixner… - Genome biology, 2020 - Springer
Background Many functional analysis tools have been developed to extract functional and
mechanistic insight from bulk transcriptome data. With the advent of single-cell RNA …

Data mining in the Life Sciences with Random Forest: a walk in the park or lost in the jungle?

WG Touw, JR Bayjanov, L Overmars… - Briefings in …, 2013 - academic.oup.com
Abstract In the Life Sciences 'omics' data is increasingly generated by different high-
throughput technologies. Often only the integration of these data allows uncovering …

Metabolism dysregulation induces a specific lipid signature of nonalcoholic steatohepatitis in patients

F Chiappini, A Coilly, H Kadar, P Gual, A Tran… - Scientific reports, 2017 - nature.com
Nonalcoholic steatohepatitis (NASH) is a condition which can progress to cirrhosis and
hepatocellular carcinoma. Markers for NASH diagnosis are still lacking. We performed a …

Genomic prediction of breeding values using a subset of SNPs identified by three machine learning methods

B Li, N Zhang, YG Wang, AW George, A Reverter… - Frontiers in …, 2018 - frontiersin.org
The analysis of large genomic data is hampered by issues such as a small number of
observations and a large number of predictive variables (commonly known as “large P small …

A comparison of random forest regression and multiple linear regression for prediction in neuroscience

PF Smith, S Ganesh, P Liu - Journal of neuroscience methods, 2013 - Elsevier
Background Regression is a common statistical tool for prediction in neuroscience.
However, linear regression is by far the most common form of regression used, with …

Signalling pathway for RKIP and Let‐7 regulates and predicts metastatic breast cancer

J Yun, CA Frankenberger, WL Kuo, MC Boelens… - The EMBO …, 2011 - embopress.org
Tumour metastasis suppressors are inhibitors of metastasis but their mechanisms of action
are generally not understood. We previously showed that the suppressor Raf kinase …

[HTML][HTML] Eye movements reveal epistemic curiosity in human observers

A Baranes, PY Oudeyer, J Gottlieb - Vision research, 2015 - Elsevier
Saccadic (rapid) eye movements are primary means by which humans and non-human
primates sample visual information. However, while saccadic decisions are intensively …