Extreme phenotype sampling improves lasso and random forest marker selection for complex traits

C John, W Muchero, S Emrich - 2020 IEEE International …, 2020 - ieeexplore.ieee.org
Most attempts to fit a supervised machine learning (ML) model in bioinformatics try to predict
the full range of trait or response values. While such prediction tasks effectively capture the …

EPS-LASSO: test for high-dimensional regression under extreme phenotype sampling of continuous traits

C Xu, J Fang, H Shen, YP Wang, HW Deng - Bioinformatics, 2018 - academic.oup.com
Motivation Extreme phenotype sampling (EPS) is a broadly-used design to identify
candidate genetic factors contributing to the variation of quantitative traits. By enriching the …

Performance Comparison of LASSO Variants with Genome-Wide Association Studies (GWAS)

N Puthiyedth, N Zhang, Z Wang… - 2021 IEEE International …, 2021 - ieeexplore.ieee.org
GWAS are popular approaches to associate genetic variations on a population of individuals
with particular traits. Despite widely applications, GWAS are heavily depend on the accuracy …

EPS: automated feature selection in case–control studies using extreme pseudo-sampling

R Shemirani, S Wenric, E Kenny, JL Ambite - Bioinformatics, 2021 - academic.oup.com
Finding informative predictive features in high-dimensional biological case–control datasets
is challenging. The Extreme Pseudo-Sampling (EPS) algorithm offers a solution to the …

[HTML][HTML] VSOLassoBag: a variable-selection oriented LASSO bagging algorithm for biomarker discovery in omic-based translational research

J Liang, C Wang, D Zhang, Y Xie, Y Zeng, T Li… - Journal of Genetics and …, 2023 - Elsevier
Screening biomolecular markers from high-dimensional biological data is one of the long-
standing tasks for biomedical translational research. With its advantages in both feature …

A Fitted Sparse-Group Lasso for Genome-Based Evaluations

J Klosa, N Simon, V Liebscher… - IEEE/ACM Transactions …, 2022 - ieeexplore.ieee.org
In life sciences, high-throughput techniques typically lead to high-dimensional data and
often the number of covariates is much larger than the number of observations. This …

A sparse regression method for group-wise feature selection with false discovery rate control

A Gossmann, S Cao, D Brzyski, LJ Zhao… - … ACM transactions on …, 2017 - ieeexplore.ieee.org
The method of Sorted L-One Penalized Estimation, or SLOPE, is a sparse regression
method recently introduced by Bogdan et. al.[1]. It can be used to identify significant …

Gene selection from biological data via group LASSO for logistic regression model: Effects of different clustering algorithms

S Chen, P Wang - 2021 40th Chinese Control Conference …, 2021 - ieeexplore.ieee.org
With the accumulation of various high-throughput biological data, an urgent task is to
develop efficient methods to explore useful bioinformatics from massive data. Various …

Comparison of statistical approaches to rare variant analysis for quantitative traits

H Chen, AE Hendricks, Y Cheng, AL Cupples… - BMC proceedings, 2011 - Springer
With recent advances in technology, deep sequencing data will be widely used to further the
understanding of genetic influence on traits of interest. Therefore not only common variants …

Combining sparse group lasso and linear mixed model improves power to detect genetic variants underlying quantitative traits

Y Guo, C Wu, M Guo, Q Zou, X Liu, A Keinan - Frontiers in genetics, 2019 - frontiersin.org
Genome-Wide association studies (GWAS), based on testing one single nucleotide
polymorphism (SNP) at a time, have revolutionized our understanding of the genetics of …