Challenges of big data analysis
Big Data bring new opportunities to modern society and challenges to data scientists. On the
one hand, Big Data hold great promises for discovering subtle population patterns and …
one hand, Big Data hold great promises for discovering subtle population patterns and …
[HTML][HTML] A selective overview of variable selection in high dimensional feature space
High dimensional statistical problems arise from diverse fields of scientific research and
technological development. Variable selection plays a pivotal role in contemporary statistical …
technological development. Variable selection plays a pivotal role in contemporary statistical …
[图书][B] Statistical foundations of data science
Statistical Foundations of Data Science gives a thorough introduction to commonly used
statistical models, contemporary statistical machine learning techniques and algorithms …
statistical models, contemporary statistical machine learning techniques and algorithms …
Feature screening via distance correlation learning
This article is concerned with screening features in ultrahigh-dimensional data analysis,
which has become increasingly important in diverse scientific fields. We develop a sure …
which has become increasingly important in diverse scientific fields. We develop a sure …
Model-free feature screening for ultrahigh-dimensional data
With the recent explosion of scientific data of unprecedented size and complexity, feature
ranking and screening are playing an increasingly important role in many scientific studies …
ranking and screening are playing an increasingly important role in many scientific studies …
[HTML][HTML] Distributed testing and estimation under sparse high dimensional models
This paper studies hypothesis testing and parameter estimation in the context of the divide-
and-conquer algorithm. In a unified likelihood based framework, we propose new test …
and-conquer algorithm. In a unified likelihood based framework, we propose new test …
Nonparametric independence screening in sparse ultra-high-dimensional additive models
A variable screening procedure via correlation learning was proposed by Fan and Lv (2008)
to reduce dimensionality in sparse ultra-high-dimensional models. Even when the true …
to reduce dimensionality in sparse ultra-high-dimensional models. Even when the true …
Tuning parameter selection in high dimensional penalized likelihood
Determining how to select the tuning parameter appropriately is essential in penalized
likelihood methods for high dimensional data analysis. We examine this problem in the …
likelihood methods for high dimensional data analysis. We examine this problem in the …
[PDF][PDF] Ultrahigh dimensional feature selection: beyond the linear model
J Fan, R Samworth, Y Wu - The Journal of Machine Learning Research, 2009 - jmlr.org
Variable selection in high-dimensional space characterizes many contemporary problems in
scientific discovery and decision making. Many frequently-used techniques are based on …
scientific discovery and decision making. Many frequently-used techniques are based on …
[HTML][HTML] Strong oracle optimality of folded concave penalized estimation
Folded concave penalization methods have been shown to enjoy the strong oracle property
for high-dimensional sparse estimation. However, a folded concave penalization problem …
for high-dimensional sparse estimation. However, a folded concave penalization problem …