Challenges of big data analysis

J Fan, F Han, H Liu - National science review, 2014 - academic.oup.com
Big Data bring new opportunities to modern society and challenges to data scientists. On the
one hand, Big Data hold great promises for discovering subtle population patterns and …

[HTML][HTML] A selective overview of variable selection in high dimensional feature space

J Fan, J Lv - Statistica Sinica, 2010 - ncbi.nlm.nih.gov
High dimensional statistical problems arise from diverse fields of scientific research and
technological development. Variable selection plays a pivotal role in contemporary statistical …

[图书][B] Statistical foundations of data science

J Fan, R Li, CH Zhang, H Zou - 2020 - taylorfrancis.com
Statistical Foundations of Data Science gives a thorough introduction to commonly used
statistical models, contemporary statistical machine learning techniques and algorithms …

Feature screening via distance correlation learning

R Li, W Zhong, L Zhu - Journal of the American Statistical …, 2012 - Taylor & Francis
This article is concerned with screening features in ultrahigh-dimensional data analysis,
which has become increasingly important in diverse scientific fields. We develop a sure …

Model-free feature screening for ultrahigh-dimensional data

LP Zhu, L Li, R Li, LX Zhu - Journal of the American Statistical …, 2011 - Taylor & Francis
With the recent explosion of scientific data of unprecedented size and complexity, feature
ranking and screening are playing an increasingly important role in many scientific studies …

[HTML][HTML] Distributed testing and estimation under sparse high dimensional models

H Battey, J Fan, H Liu, J Lu, Z Zhu - Annals of statistics, 2018 - ncbi.nlm.nih.gov
This paper studies hypothesis testing and parameter estimation in the context of the divide-
and-conquer algorithm. In a unified likelihood based framework, we propose new test …

Nonparametric independence screening in sparse ultra-high-dimensional additive models

J Fan, Y Feng, R Song - Journal of the American Statistical …, 2011 - Taylor & Francis
A variable screening procedure via correlation learning was proposed by Fan and Lv (2008)
to reduce dimensionality in sparse ultra-high-dimensional models. Even when the true …

Tuning parameter selection in high dimensional penalized likelihood

Y Fan, CY Tang - Journal of the Royal Statistical Society Series …, 2013 - academic.oup.com
Determining how to select the tuning parameter appropriately is essential in penalized
likelihood methods for high dimensional data analysis. We examine this problem in the …

[PDF][PDF] Ultrahigh dimensional feature selection: beyond the linear model

J Fan, R Samworth, Y Wu - The Journal of Machine Learning Research, 2009 - jmlr.org
Variable selection in high-dimensional space characterizes many contemporary problems in
scientific discovery and decision making. Many frequently-used techniques are based on …

[HTML][HTML] Strong oracle optimality of folded concave penalized estimation

J Fan, L Xue, H Zou - Annals of statistics, 2014 - ncbi.nlm.nih.gov
Folded concave penalization methods have been shown to enjoy the strong oracle property
for high-dimensional sparse estimation. However, a folded concave penalization problem …