A review of distributed statistical inference
The rapid emergence of massive datasets in various fields poses a serious challenge to
traditional statistical methods. Meanwhile, it provides opportunities for researchers to …
traditional statistical methods. Meanwhile, it provides opportunities for researchers to …
Distributed Computing and Inference for Big Data
L Zhou, Z Gong, P Xiang - Annual Review of Statistics and Its …, 2023 - annualreviews.org
Data are distributed across different sites due to computing facility limitations or data privacy
considerations. Conventional centralized methods—those in which all datasets are stored …
considerations. Conventional centralized methods—those in which all datasets are stored …
Efficient distributed learning with sparsity
We propose a novel, efficient approach for distributed sparse learning with observations
randomly partitioned across machines. In each round of the proposed method, worker …
randomly partitioned across machines. In each round of the proposed method, worker …
Distributed inference for linear support vector machine
X Wang, Z Yang, X Chen, W Liu - Journal of machine learning research, 2019 - jmlr.org
The growing size of modern data brings many new challenges to existing statistical
inference methodologies and theories, and calls for the development of distributed …
inference methodologies and theories, and calls for the development of distributed …
Robust distributed modal regression for massive data
K Wang, S Li - Computational Statistics & Data Analysis, 2021 - Elsevier
Modal regression is a good alternative of the mean regression and likelihood based
methods, because of its robustness and high efficiency. A robust communication-efficient …
methods, because of its robustness and high efficiency. A robust communication-efficient …
Learning competing risks across multiple hospitals: one-shot distributed algorithms
Objectives To characterize the complex interplay between multiple clinical conditions in a
time-to-event analysis framework using data from multiple hospitals, we developed two …
time-to-event analysis framework using data from multiple hospitals, we developed two …
First-order newton-type estimator for distributed estimation and inference
This article studies distributed estimation and inference for a general statistical problem with
a convex loss that could be nondifferentiable. For the purpose of efficient computation, we …
a convex loss that could be nondifferentiable. For the purpose of efficient computation, we …
Distributed kernel ridge regression with communications
This paper focuses on generalization performance analysis for distributed algorithms in the
framework of learning theory. Taking distributed kernel ridge regression (DKRR) for …
framework of learning theory. Taking distributed kernel ridge regression (DKRR) for …
Subsampling and jackknifing: a practically convenient solution for large data analysis with limited computational resources
Modern statistical analysis often encounters datasets with large sizes. For these datasets,
conventional estimation methods can hardly be used immediately because practitioners …
conventional estimation methods can hardly be used immediately because practitioners …
Federated adaptive causal estimation (face) of target treatment effects
Federated learning of causal estimands may greatly improve estimation efficiency by
leveraging data from multiple study sites, but robustness to heterogeneity and model …
leveraging data from multiple study sites, but robustness to heterogeneity and model …