A review of distributed statistical inference

Y Gao, W Liu, H Wang, X Wang, Y Yan… - Statistical Theory and …, 2022 - Taylor & Francis
The rapid emergence of massive datasets in various fields poses a serious challenge to
traditional statistical methods. Meanwhile, it provides opportunities for researchers to …

Distributed Computing and Inference for Big Data

L Zhou, Z Gong, P Xiang - Annual Review of Statistics and Its …, 2023 - annualreviews.org
Data are distributed across different sites due to computing facility limitations or data privacy
considerations. Conventional centralized methods—those in which all datasets are stored …

Efficient distributed learning with sparsity

J Wang, M Kolar, N Srebro… - … conference on machine …, 2017 - proceedings.mlr.press
We propose a novel, efficient approach for distributed sparse learning with observations
randomly partitioned across machines. In each round of the proposed method, worker …

Distributed inference for linear support vector machine

X Wang, Z Yang, X Chen, W Liu - Journal of machine learning research, 2019 - jmlr.org
The growing size of modern data brings many new challenges to existing statistical
inference methodologies and theories, and calls for the development of distributed …

Robust distributed modal regression for massive data

K Wang, S Li - Computational Statistics & Data Analysis, 2021 - Elsevier
Modal regression is a good alternative of the mean regression and likelihood based
methods, because of its robustness and high efficiency. A robust communication-efficient …

Learning competing risks across multiple hospitals: one-shot distributed algorithms

D Zhang, J Tong, N Jing, Y Yang, C Luo… - Journal of the …, 2024 - academic.oup.com
Objectives To characterize the complex interplay between multiple clinical conditions in a
time-to-event analysis framework using data from multiple hospitals, we developed two …

First-order newton-type estimator for distributed estimation and inference

X Chen, W Liu, Y Zhang - Journal of the American Statistical …, 2022 - Taylor & Francis
This article studies distributed estimation and inference for a general statistical problem with
a convex loss that could be nondifferentiable. For the purpose of efficient computation, we …

Distributed kernel ridge regression with communications

SB Lin, D Wang, DX Zhou - Journal of Machine Learning Research, 2020 - jmlr.org
This paper focuses on generalization performance analysis for distributed algorithms in the
framework of learning theory. Taking distributed kernel ridge regression (DKRR) for …

Subsampling and jackknifing: a practically convenient solution for large data analysis with limited computational resources

S Wu, X Zhu, H Wang - arXiv preprint arXiv:2304.06231, 2023 - arxiv.org
Modern statistical analysis often encounters datasets with large sizes. For these datasets,
conventional estimation methods can hardly be used immediately because practitioners …

Federated adaptive causal estimation (face) of target treatment effects

L Han, J Hou, K Cho, R Duan, T Cai - arXiv preprint arXiv:2112.09313, 2021 - arxiv.org
Federated learning of causal estimands may greatly improve estimation efficiency by
leveraging data from multiple study sites, but robustness to heterogeneity and model …