Recent advances in algorithmic high-dimensional robust statistics

I Diakonikolas, DM Kane - arXiv preprint arXiv:1911.05911, 2019 - arxiv.org
Learning in the presence of outliers is a fundamental problem in statistics. Until recently, all
known efficient unsupervised learning algorithms were very sensitive to outliers in high …

Robust aggregation for federated learning

K Pillutla, SM Kakade… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
We present a novel approach to federated learning that endows its aggregation process with
greater robustness to potential poisoning of local data or model parameters of participating …

Dataset security for machine learning: Data poisoning, backdoor attacks, and defenses

M Goldblum, D Tsipras, C Xie, X Chen… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org
As machine learning systems grow in scale, so do their training data requirements, forcing
practitioners to automate and outsource the curation of training data in order to achieve state …

Adaptive huber regression

Q Sun, WX Zhou, J Fan - Journal of the American Statistical …, 2020 - Taylor & Francis
Big data can easily be contaminated by outliers or contain variables with heavy-tailed
distributions, which makes many conventional methods inadequate. To address this …

Robust estimation via robust gradient estimation

A Prasad, AS Suggala, S Balakrishnan… - Journal of the Royal …, 2020 - academic.oup.com
We provide a new computationally efficient class of estimators for risk minimization. We
show that these estimators are robust for general statistical models, under varied robustness …

Differential privacy and robust statistics in high dimensions

X Liu, W Kong, S Oh - Conference on Learning Theory, 2022 - proceedings.mlr.press
We introduce a universal framework for characterizing the statistical efficiency of a statistical
estimation problem with differential privacy guarantees. Our framework, which we call High …

Robust multivariate mean estimation: the optimality of trimmed mean

G Lugosi, S Mendelson - 2021 - projecteuclid.org
Robust multivariate mean estimation: The optimality of trimmed mean Page 1 The Annals of
Statistics 2021, Vol. 49, No. 1, 393–410 https://doi.org/10.1214/20-AOS1961 © Institute of …

High-dimensional robust mean estimation in nearly-linear time

Y Cheng, I Diakonikolas, R Ge - Proceedings of the thirtieth annual ACM-SIAM …, 2019 - SIAM
We study the fundamental problem of high-dimensional mean estimation in a robust model
where a constant fraction of the samples are adversarially corrupted. Recent work gave the …

Selective inference for k-means clustering

YT Chen, DM Witten - Journal of Machine Learning Research, 2023 - jmlr.org
We consider the problem of testing for a difference in means between clusters of
observations identified via k-means clustering. In this setting, classical hypothesis tests lead …

Robust sub-Gaussian estimation of a mean vector in nearly linear time

J Depersin, G Lecué - The Annals of Statistics, 2022 - projecteuclid.org
We construct an algorithm for estimating the mean of a heavy-tailed random variable when
given an adversarial corrupted sample of N independent observations. The only assumption …