Tighter lower bounds for shuffling SGD: Random permutations and beyond

J Cha, J Lee, C Yun - International Conference on Machine …, 2023 - proceedings.mlr.press
We study convergence lower bounds of without-replacement stochastic gradient descent
(SGD) for solving smooth (strongly-) convex finite-sum minimization problems. Unlike most …

Smoothed analysis with adaptive adversaries

N Haghtalab, T Roughgarden, A Shetty - Journal of the ACM, 2024 - dl.acm.org
We prove novel algorithmic guarantees for several online problems in the smoothed
analysis model. In this model, at each time step an adversary chooses an input distribution …

Grab: Finding provably better data permutations than random reshuffling

Y Lu, W Guo, CM De Sa - Advances in Neural Information …, 2022 - proceedings.neurips.cc
Random reshuffling, which randomly permutes the dataset each epoch, is widely adopted in
model training because it yields faster convergence than with-replacement sampling …

Algorithms and barriers in the symmetric binary perceptron model

D Gamarnik, EC Kızıldağ, W Perkins… - 2022 IEEE 63rd Annual …, 2022 - ieeexplore.ieee.org
The binary (or Ising) perceptron is a toy model of a single-layer neural network and can be
viewed as a random constraint satisfaction problem with a high degree of connectivity. The …

Kernel thinning

R Dwivedi, L Mackey - arXiv preprint arXiv:2105.05842, 2021 - arxiv.org
We introduce kernel thinning, a new procedure for compressing a distribution $\mathbb {P} $
more effectively than iid sampling or standard thinning. Given a suitable reproducing kernel …

Sharp threshold sequence and universality for ising perceptron models

S Nakajima, N Sun - Proceedings of the 2023 Annual ACM-SIAM …, 2023 - SIAM
We study a family of Ising perceptron models with {0, 1}-valued activation functions. This
includes the classical half-space models, as well as some of the symmetric models …

Algorithmic pure states for the negative spherical perceptron

A El Alaoui, M Sellke - Journal of Statistical Physics, 2022 - Springer
We consider the spherical perceptron with Gaussian disorder. This is the set S of points σ∈
RN on the sphere of radius N satisfying⟨ ga, σ⟩≥ κ N for all 1≤ a≤ M, where (ga) a= 1 M …

Online discrepancy minimization for stochastic arrivals

N Bansal, H Jiang, R Meka, S Singla, M Sinha - Proceedings of the 2021 ACM …, 2021 - SIAM
In the stochastic online vector balancing problem, vectors v 1, v 2,…, vT chosen
independently from an arbitrary distribution in ℝ n arrive one-by-one and must be …

Linear-sized sparsifiers via near-linear time discrepancy theory

A Jambulapati, V Reis, K Tian - Proceedings of the 2024 Annual ACM-SIAM …, 2024 - SIAM
Discrepancy theory has provided powerful tools for producing higher-quality objects which
“beat the union bound” in fundamental settings throughout combinatorics and computer …

Discrepancy algorithms for the binary perceptron

S Li, T Schramm, K Zhou - arXiv preprint arXiv:2408.00796, 2024 - arxiv.org
The binary perceptron problem asks us to find a sign vector in the intersection of
independently chosen random halfspaces with intercept $-\kappa $. We analyze the …