Tighter lower bounds for shuffling SGD: Random permutations and beyond
We study convergence lower bounds of without-replacement stochastic gradient descent
(SGD) for solving smooth (strongly-) convex finite-sum minimization problems. Unlike most …
(SGD) for solving smooth (strongly-) convex finite-sum minimization problems. Unlike most …
Smoothed analysis with adaptive adversaries
We prove novel algorithmic guarantees for several online problems in the smoothed
analysis model. In this model, at each time step an adversary chooses an input distribution …
analysis model. In this model, at each time step an adversary chooses an input distribution …
Grab: Finding provably better data permutations than random reshuffling
Random reshuffling, which randomly permutes the dataset each epoch, is widely adopted in
model training because it yields faster convergence than with-replacement sampling …
model training because it yields faster convergence than with-replacement sampling …
Algorithms and barriers in the symmetric binary perceptron model
The binary (or Ising) perceptron is a toy model of a single-layer neural network and can be
viewed as a random constraint satisfaction problem with a high degree of connectivity. The …
viewed as a random constraint satisfaction problem with a high degree of connectivity. The …
Sharp threshold sequence and universality for ising perceptron models
S Nakajima, N Sun - Proceedings of the 2023 Annual ACM-SIAM …, 2023 - SIAM
We study a family of Ising perceptron models with {0, 1}-valued activation functions. This
includes the classical half-space models, as well as some of the symmetric models …
includes the classical half-space models, as well as some of the symmetric models …
Algorithmic pure states for the negative spherical perceptron
A El Alaoui, M Sellke - Journal of Statistical Physics, 2022 - Springer
We consider the spherical perceptron with Gaussian disorder. This is the set S of points σ∈
RN on the sphere of radius N satisfying⟨ ga, σ⟩≥ κ N for all 1≤ a≤ M, where (ga) a= 1 M …
RN on the sphere of radius N satisfying⟨ ga, σ⟩≥ κ N for all 1≤ a≤ M, where (ga) a= 1 M …
Online discrepancy minimization for stochastic arrivals
In the stochastic online vector balancing problem, vectors v 1, v 2,…, vT chosen
independently from an arbitrary distribution in ℝ n arrive one-by-one and must be …
independently from an arbitrary distribution in ℝ n arrive one-by-one and must be …
Linear-sized sparsifiers via near-linear time discrepancy theory
Discrepancy theory has provided powerful tools for producing higher-quality objects which
“beat the union bound” in fundamental settings throughout combinatorics and computer …
“beat the union bound” in fundamental settings throughout combinatorics and computer …
Discrepancy algorithms for the binary perceptron
The binary perceptron problem asks us to find a sign vector in the intersection of
independently chosen random halfspaces with intercept $-\kappa $. We analyze the …
independently chosen random halfspaces with intercept $-\kappa $. We analyze the …