A framework for bilevel optimization that enables stochastic and global variance reduction algorithms
Bilevel optimization, the problem of minimizing a value function which involves the arg-
minimum of another function, appears in many areas of machine learning. In a large scale …
minimum of another function, appears in many areas of machine learning. In a large scale …
Optimal algorithms for stochastic bilevel optimization under relaxed smoothness conditions
We consider stochastic bilevel optimization problems involving minimizing an upper-level
($\texttt {UL} $) function that is dependent on the arg-min of a strongly-convex lower-level …
($\texttt {UL} $) function that is dependent on the arg-min of a strongly-convex lower-level …
Beyond l1: Faster and better sparse models with skglm
Q Bertrand, Q Klopfenstein… - Advances in …, 2022 - proceedings.neurips.cc
We propose a new fast algorithm to estimate any sparse generalized linear model with
convex or non-convex separable penalties. Our algorithm is able to solve problems with …
convex or non-convex separable penalties. Our algorithm is able to solve problems with …
ReHLine: regularized composite ReLU-ReHU loss minimization with linear computation and linear convergence
B Dai, Y Qiu - Advances in Neural Information Processing …, 2024 - proceedings.neurips.cc
Empirical risk minimization (ERM) is a crucial framework that offers a general approach to
handling a broad range of machine learning tasks. In this paper, we propose a novel …
handling a broad range of machine learning tasks. In this paper, we propose a novel …
Experimenting with normalization layers in federated learning on non-iid scenarios
B Casella, R Esposito, A Sciarappa, C Cavazzoni… - IEEE …, 2024 - ieeexplore.ieee.org
Training Deep Learning (DL) models require large, high-quality datasets, often assembled
with data from different institutions. Federated Learning (FL) has been emerging as a …
with data from different institutions. Federated Learning (FL) has been emerging as a …
Longevity of Artifacts in Leading Parallel and Distributed Systems Conferences: a Review of the State of the Practice in 2023
Q Guilloteau, F Ciorba, M Poquet, D Goepp… - Proceedings of the 2nd …, 2024 - dl.acm.org
Reproducibility is the cornerstone of science. Many scientific communities have been struck
by the reproducibility crisis, and computer science is no exception. Its answer has been to …
by the reproducibility crisis, and computer science is no exception. Its answer has been to …
Benchmarking the Operation of Quantum Heuristics and Ising Machines: Scoring Parameter Setting Strategies on Optimization Applications
We discuss guidelines for evaluating the performance of parameterized stochastic solvers
for optimization problems, with particular attention to systems that employ novel hardware …
for optimization problems, with particular attention to systems that employ novel hardware …
Convolutional Deep Kernel Machines
E Milsom, B Anson, L Aitchison - arXiv preprint arXiv:2309.09814, 2023 - arxiv.org
Deep kernel machines (DKMs) are a recently introduced kernel method with the flexibility of
other deep models including deep NNs and deep Gaussian processes. DKMs work purely …
other deep models including deep NNs and deep Gaussian processes. DKMs work purely …
A lower bound and a near-optimal algorithm for bilevel empirical risk minimization
Bilevel optimization problems, which are problems where two optimization problems are
nested, have more and more applications in machine learning. In many practical cases, the …
nested, have more and more applications in machine learning. In many practical cases, the …
Okridge: Scalable optimal k-sparse ridge regression
We consider an important problem in scientific discovery, namely identifying sparse
governing equations for nonlinear dynamical systems. This involves solving sparse ridge …
governing equations for nonlinear dynamical systems. This involves solving sparse ridge …