A framework for bilevel optimization that enables stochastic and global variance reduction algorithms

M Dagréou, P Ablin, S Vaiter… - Advances in Neural …, 2022 - proceedings.neurips.cc
Bilevel optimization, the problem of minimizing a value function which involves the arg-
minimum of another function, appears in many areas of machine learning. In a large scale …

Optimal algorithms for stochastic bilevel optimization under relaxed smoothness conditions

X Chen, T Xiao, K Balasubramanian - Journal of Machine Learning …, 2024 - jmlr.org
We consider stochastic bilevel optimization problems involving minimizing an upper-level
($\texttt {UL} $) function that is dependent on the arg-min of a strongly-convex lower-level …

Beyond l1: Faster and better sparse models with skglm

Q Bertrand, Q Klopfenstein… - Advances in …, 2022 - proceedings.neurips.cc
We propose a new fast algorithm to estimate any sparse generalized linear model with
convex or non-convex separable penalties. Our algorithm is able to solve problems with …

ReHLine: regularized composite ReLU-ReHU loss minimization with linear computation and linear convergence

B Dai, Y Qiu - Advances in Neural Information Processing …, 2024 - proceedings.neurips.cc
Empirical risk minimization (ERM) is a crucial framework that offers a general approach to
handling a broad range of machine learning tasks. In this paper, we propose a novel …

Experimenting with normalization layers in federated learning on non-iid scenarios

B Casella, R Esposito, A Sciarappa, C Cavazzoni… - IEEE …, 2024 - ieeexplore.ieee.org
Training Deep Learning (DL) models require large, high-quality datasets, often assembled
with data from different institutions. Federated Learning (FL) has been emerging as a …

Longevity of Artifacts in Leading Parallel and Distributed Systems Conferences: a Review of the State of the Practice in 2023

Q Guilloteau, F Ciorba, M Poquet, D Goepp… - Proceedings of the 2nd …, 2024 - dl.acm.org
Reproducibility is the cornerstone of science. Many scientific communities have been struck
by the reproducibility crisis, and computer science is no exception. Its answer has been to …

Benchmarking the Operation of Quantum Heuristics and Ising Machines: Scoring Parameter Setting Strategies on Optimization Applications

DEB Neira, R Brown, P Sathe, F Wudarski… - arXiv preprint arXiv …, 2024 - arxiv.org
We discuss guidelines for evaluating the performance of parameterized stochastic solvers
for optimization problems, with particular attention to systems that employ novel hardware …

Convolutional Deep Kernel Machines

E Milsom, B Anson, L Aitchison - arXiv preprint arXiv:2309.09814, 2023 - arxiv.org
Deep kernel machines (DKMs) are a recently introduced kernel method with the flexibility of
other deep models including deep NNs and deep Gaussian processes. DKMs work purely …

A lower bound and a near-optimal algorithm for bilevel empirical risk minimization

M Dagréou, T Moreau, S Vaiter… - … Conference on Artificial …, 2024 - proceedings.mlr.press
Bilevel optimization problems, which are problems where two optimization problems are
nested, have more and more applications in machine learning. In many practical cases, the …

Okridge: Scalable optimal k-sparse ridge regression

J Liu, S Rosen, C Zhong… - Advances in neural …, 2024 - proceedings.neurips.cc
We consider an important problem in scientific discovery, namely identifying sparse
governing equations for nonlinear dynamical systems. This involves solving sparse ridge …