Dynamic regret of convex and smooth functions
We investigate online convex optimization in non-stationary environments and choose the
dynamic regret as the performance measure, defined as the difference between cumulative …
dynamic regret as the performance measure, defined as the difference between cumulative …
Adapting to online label shift with provable guarantees
The standard supervised learning paradigm works effectively when training data shares the
same distribution as the upcoming testing samples. However, this stationary assumption is …
same distribution as the upcoming testing samples. However, this stationary assumption is …
Optimal dynamic regret in exp-concave online learning
We consider the problem of the Zinkevich (2003)-style dynamic regret minimization in online
learning with\emph {exp-concave} losses. We show that whenever improper learning is …
learning with\emph {exp-concave} losses. We show that whenever improper learning is …
Adaptivity and non-stationarity: Problem-dependent dynamic regret for online convex optimization
We investigate online convex optimization in non-stationary environments and choose
dynamic regret as the performance measure, defined as the difference between cumulative …
dynamic regret as the performance measure, defined as the difference between cumulative …
Optimal dynamic regret in proper online learning with strongly convex losses and beyond
We study the framework of universal dynamic regret minimization with strongly convex
losses. We answer an open problem in Baby and Wang 2021 by showing that in a proper …
losses. We answer an open problem in Baby and Wang 2021 by showing that in a proper …
Non-stationary online learning with memory and non-stochastic control
We study the problem of Online Convex Optimization (OCO) with memory, which allows loss
functions to depend on past decisions and thus captures temporal effects of learning …
functions to depend on past decisions and thus captures temporal effects of learning …
Adapting to continuous covariate shift via online density ratio estimation
Dealing with distribution shifts is one of the central challenges for modern machine learning.
One fundamental situation is the covariate shift, where the input distributions of data change …
One fundamental situation is the covariate shift, where the input distributions of data change …
Dynamic regret of online markov decision processes
Abstract We investigate online Markov Decision Processes (MDPs) with adversarially
changing loss functions and known transitions. We choose dynamic regret as the …
changing loss functions and known transitions. We choose dynamic regret as the …
Unconstrained dynamic regret via sparse coding
Z Zhang, A Cutkosky… - Advances in Neural …, 2024 - proceedings.neurips.cc
Motivated by the challenge of nonstationarity in sequential decision making, we study Online
Convex Optimization (OCO) under the coupling of two problem structures: the domain is …
Convex Optimization (OCO) under the coupling of two problem structures: the domain is …
Improved analysis for dynamic regret of strongly convex and smooth functions
In this paper, we present an improved analysis for dynamic regret of strongly convex and
smooth functions. Specifically, we investigate the Online Multiple Gradient Descent (OMGD) …
smooth functions. Specifically, we investigate the Online Multiple Gradient Descent (OMGD) …