Stochastic methods in variational inequalities: Ergodicity, bias and refinements
EV Vlatakis-Gkaragkounis, A Giannou… - International …, 2024 - proceedings.mlr.press
For min-max optimization and variational inequalities problems (VIPs), Stochastic
Extragradient (SEG) and Stochastic Gradient Descent Ascent (SGDA) have emerged as …
Extragradient (SEG) and Stochastic Gradient Descent Ascent (SGDA) have emerged as …
Improved High-Probability Bounds for the Temporal Difference Learning Algorithm via Exponential Stability
In this paper we consider the problem of obtaining sharp bounds for the performance of
temporal difference (TD) methods with linear function approximation for policy evaluation in …
temporal difference (TD) methods with linear function approximation for policy evaluation in …
Finite-sample analysis of the Temporal Difference Learning
In this paper we consider the problem of obtaining sharp bounds for the performance of
temporal difference (TD) methods with linear functional approximation for policy evaluation …
temporal difference (TD) methods with linear functional approximation for policy evaluation …
Effectiveness of Constant Stepsize in Markovian LSA and Statistical Inference
In this paper, we study the effectiveness of using a constant stepsize in statistical inference
via linear stochastic approximation (LSA) algorithms with Markovian data. After establishing …
via linear stochastic approximation (LSA) algorithms with Markovian data. After establishing …
Rates of Convergence in the Central Limit Theorem for Markov Chains, with an Application to TD Learning
R Srikant - arXiv preprint arXiv:2401.15719, 2024 - arxiv.org
We prove a non-asymptotic central limit theorem for vector-valued martingale differences
using Stein's method, and use Poisson's equation to extend the result to functions of Markov …
using Stein's method, and use Poisson's equation to extend the result to functions of Markov …
The curse of memory in stochastic approximation
Theory and application of stochastic approximation (SA) has grown within the control
systems community since the earliest days of adaptive control. This paper takes a new look …
systems community since the earliest days of adaptive control. This paper takes a new look …
Computing the Bias of Constant-step Stochastic Approximation with Markovian Noise
S Allmeier, N Gast - arXiv preprint arXiv:2405.14285, 2024 - arxiv.org
We study stochastic approximation algorithms with Markovian noise and constant step-size
$\alpha $. We develop a method based on infinitesimal generator comparisons to study the …
$\alpha $. We develop a method based on infinitesimal generator comparisons to study the …
Rosenthal-type inequalities for linear statistics of markov chains
In this paper, we establish novel deviation bounds for additive functionals of geometrically
ergodic Markov chains similar to Rosenthal and Bernstein inequalities for sums of …
ergodic Markov chains similar to Rosenthal and Bernstein inequalities for sums of …
Asymptotics of Stochastic Gradient Descent with Dropout Regularization in Linear Models
This paper proposes an asymptotic theory for online inference of the stochastic gradient
descent (SGD) iterates with dropout regularization in linear regression. Specifically, we …
descent (SGD) iterates with dropout regularization in linear regression. Specifically, we …
Gaussian Approximation and Multiplier Bootstrap for Polyak-Ruppert Averaged Linear Stochastic Approximation with Applications to TD Learning
In this paper, we obtain the Berry-Esseen bound for multivariate normal approximation for
the Polyak-Ruppert averaged iterates of the linear stochastic approximation (LSA) algorithm …
the Polyak-Ruppert averaged iterates of the linear stochastic approximation (LSA) algorithm …