Bias and extrapolation in Markovian linear stochastic approximation with constant stepsizes

EV Vlatakis-Gkaragkounis, A Giannou… - International …, 2024 - proceedings.mlr.press

For min-max optimization and variational inequalities problems (VIPs), Stochastic
Extragradient (SEG) and Stochastic Gradient Descent Ascent (SGDA) have emerged as …

被引用次数：3 相关文章所有 3 个版本

[PDF] mlr.press

Improved High-Probability Bounds for the Temporal Difference Learning Algorithm via Exponential Stability

S Samsonov, D Tiapkin, A Naumov… - The Thirty Seventh …, 2024 - proceedings.mlr.press

In this paper we consider the problem of obtaining sharp bounds for the performance of
temporal difference (TD) methods with linear function approximation for policy evaluation in …

被引用次数：2 相关文章

[PDF] arxiv.org

Finite-sample analysis of the Temporal Difference Learning

S Samsonov, D Tiapkin, A Naumov… - arXiv preprint arXiv …, 2023 - arxiv.org

In this paper we consider the problem of obtaining sharp bounds for the performance of
temporal difference (TD) methods with linear functional approximation for policy evaluation …

被引用次数：3 相关文章所有 2 个版本

[PDF] aaai.org

Effectiveness of Constant Stepsize in Markovian LSA and Statistical Inference

DL Huo, Y Chen, Q Xie - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org

In this paper, we study the effectiveness of using a constant stepsize in statistical inference
via linear stochastic approximation (LSA) algorithms with Markovian data. After establishing …

被引用次数：2 相关文章所有 3 个版本

[PDF] arxiv.org

Rates of Convergence in the Central Limit Theorem for Markov Chains, with an Application to TD Learning

R Srikant - arXiv preprint arXiv:2401.15719, 2024 - arxiv.org

We prove a non-asymptotic central limit theorem for vector-valued martingale differences
using Stein's method, and use Poisson's equation to extend the result to functions of Markov …

被引用次数：3 相关文章所有 2 个版本

[PDF] researchgate.net

The curse of memory in stochastic approximation

CK Lauand, S Meyn - 2023 62nd IEEE Conference on Decision …, 2023 - ieeexplore.ieee.org

Theory and application of stochastic approximation (SA) has grown within the control
systems community since the earliest days of adaptive control. This paper takes a new look …

被引用次数：5 相关文章所有 6 个版本

[PDF] arxiv.org

Computing the Bias of Constant-step Stochastic Approximation with Markovian Noise

S Allmeier, N Gast - arXiv preprint arXiv:2405.14285, 2024 - arxiv.org

We study stochastic approximation algorithms with Markovian noise and constant step-size
$\alpha $. We develop a method based on infinitesimal generator comparisons to study the …

被引用次数：3 相关文章所有 2 个版本

[PDF] arxiv.org

Rosenthal-type inequalities for linear statistics of markov chains

A Durmus, E Moulines, A Naumov, S Samsonov… - arXiv preprint arXiv …, 2023 - arxiv.org

In this paper, we establish novel deviation bounds for additive functionals of geometrically
ergodic Markov chains similar to Rosenthal and Bernstein inequalities for sums of …

被引用次数：4 相关文章所有 3 个版本

[PDF] arxiv.org

Asymptotics of Stochastic Gradient Descent with Dropout Regularization in Linear Models

J Li, J Schmidt-Hieber, WB Wu - arXiv preprint arXiv:2409.07434, 2024 - arxiv.org

This paper proposes an asymptotic theory for online inference of the stochastic gradient
descent (SGD) iterates with dropout regularization in linear regression. Specifically, we …

Gaussian Approximation and Multiplier Bootstrap for Polyak-Ruppert Averaged Linear Stochastic Approximation with Applications to TD Learning

S Samsonov, E Moulines, QM Shao, ZS Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org

In this paper, we obtain the Berry-Esseen bound for multivariate normal approximation for
the Polyak-Ruppert averaged iterates of the linear stochastic approximation (LSA) algorithm …