A finite time analysis of temporal difference learning with linear function approximation

J Bhandari, D Russo, R Singal - Conference on learning …, 2018 - proceedings.mlr.press
… explicit finite time analysis of temporal difference learning with linear function approximation.
… and explicit finite time analysis of temporal difference learning. We draw inspiration from the …

Finite-time analysis of decentralized temporal-difference learning with linear function approximation

J Sun, G Wang, GB Giannakis… - International …, 2020 - proceedings.mlr.press
… of a decentralized linear function approximation variant of the vanilla TD(0) learning, for …
We proved that such decentralized TD(0) algorithms converge linearly to a small neighborhood …

Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation

G Patil, LA Prashanth, D Nagaraj… - International …, 2023 - proceedings.mlr.press
TD with function approximation used for our analysis. In Section 3, we describe the tail-averaged
TD algorithm, and also present the finite time … in a TD algorithm, and provide finite time

Finite-time performance of distributed temporal-difference learning with linear function approximation

TT Doan, ST Maguluri, J Romberg - SIAM Journal on Mathematics of Data …, 2021 - SIAM
… we study a distributed variant of the temporal-difference learning method for solving the policy
evaluation problem in multi-agent reinforcement learning… , followed by a local TD(λ) update. …

Analysis of temporal-difference learning with function approximation

J Tsitsiklis, B Van Roy - Advances in neural information …, 1996 - proceedings.neurips.cc
… of a Markov chain using linear function approximators. The algorithm we … P and that at
time t the parameter vector r has been set to some value rt. We define the temporal difference dt …

Adaptive temporal difference learning with linear function approximation

T Sun, H Shen, T Chen, D Li - … Transactions on Pattern Analysis …, 2021 - ieeexplore.ieee.org
… of the TD(0) learning algorithm with linear function approximation that we term AdaTD (0). In
contrast to the TD(0)… Singal, “A finite time analysis of temporal difference learning with linear

Improved temporal difference methods with linear function approximation

DP Bertsekas, VS Borkar, A Nedic - Learning and Approximate Dynamic …, 2004 - mit.edu
Summary: This chapter considers temporal difference algorithms within the context of
infinite-horizon finite… discounted cost and linear cost function approximation. This problem arises …

On the convergence of temporal-difference learning with linear function approximation

V Tadić - Machine learning, 2001 - Springer
… and asymptotic approximation error of temporal-difference learning algorithms … linear function
approximation are analyzed. The analysis is carried out in the context of the approximation

Fast gradient-descent methods for temporal-difference learning with linear function approximation

RS Sutton, HR Maei, D Precup, S Bhatnagar… - … on machine learning, 2009 - dl.acm.org
… We have introduced two new gradient-based temporal-difference learning algorithms … linear
function approximation in a general setting that includes both on-policy and off-policy learning

A Convergent Temporal-difference Algorithm for Off-policy Learning with Linear Function Approximation

RS Sutton, H Maei… - Advances in neural …, 2008 - proceedings.neurips.cc
… be practical to approximate the value of each state individually. Here we consider linear
function approximation, in which … The approximation to the value function is then required to be …