[PDF][PDF] Federated natural policy gradient methods for multi-task reinforcement learning

T Yang, S Cen, Y Wei, Y Chen… - arXiv preprint arXiv …, 2023 - yuxinchen2020.github.io
Federated reinforcement learning (RL) enables collaborative decision making of multiple
distributed agents without sharing local data trajectories. In this work, we consider a multi …

Global Convergence of Natural Policy Gradient with Hessian-Aided Momentum Variance Reduction

J Feng, K Wei, J Chen - Journal of Scientific Computing, 2024 - Springer
Natural policy gradient (NPG) and its variants are widely-used policy search methods in
reinforcement learning. Inspired by prior work, a new NPG variant coined NPG-HM is …

Natural Policy Gradient and Actor Critic Methods for Constrained Multi-Task Reinforcement Learning

S Zeng, TT Doan, J Romberg - arXiv preprint arXiv:2405.02456, 2024 - arxiv.org
Multi-task reinforcement learning (RL) aims to find a single policy that effectively solves
multiple tasks at the same time. This paper presents a constrained formulation for multi-task …

A Zeroth-Order Variance-Reduced Method for Decentralized Stochastic Non-convex Optimization

H Chen, J Chen, K Wei - arXiv preprint arXiv:2310.18883, 2023 - arxiv.org
In this paper, we consider a distributed stochastic non-convex optimization problem, which is
about minimizing a sum of $ n $ local cost functions over a network with only zeroth-order …