Reinforcement learning for control: Performance, stability, and deep approximators

L Buşoniu, T De Bruin, D Tolić, J Kober… - Annual Reviews in …, 2018 - Elsevier
Reinforcement learning (RL) offers powerful algorithms to search for optimal controllers of
systems with nonlinear, possibly stochastic dynamics that are unknown or highly uncertain …

On the sample complexity of the linear quadratic regulator

S Dean, H Mania, N Matni, B Recht, S Tu - Foundations of Computational …, 2020 - Springer
This paper addresses the optimal control problem known as the linear quadratic regulator in
the case when the dynamics are unknown. We propose a multistage procedure, called …

An approximate neuro-optimal solution of discounted guaranteed cost control design

D Wang, J Qiao, L Cheng - IEEE Transactions on Cybernetics, 2020 - ieeexplore.ieee.org
The adaptive optimal feedback stabilization is investigated in this article for discounted
guaranteed cost control of uncertain nonlinear dynamical systems. Via theoretical analysis …

Optimal tracking control of nonlinear multiagent systems using internal reinforce Q-learning

Z Peng, R Luo, J Hu, K Shi, SK Nguang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
In this article, a novel reinforcement learning (RL) method is developed to solve the optimal
tracking control problem of unknown nonlinear multiagent systems (MASs). Different from …

Deep reinforcement learning approaches for process control

SPK Spielberg, RB Gopaluni… - 2017 6th international …, 2017 - ieeexplore.ieee.org
In this work, we have extended the current success of deep learning and reinforcement
learning to process control problems. We have shown that if reward hypothesis functions are …

Actor-critic reinforcement learning for control with stability guarantee

M Han, L Zhang, J Wang, W Pan - IEEE Robotics and …, 2020 - ieeexplore.ieee.org
Reinforcement Learning (RL) and its integration with deep learning have achieved
impressive performance in various robotic control tasks, ranging from motion planning and …

Offline and online adaptive critic control designs with stability guarantee through value iteration

M Ha, D Wang, D Liu - IEEE Transactions on Cybernetics, 2021 - ieeexplore.ieee.org
This article is concerned with the stability of the closed-loop system using various control
policies generated by value iteration. Some stability properties involving admissibility …

Output feedback Q-learning for discrete-time linear zero-sum games with application to the H-infinity control

SAA Rizvi, Z Lin - Automatica, 2018 - Elsevier
Approximate dynamic programming techniques usually rely on the feedback of the
measurement of the complete state, which is generally not available in practical situations. In …

Input–output data-based output antisynchronization control of multiagent systems using reinforcement learning approach

Z Peng, Y Zhao, J Hu, R Luo, BK Ghosh… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
This article investigates an output antisynchronization problem of multiagent systems by
using an input-output data-based reinforcement learning approach. Till now, most of the …

Machine learning in event-triggered control: Recent advances and open issues

L Sedghi, Z Ijaz, M Noor-A-Rahim… - IEEE …, 2022 - ieeexplore.ieee.org
Networked control systems have gained considerable attention over the last decade as a
result of the trend towards decentralised control applications and the emergence of cyber …