A Review of Safe Reinforcement Learning: Methods, Theories and Applications
Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …
making tasks. However, safety concerns are raised during deploying RL in real-world …
Safe learning in robotics: From learning-based control to safe reinforcement learning
The last half decade has seen a steep rise in the number of contributions on safe learning
methods for real-world robotic deployments from both the control and reinforcement learning …
methods for real-world robotic deployments from both the control and reinforcement learning …
[图书][B] Reinforcement learning and optimal control
D Bertsekas - 2019 - books.google.com
This book considers large and challenging multistage decision problems, which can be
solved in principle by dynamic programming (DP), but their exact solution is computationally …
solved in principle by dynamic programming (DP), but their exact solution is computationally …
Unmanned aerial vehicles: Control methods and future challenges
With the rapid development of computer technology, automatic control technology and
communication technology, research on unmanned aerial vehicles (UAVs) has attracted …
communication technology, research on unmanned aerial vehicles (UAVs) has attracted …
Optimal and autonomous control using reinforcement learning: A survey
B Kiumarsi, KG Vamvoudakis… - IEEE transactions on …, 2017 - ieeexplore.ieee.org
This paper reviews the current state of the art on reinforcement learning (RL)-based
feedback control solutions to optimal regulation and tracking of single and multiagent …
feedback control solutions to optimal regulation and tracking of single and multiagent …
Optnet: Differentiable optimization as a layer in neural networks
This paper presents OptNet, a network architecture that integrates optimization problems
(here, specifically in the form of quadratic programs) as individual layers in larger end-to-end …
(here, specifically in the form of quadratic programs) as individual layers in larger end-to-end …
[图书][B] Sliding mode control and observation
Control in the presence of uncertainty is one of the main topics of modern control theory. In
the formulation of any control problem there is always a discrepancy between the actual …
the formulation of any control problem there is always a discrepancy between the actual …
A historical perspective of adaptive control and learning
AM Annaswamy, AL Fradkov - Annual Reviews in Control, 2021 - Elsevier
This article provides a historical perspective of the field of adaptive control over the past
seven decades and its intersection with learning. A chronology of key events over this large …
seven decades and its intersection with learning. A chronology of key events over this large …
Analysis and design of optimization algorithms via integral quadratic constraints
This paper develops a new framework to analyze and design iterative optimization
algorithms built on the notion of integral quadratic constraints (IQCs) from robust control …
algorithms built on the notion of integral quadratic constraints (IQCs) from robust control …
Parameters estimation via dynamic regressor extension and mixing
A new way to design parameter estimators with enhanced performance is proposed in the
paper. The procedure consists of two stages, first, the generation of new regression forms …
paper. The procedure consists of two stages, first, the generation of new regression forms …