Reinforcement learning and optimal adaptive control: An overview and implementation examples

SG Khan, G Herrmann, FL Lewis, T Pipe… - Annual reviews in …, 2012 - Elsevier
This paper provides an overview of the reinforcement learning and optimal adaptive control
literature and its application to robotics. Reinforcement learning is bridging the gap between …

Reinforcement learning with guarantees: a review

P Osinenko, D Dobriborsci, W Aumer - IFAC-PapersOnLine, 2022 - Elsevier
Reinforcement learning is concerned with a generic concept of an agent acting in an
environment. From the control theory standpoint, reinforcement learning may be considered …

Experience replay for real-time reinforcement learning control

S Adam, L Busoniu, R Babuska - IEEE Transactions on …, 2011 - ieeexplore.ieee.org
Reinforcement-learning (RL) algorithms can automatically learn optimal control strategies
for nonlinear, possibly stochastic systems. A promising approach for RL control is …

Approximate policy iteration: A survey and some new methods

DP Bertsekas - Journal of Control Theory and Applications, 2011 - Springer
We consider the classical policy iteration method of dynamic programming (DP), where
approximations and simulation are used to deal with the curse of dimensionality. We survey …

Real-time measurement-driven reinforcement learning control approach for uncertain nonlinear systems

M Abouheaf, D Boase, W Gueaieb, D Spinello… - … Applications of Artificial …, 2023 - Elsevier
The paper introduces an interactive machine learning mechanism to process the
measurements of an uncertain, nonlinear dynamic process and hence advise an actuation …

A data-driven reinforcement learning solution framework for optimal and adaptive personalization of a hip exoskeleton

X Tu, M Li, M Liu, J Si, HH Huang - 2021 IEEE international …, 2021 - ieeexplore.ieee.org
Robotic exoskeletons are exciting technologies for augmenting human mobility. However,
designing such a device for seamless integration with the human user and to assist human …

Approximate reinforcement learning: An overview

L Buşoniu, D Ernst, B De Schutter… - 2011 IEEE symposium …, 2011 - ieeexplore.ieee.org
Reinforcement learning (RL) allows agents to learn how to optimally interact with complex
environments. Fueled by recent advances in approximation-based algorithms, RL has …

Overcoming the challenges in cost estimation for distributed software projects

N Ramasubbu, RK Balan - 2012 34th international conference …, 2012 - ieeexplore.ieee.org
We describe how we studied, in-situ, the operational processes of three large high process
maturity distributed software development companies and discovered three common …

Pneumatic artificial muscle-driven robot control using local update reinforcement learning

Y Cui, T Matsubara, K Sugimoto - Advanced Robotics, 2017 - Taylor & Francis
In this study, a new value function based Reinforcement learning (RL) algorithm, Local
Update Dynamic Policy Programming (LUDPP), is proposed. It exploits the nature of smooth …

Q-learning and enhanced policy iteration in discounted dynamic programming

DP Bertsekas, H Yu - Mathematics of Operations Research, 2012 - pubsonline.informs.org
We consider the classical finite-state discounted Markovian decision problem, and we
introduce a new policy iteration-like algorithm for finding the optimal state costs or Q-factors …