Online least-squares policy iteration for reinforcement learning control

SG Khan, G Herrmann, FL Lewis, T Pipe… - Annual reviews in …, 2012 - Elsevier

This paper provides an overview of the reinforcement learning and optimal adaptive control
literature and its application to robotics. Reinforcement learning is bridging the gap between …

被引用次数：251 相关文章所有 8 个版本

Reinforcement learning with guarantees: a review

P Osinenko, D Dobriborsci, W Aumer - IFAC-PapersOnLine, 2022 - Elsevier

Reinforcement learning is concerned with a generic concept of an agent acting in an
environment. From the control theory standpoint, reinforcement learning may be considered …

被引用次数：20 相关文章

[PDF] busoniu.net

Experience replay for real-time reinforcement learning control

S Adam, L Busoniu, R Babuska - IEEE Transactions on …, 2011 - ieeexplore.ieee.org

Reinforcement-learning (RL) algorithms can automatically learn optimal control strategies
for nonlinear, possibly stochastic systems. A promising approach for RL control is …

被引用次数：312 相关文章所有 13 个版本

[PDF] washington.edu

Approximate policy iteration: A survey and some new methods

DP Bertsekas - Journal of Control Theory and Applications, 2011 - Springer

We consider the classical policy iteration method of dynamic programming (DP), where
approximations and simulation are used to deal with the curse of dimensionality. We survey …

被引用次数：328 相关文章所有 21 个版本

[PDF] arxiv.org

Real-time measurement-driven reinforcement learning control approach for uncertain nonlinear systems

M Abouheaf, D Boase, W Gueaieb, D Spinello… - … Applications of Artificial …, 2023 - Elsevier

The paper introduces an interactive machine learning mechanism to process the
measurements of an uncertain, nonlinear dynamic process and hence advise an actuation …

被引用次数：10 相关文章所有 6 个版本

[PDF] arxiv.org

A data-driven reinforcement learning solution framework for optimal and adaptive personalization of a hip exoskeleton

X Tu, M Li, M Liu, J Si, HH Huang - 2021 IEEE international …, 2021 - ieeexplore.ieee.org

Robotic exoskeletons are exciting technologies for augmenting human mobility. However,
designing such a device for seamless integration with the human user and to assist human …

被引用次数：28 相关文章所有 5 个版本

[PDF] uliege.be

Approximate reinforcement learning: An overview

L Buşoniu, D Ernst, B De Schutter… - 2011 IEEE symposium …, 2011 - ieeexplore.ieee.org

Reinforcement learning (RL) allows agents to learn how to optimally interact with complex
environments. Fueled by recent advances in approximation-based algorithms, RL has …

被引用次数：84 相关文章所有 15 个版本

[PDF] smu.edu.sg

Overcoming the challenges in cost estimation for distributed software projects

N Ramasubbu, RK Balan - 2012 34th international conference …, 2012 - ieeexplore.ieee.org

We describe how we studied, in-situ, the operational processes of three large high process
maturity distributed software development companies and discovered three common …

被引用次数：57 相关文章所有 7 个版本

[PDF] researchgate.net

Pneumatic artificial muscle-driven robot control using local update reinforcement learning

Y Cui, T Matsubara, K Sugimoto - Advanced Robotics, 2017 - Taylor & Francis

In this study, a new value function based Reinforcement learning (RL) algorithm, Local
Update Dynamic Policy Programming (LUDPP), is proposed. It exploits the nature of smooth …

被引用次数：38 相关文章所有 3 个版本

[PDF] mit.edu

Q-learning and enhanced policy iteration in discounted dynamic programming

DP Bertsekas, H Yu - Mathematics of Operations Research, 2012 - pubsonline.informs.org

We consider the classical finite-state discounted Markovian decision problem, and we
introduce a new policy iteration-like algorithm for finding the optimal state costs or Q-factors …

被引用次数：60 相关文章所有 24 个版本