Input Generalization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons.

E Bout, V Loscri, A Gallais - IEEE Communications Surveys & …, 2021 - ieeexplore.ieee.org

The Internet of Things (IoT) has continued gaining in popularity and importance in everyday
life in recent years. However, this development does not only present advantages. Indeed …

被引用次数：106 相关文章所有 5 个版本

[PDF] jair.org

Reinforcement learning: A survey

LP Kaelbling, ML Littman, AW Moore - Journal of artificial intelligence …, 1996 - jair.org

This paper surveys the field of reinforcement learning from a computer-science perspective.
It is written to be accessible to researchers familiar with machine learning. Both the historical …

被引用次数：12417 相关文章所有 77 个版本

[PDF] ieee.org

Q-learning algorithms: A comprehensive classification and applications

B Jang, M Kim, G Harerimana, JW Kim - IEEE access, 2019 - ieeexplore.ieee.org

Q-learning is arguably one of the most applied representative reinforcement learning
approaches and one of the off-policy strategies. Since the emergence of Q-learning, many …

被引用次数：585 相关文章所有 6 个版本

[PDF] sciltp.com

Reinforcement learning for disassembly system optimization problems: A survey

X Guo, Z Bi, J Wang, S Qin, S Liu, L Qi - International Journal of Network …, 2023 - sciltp.com

The disassembly complexity of end-of-life products increases continuously. Traditional
methods are facing difficulties in solving the decision-making and control problems of …

被引用次数：66 相关文章所有 2 个版本

[PDF] springer.com

Q-learning

CJCH Watkins, P Dayan - Machine learning, 1992 - Springer

Abstract Q-learning (Watkins, 1989) is a simple way for agents to learn how to act optimally
in controlled Markovian domains. It amounts to an incremental method for dynamic …

被引用次数：20477 相关文章所有 35 个版本

[PDF] academia.edu

[图书][B] Reinforcement learning: An introduction

RS Sutton, AG Barto - 2018 - books.google.com

The significantly expanded and updated new edition of a widely used text on reinforcement
learning, one of the most active research areas in artificial intelligence. Reinforcement …

被引用次数：77905 相关文章所有 54 个版本

[PDF] springer.com

Self-improving reactive agents based on reinforcement learning, planning and teaching

LJ Lin - Machine learning, 1992 - Springer

To date, reinforcement learning has mostly been studied solving simple learning tasks.
Reinforcement learning methods that have been studied so far typically converge slowly …

被引用次数：2705 相关文章所有 16 个版本

[PDF] dtic.mil

[图书][B] Reinforcement learning for robots using neural networks

LJ Lin - 1992 - search.proquest.com

Reinforcement learning agents are adaptive, reactive, and self-supervised. The aim of this
dissertation is to extend the state of the art of reinforcement learning and enable its …

被引用次数：1450 相关文章所有 5 个版本

[PDF] psu.edu

Improving generalization for temporal difference learning: The successor representation

P Dayan - Neural computation, 1993 - ieeexplore.ieee.org

Estimation of returns over time, the focus of temporal difference (TD) algorithms, imposes
particular constraints on good function approximators or representations. Appropriate …

被引用次数：964 相关文章所有 25 个版本

[PDF] sciencedirect.com

Learning to act using real-time dynamic programming

AG Barto, SJ Bradtke, SP Singh - Artificial intelligence, 1995 - Elsevier

Learning methods based on dynamic programming (DP) are receiving increasing attention
in artificial intelligence. Researchers have argued that DP provides the appropriate basis for …

被引用次数：1691 相关文章所有 27 个版本