Reinforcement learning: A survey

LP Kaelbling, ML Littman, AW Moore - Journal of artificial intelligence …, 1996 - jair.org
This paper surveys the field of reinforcement learning from a computer-science perspective.
It is written to be accessible to researchers familiar with machine learning. Both the historical …

[PDF][PDF] Foundations of machine learning

M Mohri - 2018 - dlib.hust.edu.vn
A new edition of a graduate-level machine learning textbook that focuses on the analysis
and theory of algorithms. This book is a general introduction to machine learning that can …

[图书][B] Reinforcement learning: An introduction

RS Sutton, AG Barto - 2018 - books.google.com
The significantly expanded and updated new edition of a widely used text on reinforcement
learning, one of the most active research areas in artificial intelligence. Reinforcement …

[图书][B] On the sample complexity of reinforcement learning

SM Kakade - 2003 - search.proquest.com
This thesis is a detailed investigation into the following question: how much data must an
agent collect in order to perform" reinforcement learning" successfully? This question is …

Average reward reinforcement learning: Foundations, algorithms, and empirical results

S Mahadevan - Machine learning, 1996 - Springer
This paper presents a detailed study of average reward reinforcement learning, an
undiscounted optimality framework that is more appropriate for cyclical tasks than the much …

From implicit skills to explicit knowledge: A bottom‐up model of skill learning

R Sun, E Merrill, T Peterson - Cognitive science, 2001 - Wiley Online Library
This paper presents a skill learning model CLARION. Different from existing models of
mostly high‐level skill learning that use a top‐down approach (that is, turning declarative …

[图书][B] Algorithms for sequential decision-making

ML Littman - 1996 - search.proquest.com
Sequential decision making is a fundamental task faced by any intelligent agent in an
extended interaction with its environment; it is the act of answering the question" What …

[图书][B] Optimal Learning: Computational procedures for Bayes-adaptive Markov decision processes

MOG Duff - 2002 - search.proquest.com
This dissertation considers a particular aspect of sequential decision making under
uncertainty in which, at each stage, a decision-making agent operating in an uncertain world …

[图书][B] Duality of the mind: A bottom-up approach toward cognition

R Sun - 2001 - taylorfrancis.com
This book is a condensation of a large body of work concerning human learning carried out
over a period of more than five years by Dr. Sun and his collaborators. In a nutshell, this …

[图书][B] Handbook of neural computation

E Fiesler, R Beale - 2020 - books.google.com
The Handbook of Neural Computation is a practical, hands-on guide to the design and
implementation of neural networks used by scientists and engineers to tackle difficult and/or …