Knightcap: a chess program that learns by combining td (lambda) with game-tree search

Recent advances in deep reinforcement learning applications for solving partially observable markov decision processes (pomdp) problems: Part 1—fundamentals …

X Xiang, S Foo - Machine Learning and Knowledge Extraction, 2021 - mdpi.com

The first part of a two-part series of papers provides a survey on recent advances in Deep
Reinforcement Learning (DRL) applications for solving partially observable Markov decision …

被引用次数：59 相关文章所有 6 个版本

[PDF] hoasen.edu.vn

[图书][B] Neural networks and deep learning

CC Aggarwal - 2018 - Springer

“Any AI smart enough to pass a Turing test is smart enough to know to fail it.”–*** Ian
McDonald Neural networks were developed to simulate the human nervous system for …

被引用次数：3867 相关文章所有 15 个版本

[HTML] sciencedirect.com

[HTML][HTML] The hanabi challenge: A new frontier for ai research

N Bard, JN Foerster, S Chandar, N Burch, M Lanctot… - Artificial Intelligence, 2020 - Elsevier

From the early days of computing, games have been important testbeds for studying how
well machines can do sophisticated decision making. In recent years, machine learning has …

被引用次数：452 相关文章所有 9 个版本

[PDF] nowpublishers.com

Bayesian reinforcement learning: A survey

M Ghavamzadeh, S Mannor, J Pineau… - … and Trends® in …, 2015 - nowpublishers.com

Bayesian methods for machine learning have been widely investigated, yielding principled
methods for incorporating prior information into inference algorithms. In this survey, we …

被引用次数：586 相关文章所有 11 个版本

[PDF] neurips.cc

Incremental natural actor-critic algorithms

S Bhatnagar, M Ghavamzadeh… - Advances in neural …, 2007 - proceedings.neurips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-
gradient ideas, and provide their convergence proofs. Actor-critic rein-forcement learning …

被引用次数：1113 相关文章所有 33 个版本

[PDF] academia.edu

[PDF][PDF] Intelligent traffic light control

M Wiering, J Van Veenen, J Vreeken… - Institute of Information …, 2004 - academia.edu

Vehicular travel is increasing throughout the world, particularly in large urban areas.
Therefore the need arises for simulating and optimizing traffic control algorithms to better …

被引用次数：301 相关文章所有 10 个版本

[PDF] sciencedirect.com

Programming backgammon using self-teaching neural nets

G Tesauro - Artificial Intelligence, 2002 - Elsevier

TD-Gammon is a neural network that is able to teach itself to play backgammon solely by
playing against itself and learning from the results. Starting from random initial play, TD …

被引用次数：315 相关文章所有 15 个版本

[PDF] mlr.press

Learning to search with mctsnets

A Guez, T Weber, I Antonoglou… - International …, 2018 - proceedings.mlr.press

Planning problems are among the most important and well-studied problems in artificial
intelligence. They are most typically solved by tree search algorithms that simulate ahead …

被引用次数：98 相关文章所有 6 个版本

[PDF] tu-chemnitz.de

Td-gammon: A self-teaching backgammon program

G Tesauro - Applications of neural networks, 1995 - Springer

Furthermore, when a set of hand-crafted features is added to the network's input
representation, the result is a truly staggering level of performance: TO-Gammon is now …

被引用次数：151 相关文章所有 4 个版本

[PDF] 107.167.189.191

[PDF][PDF] Reinforcement learning in board games

I Ghory - Department of Computer Science, University of Bristol …, 2004 - 107.167.189.191

This project investigates the application of the TD (λ) reinforcement learning algorithm and
neural networks to the problem of producing an agent that can play board games. It provides …

被引用次数：137 相关文章所有 3 个版本