High-quality policies for the canadian traveler's problem

H Geffner, B Bonet - 2013 - books.google.com

Planning is the model-based approach to autonomous behavior where the agent behavior is
derived automatically from a model of the actions, sensors, and goals. The main challenges …

被引用次数：458 相关文章所有 4 个版本

[PDF] aaai.org

PROST: Probabilistic planning based on UCT

T Keller, P Eyerich - Proceedings of the International Conference on …, 2012 - ojs.aaai.org

We present PROST, a probabilistic planning system that is based on the UCT algorithm by
Kocsis and Szepesvari (2006), which has been applied successfully to many areas of …

被引用次数：209 相关文章所有 11 个版本

[PDF] aaai.org

Simulated penetration testing: From" dijkstra" to" turing test++"

J Hoffmann - Proceedings of the international conference on …, 2015 - ojs.aaai.org

Penetration testing (pentesting) is a well established method for identifying security
weaknesses, by conducting friendly attacks. Simulated pentesting automates this process …

被引用次数：145 相关文章所有 8 个版本

[PDF] aaai.org

Trial-based heuristic tree search for finite horizon MDPs

T Keller, M Helmert - Proceedings of the International Conference on …, 2013 - ojs.aaai.org

Dynamic programming is a well-known approach for solving MDPs. In large state spaces,
asynchronous versions like Real-Time Dynamic Programming have been applied …

被引用次数：158 相关文章所有 19 个版本

[PDF] aaai.org

MCTS based on simple regret

D Tolpin, S Shimony - Proceedings of the AAAI Conference on Artificial …, 2012 - ojs.aaai.org

UCT, a state-of-the art algorithm for Monte Carlo tree search (MCTS) in games and Markov
decision processes, is based on UCB, a sampling policy for the Multi-armed Bandit problem …

被引用次数：63 相关文章所有 14 个版本

[PDF] usc.edu

A comparison of Monte Carlo tree search and rolling horizon optimization for large-scale dynamic resource allocation problems

D Bertsimas, JD Griffith, V Gupta… - European Journal of …, 2017 - Elsevier

Dynamic resource allocation (DRA) problems constitute an important class of dynamic
stochastic optimization problems that arise in many real-world applications. DRA problems …

被引用次数：52 相关文章所有 6 个版本

[PDF] aaai.org

Multi-modal journey planning in the presence of uncertainty

A Botea, E Nikolova, M Berlingerio - Proceedings of the International …, 2013 - ojs.aaai.org

Multi-modal journey planning, which allows multiple types of transport within a single trip, is
becoming increasingly popular, due to a strong practical interest and an increasing …

被引用次数：62 相关文章所有 9 个版本

[PDF] jair.org

Simple regret optimization in online planning for Markov decision processes

Z Feldman, C Domshlak - Journal of Artificial Intelligence Research, 2014 - jair.org

We consider online planning in Markov decision processes (MDPs). In online planning, the
agent focuses on its current state only, deliberates about the set of possible policies from …

被引用次数：60 相关文章所有 11 个版本

[PDF] arxiv.org

Monte Carlo tree search with heuristic evaluations using implicit minimax backups

M Lanctot, MHM Winands, T Pepels… - … IEEE Conference on …, 2014 - ieeexplore.ieee.org

Monte Carlo Tree Search (MCTS) has improved the performance of game engines in
domains such as Go, Hex, and general game playing. MCTS has been shown to outperform …

被引用次数：55 相关文章所有 17 个版本

[PDF] arxiv.org

Long-term robot navigation in indoor environments estimating patterns in traversability changes

L Nardi, C Stachniss - 2020 IEEE International Conference on …, 2020 - ieeexplore.ieee.org

Nowadays, mobile robots are deployed in many indoor environments such as offices or
hospitals. These environments are subject to changes in the traversability that often happen …

被引用次数：26 相关文章所有 7 个版本