PROST: Probabilistic planning based on UCT

M Świechowski, K Godlewski, B Sawicki… - Artificial Intelligence …, 2023 - Springer

Abstract Monte Carlo Tree Search (MCTS) is a powerful approach to designing game-
playing bots or solving sequential decision problems. The method relies on intelligent tree …

被引用次数：291 相关文章所有 12 个版本

[PDF] jair.org

Goal probability analysis in probabilistic planning: Exploring and enhancing the state of the art

M Steinmetz, J Hoffmann, O Buffet - Journal of Artificial Intelligence …, 2016 - jair.org

Unavoidable dead-ends are common in many probabilistic planning problems, eg when
actions may fail or when operating under resource constraints. An important objective in …

被引用次数：54 相关文章所有 9 个版本

[PDF] academia.edu

[图书][B] Handbuch der künstlichen Intelligenz

G Görz, CR Rollinger, J Schneeberger - 2003 - degruyter.com

Liste der Autoren Page 1 Liste der Autoren Clemens Beckstein Gerhard Brewka Christian
Borgelt Wolfram Burgard Hans-Dieter Burkhard Stephan Busemann Thomas Christaller Leonie …

被引用次数：404 相关文章所有 9 个版本

[PDF] aaai.org

Trial-based heuristic tree search for finite horizon MDPs

T Keller, M Helmert - Proceedings of the International Conference on …, 2013 - ojs.aaai.org

Dynamic programming is a well-known approach for solving MDPs. In large state spaces,
asynchronous versions like Real-Time Dynamic Programming have been applied …

被引用次数：158 相关文章所有 19 个版本

[PDF] mlr.press

Symbolic network: generalized neural policies for relational MDPs

S Garg, A Bajpai - International Conference on Machine …, 2020 - proceedings.mlr.press

Abstract A Relational Markov Decision Process (RMDP) is a first-order representation to
express all instances of a single probabilistic planning domain with possibly unbounded …

被引用次数：55 相关文章所有 7 个版本

[PDF] arxiv.org

Relational abstractions for generalized reinforcement learning on symbolic problems

R Karia, S Srivastava - arXiv preprint arXiv:2204.12665, 2022 - arxiv.org

Reinforcement learning in problems with symbolic state spaces is challenging due to the
need for reasoning over long horizons. This paper presents a new approach that utilizes …

被引用次数：21 相关文章所有 8 个版本

[PDF] aaai.org

Size independent neural transfer for rddl planning

S Garg, A Bajpai - Proceedings of the International Conference on …, 2019 - aaai.org

Neural planners for RDDL MDPs produce deep reactive policies in an offline fashion. These
scale well with large domains, but are sample inefficient and time-consuming to train from …

被引用次数：50 相关文章所有 4 个版本

[PDF] arxiv.org

pyrddlgym: From rddl to gym environments

A Taitler, M Gimelfarb, J Jeong… - arXiv preprint arXiv …, 2022 - arxiv.org

We present pyRDDLGym, a Python framework for auto-generation of OpenAI Gym
environments from RDDL declerative description. The discrete time step evolution of …

被引用次数：9 相关文章所有 5 个版本

[PDF] neurips.cc

Monte carlo tree search with boltzmann exploration

M Painter, M Baioumy, N Hawes… - Advances in Neural …, 2024 - proceedings.neurips.cc

Abstract Monte-Carlo Tree Search (MCTS) methods, such as Upper Confidence Bound
applied to Trees (UCT), are instrumental to automated planning techniques. However, UCT …

被引用次数：4 相关文章所有 8 个版本

[PDF] csic.es

Probabilistic planning for robotics with ROSPlan

G Canal, M Cashmore, S Krivić, G Alenyà… - … Robotic Systems: 20th …, 2019 - Springer

Probabilistic planning is very useful for handling uncertainty in planning tasks to be carried
out by robots. ROSPlan is a framework for task planning in the Robot Operating System …

被引用次数：42 相关文章所有 5 个版本