Monte Carlo tree search: A review of recent modifications and applications

M Świechowski, K Godlewski, B Sawicki… - Artificial Intelligence …, 2023 - Springer
Abstract Monte Carlo Tree Search (MCTS) is a powerful approach to designing game-
playing bots or solving sequential decision problems. The method relies on intelligent tree …

Goal probability analysis in probabilistic planning: Exploring and enhancing the state of the art

M Steinmetz, J Hoffmann, O Buffet - Journal of Artificial Intelligence …, 2016 - jair.org
Unavoidable dead-ends are common in many probabilistic planning problems, eg when
actions may fail or when operating under resource constraints. An important objective in …

[图书][B] Handbuch der künstlichen Intelligenz

G Görz, CR Rollinger, J Schneeberger - 2003 - degruyter.com
Liste der Autoren Page 1 Liste der Autoren Clemens Beckstein Gerhard Brewka Christian
Borgelt Wolfram Burgard Hans-Dieter Burkhard Stephan Busemann Thomas Christaller Leonie …

Trial-based heuristic tree search for finite horizon MDPs

T Keller, M Helmert - Proceedings of the International Conference on …, 2013 - ojs.aaai.org
Dynamic programming is a well-known approach for solving MDPs. In large state spaces,
asynchronous versions like Real-Time Dynamic Programming have been applied …

Symbolic network: generalized neural policies for relational MDPs

S Garg, A Bajpai - International Conference on Machine …, 2020 - proceedings.mlr.press
Abstract A Relational Markov Decision Process (RMDP) is a first-order representation to
express all instances of a single probabilistic planning domain with possibly unbounded …

Relational abstractions for generalized reinforcement learning on symbolic problems

R Karia, S Srivastava - arXiv preprint arXiv:2204.12665, 2022 - arxiv.org
Reinforcement learning in problems with symbolic state spaces is challenging due to the
need for reasoning over long horizons. This paper presents a new approach that utilizes …

Size independent neural transfer for rddl planning

S Garg, A Bajpai - Proceedings of the International Conference on …, 2019 - aaai.org
Neural planners for RDDL MDPs produce deep reactive policies in an offline fashion. These
scale well with large domains, but are sample inefficient and time-consuming to train from …

pyrddlgym: From rddl to gym environments

A Taitler, M Gimelfarb, J Jeong… - arXiv preprint arXiv …, 2022 - arxiv.org
We present pyRDDLGym, a Python framework for auto-generation of OpenAI Gym
environments from RDDL declerative description. The discrete time step evolution of …

Monte carlo tree search with boltzmann exploration

M Painter, M Baioumy, N Hawes… - Advances in Neural …, 2024 - proceedings.neurips.cc
Abstract Monte-Carlo Tree Search (MCTS) methods, such as Upper Confidence Bound
applied to Trees (UCT), are instrumental to automated planning techniques. However, UCT …

Probabilistic planning for robotics with ROSPlan

G Canal, M Cashmore, S Krivić, G Alenyà… - … Robotic Systems: 20th …, 2019 - Springer
Probabilistic planning is very useful for handling uncertainty in planning tasks to be carried
out by robots. ROSPlan is a framework for task planning in the Robot Operating System …