Monte Carlo tree search: A review of recent modifications and applications
Abstract Monte Carlo Tree Search (MCTS) is a powerful approach to designing game-
playing bots or solving sequential decision problems. The method relies on intelligent tree …
playing bots or solving sequential decision problems. The method relies on intelligent tree …
Goal probability analysis in probabilistic planning: Exploring and enhancing the state of the art
Unavoidable dead-ends are common in many probabilistic planning problems, eg when
actions may fail or when operating under resource constraints. An important objective in …
actions may fail or when operating under resource constraints. An important objective in …
[图书][B] Handbuch der künstlichen Intelligenz
G Görz, CR Rollinger, J Schneeberger - 2003 - degruyter.com
Liste der Autoren Page 1 Liste der Autoren Clemens Beckstein Gerhard Brewka Christian
Borgelt Wolfram Burgard Hans-Dieter Burkhard Stephan Busemann Thomas Christaller Leonie …
Borgelt Wolfram Burgard Hans-Dieter Burkhard Stephan Busemann Thomas Christaller Leonie …
Trial-based heuristic tree search for finite horizon MDPs
Dynamic programming is a well-known approach for solving MDPs. In large state spaces,
asynchronous versions like Real-Time Dynamic Programming have been applied …
asynchronous versions like Real-Time Dynamic Programming have been applied …
Symbolic network: generalized neural policies for relational MDPs
Abstract A Relational Markov Decision Process (RMDP) is a first-order representation to
express all instances of a single probabilistic planning domain with possibly unbounded …
express all instances of a single probabilistic planning domain with possibly unbounded …
Relational abstractions for generalized reinforcement learning on symbolic problems
R Karia, S Srivastava - arXiv preprint arXiv:2204.12665, 2022 - arxiv.org
Reinforcement learning in problems with symbolic state spaces is challenging due to the
need for reasoning over long horizons. This paper presents a new approach that utilizes …
need for reasoning over long horizons. This paper presents a new approach that utilizes …
Size independent neural transfer for rddl planning
Neural planners for RDDL MDPs produce deep reactive policies in an offline fashion. These
scale well with large domains, but are sample inefficient and time-consuming to train from …
scale well with large domains, but are sample inefficient and time-consuming to train from …
pyrddlgym: From rddl to gym environments
We present pyRDDLGym, a Python framework for auto-generation of OpenAI Gym
environments from RDDL declerative description. The discrete time step evolution of …
environments from RDDL declerative description. The discrete time step evolution of …
Monte carlo tree search with boltzmann exploration
Abstract Monte-Carlo Tree Search (MCTS) methods, such as Upper Confidence Bound
applied to Trees (UCT), are instrumental to automated planning techniques. However, UCT …
applied to Trees (UCT), are instrumental to automated planning techniques. However, UCT …
Probabilistic planning for robotics with ROSPlan
Probabilistic planning is very useful for handling uncertainty in planning tasks to be carried
out by robots. ROSPlan is a framework for task planning in the Robot Operating System …
out by robots. ROSPlan is a framework for task planning in the Robot Operating System …