Practical solution techniques for first-order MDPs
S Sanner, C Boutilier - Artificial Intelligence, 2009 - Elsevier
Many traditional solution approaches to relationally specified decision-theoretic planning
problems (eg, those stated in the probabilistic planning domain description language, or …
problems (eg, those stated in the probabilistic planning domain description language, or …
A theory of goal-oriented MDPs with dead ends
Stochastic Shortest Path (SSP) MDPs is a problem class widely studied in AI, especially in
probabilistic planning. They describe a wide range of scenarios but make the restrictive …
probabilistic planning. They describe a wide range of scenarios but make the restrictive …
LRTDP versus UCT for online probabilistic planning
UCT, the premier method for solving games such as Go, is also becoming the dominant
algorithm for probabilistic planning. Out of the five solvers at the International Probabilistic …
algorithm for probabilistic planning. Out of the five solvers at the International Probabilistic …
Shaping multi-agent systems with gradient reinforcement learning
O Buffet, A Dutech, F Charpillet - Autonomous Agents and Multi-Agent …, 2007 - Springer
An original reinforcement learning (RL) methodology is proposed for the design of multi-
agent systems. In the realistic setting of situated agents with local perception, the task of …
agent systems. In the realistic setting of situated agents with local perception, the task of …
ReTrASE: Intergating Paradigms for Approximate Probabilistic Planning
Past approaches for solving MDPs have several weaknesses: 1) Decision-theoretic
computation over the state space can yield optimal results but scales poorly. 2) Value …
computation over the state space can yield optimal results but scales poorly. 2) Value …
[PDF][PDF] FF+ FPG: Guiding a Policy-Gradient Planner.
O Buffet, D Aberdeen - ICAPS, 2007 - cdn.aaai.org
Abstract The Factored Policy-Gradient planner (FPG)(Buffet & Aberdeen 2006) was a
successful competitor in the probabilistic track of the 2006 International Planning …
successful competitor in the probabilistic track of the 2006 International Planning …
Combining policy search with planning in multi-agent cooperation
J Ma, S Cameron - RoboCup 2008: Robot Soccer World Cup XII 12, 2009 - Springer
It is cooperation that essentially differentiates multi-agent systems (MASs) from single-agent
intelligence. In realistic MAS applications such as RoboCup, repeated work has shown that …
intelligence. In realistic MAS applications such as RoboCup, repeated work has shown that …
Decision making under uncertainty: a quasimetric approach
We propose a new approach for solving a class of discrete decision making problems under
uncertainty with positive cost. This issue concerns multiple and diverse fields such as …
uncertainty with positive cost. This issue concerns multiple and diverse fields such as …
A proposal for semantic recommender for outdoor audio tour guides
A Koren, N Stash, A Andreev - … Applications (PeMA 2011) at the 5th …, 2011 - research.tue.nl
Location-based services are widely spread both as entertainment and business
applications. The focus of this work is on one particular area–tourist-oriented information …
applications. The focus of this work is on one particular area–tourist-oriented information …