Monte Carlo sampling methods for approximating interactive POMDPs
P Doshi, PJ Gmytrasiewicz - Journal of Artificial Intelligence Research, 2009 - jair.org
Partially observable Markov decision processes (POMDPs) provide a principled framework
for sequential planning in uncertain single agent settings. An extension of POMDPs to
multiagent settings, called interactive POMDPs (I-POMDPs), replaces POMDP belief spaces
with interactive hierarchical belief systems which represent an agents belief about the
physical world, about beliefs of other agents, and about their beliefs about others beliefs.
This modification makes the difficulties of obtaining solutions due to complexity of the belief …
for sequential planning in uncertain single agent settings. An extension of POMDPs to
multiagent settings, called interactive POMDPs (I-POMDPs), replaces POMDP belief spaces
with interactive hierarchical belief systems which represent an agents belief about the
physical world, about beliefs of other agents, and about their beliefs about others beliefs.
This modification makes the difficulties of obtaining solutions due to complexity of the belief …
Monte Carlo Sampling Methods for Approximating Interactive POMDPs
P Doshi, PJ Gmytrasiewicz - arXiv e-prints, 2014 - ui.adsabs.harvard.edu
Partially observable Markov decision processes (POMDPs) provide a principled framework
for sequential planning in uncertain single agent settings. An extension of POMDPs to
multiagent settings, called interactive POMDPs (I-POMDPs), replaces POMDP belief spaces
with interactive hierarchical belief systems which represent an agent's belief about the
physical world, about beliefs of other agents, and about their beliefs about others' beliefs.
This modification makes the difficulties of obtaining solutions due to complexity of the belief …
for sequential planning in uncertain single agent settings. An extension of POMDPs to
multiagent settings, called interactive POMDPs (I-POMDPs), replaces POMDP belief spaces
with interactive hierarchical belief systems which represent an agent's belief about the
physical world, about beliefs of other agents, and about their beliefs about others' beliefs.
This modification makes the difficulties of obtaining solutions due to complexity of the belief …
以上显示的是最相近的搜索结果。 查看全部搜索结果