POMDPs. jl: A framework for sequential decision making under uncertainty

MJ Kochenderfer, TA Wheeler, KH Wray - 2022 - books.google.com

A broad introduction to algorithms for decision making under uncertainty, introducing the
underlying mathematical problem formulations and the algorithms for solving them …

被引用次数：188 相关文章所有 8 个版本

[PDF] aaai.org

Online algorithms for POMDPs with continuous state, action, and observation spaces

Z Sunberg, M Kochenderfer - Proceedings of the International …, 2018 - ojs.aaai.org

Online solvers for partially observable Markov decision processes have been applied to
problems with large discrete state spaces, but continuous state, action, and observation …

被引用次数：271 相关文章所有 7 个版本

[PDF] jmlr.org

Approximate information state for approximate planning and reinforcement learning in partially observed systems

J Subramanian, A Sinha, R Seraj, A Mahajan - Journal of Machine …, 2022 - jmlr.org

We propose a theoretical framework for approximate planning and learning in partially
observed systems. Our framework is based on the fundamental notion of information state …

被引用次数：85 相关文章所有 14 个版本

[PDF] arxiv.org

Dynamic multi-robot task allocation under uncertainty and temporal constraints

S Choudhury, JK Gupta, MJ Kochenderfer, D Sadigh… - Autonomous …, 2022 - Springer

We consider the problem of dynamically allocating tasks to multiple agents under time
window constraints and task completion uncertainty. Our objective is to minimize the number …

被引用次数：99 相关文章所有 14 个版本

[PDF] mlr.press

Information particle filter tree: An online algorithm for pomdps with belief-based rewards on continuous domains

J Fischer, ÖS Tas - International Conference on Machine …, 2020 - proceedings.mlr.press

Abstract Planning in Partially Observable Markov Decision Processes (POMDPs) inherently
gathers the information necessary to act optimally under uncertainties. The framework can …

被引用次数：45 相关文章所有 6 个版本

[PDF] neurips.cc

Reinforcement learning with state observation costs in action-contingent noiselessly observable markov decision processes

HJA Nam, S Fleming… - Advances in Neural …, 2021 - proceedings.neurips.cc

Many real-world problems that require making optimal sequences of decisions under
uncertainty involve costs when the agent wishes to obtain information about its environment …

被引用次数：19 相关文章所有 5 个版本

[PDF] aaai.org

Bayesian optimized monte carlo planning

J Mern, A Yildiz, Z Sunberg, T Mukerji… - Proceedings of the …, 2021 - ojs.aaai.org

Online solvers for partially observable Markov decision processes have difficulty scaling to
problems with large action spaces. Monte Carlo tree search with progressive widening …

被引用次数：33 相关文章所有 5 个版本

[PDF] sagepub.com

Scalable information-theoretic path planning for a rover-helicopter team in uncertain environments

L Folsom, M Ono, K Otsu… - International Journal of …, 2021 - journals.sagepub.com

Mission-critical exploration of uncertain environments requires reliable and robust
mechanisms for achieving information gain. Typical measures of information gain such as …

被引用次数：25 相关文章所有 3 个版本

[PDF] aaai.org

Online planning for constrained POMDPs with continuous spaces through dual ascent

A Jamgochian, A Corso, MJ Kochenderfer - Proceedings of the …, 2023 - ojs.aaai.org

Rather than augmenting rewards with penalties for undesired behavior, Constrained
Partially Observable Markov Decision Processes (CPOMDPs) plan safely by imposing …

被引用次数：10 相关文章所有 5 个版本

[PDF] neurips.cc

Online pomdp planning with anytime deterministic guarantees

M Barenboim, V Indelman - Advances in Neural Information …, 2024 - proceedings.neurips.cc

Autonomous agents operating in real-world scenarios frequently encounter uncertainty and
make decisions based on incomplete information. Planning under uncertainty can be …

被引用次数：4 相关文章所有 8 个版本