Solving continuous-state POMDPs via density projection

I Chadès, LV Pascal, S Nicol… - Methods in Ecology …, 2021 - Wiley Online Library

Partially observable Markov decision processes (POMDPs) are a convenient mathematical
model to solve sequential decision‐making problems under imperfect observations. Most …

被引用次数：35 相关文章所有 4 个版本

[PDF] google.com

Optimization methods to solve adaptive management problems

I Chadès, S Nicol, TM Rout, M Péron, Y Dujardin… - Theoretical …, 2017 - Springer

Determining the best management actions is challenging when critical information is
missing. However, urgency and limited resources require that decisions must be made …

被引用次数：68 相关文章所有 12 个版本

[PDF] arxiv.org

Convergence of finite memory Q learning for POMDPs and near optimality of learned policies under filter stability

AD Kara, S Yüksel - Mathematics of Operations Research, 2023 - pubsonline.informs.org

In this paper, for partially observed Markov decision problems (POMDPs), we provide the
convergence of a Q learning algorithm for control policies using a finite history of past …

被引用次数：42 相关文章所有 6 个版本

[PDF] jmlr.org

Near optimality of finite memory feedback policies in partially observed markov decision processes

A Kara, S Yuksel - Journal of Machine Learning Research, 2022 - jmlr.org

In the theory of Partially Observed Markov Decision Processes (POMDPs), existence of
optimal policies have in general been established via converting the original partially …

被引用次数：37 相关文章所有 6 个版本

Optimizing honeypot strategies against dynamic lateral movement using partially observable stochastic games

K Horák, B Bošanský, P Tomášek, C Kiekintveld… - Computers & …, 2019 - Elsevier

Partially observable stochastic games (POSGs) are a general game-theoretic model for
capturing dynamic interactions where players have partial information. The existing …

被引用次数：55 相关文章所有 2 个版本

Maintenance planning using continuous-state partially observable Markov decision processes and non-linear action models

R Schöbi, EN Chatzi - Structure and Infrastructure Engineering, 2016 - Taylor & Francis

The signs of deterioration in worldwide infrastructure and the associated socio-economic
and environmental losses call for sustainable resource management and policy-making. To …

被引用次数：67 相关文章所有 4 个版本

Condition-based maintenance for traction power supply equipment based on partially observable Markov decision process

S Lin, R Fan, D Feng, C Yang… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org

Actual condition-based maintenance for traction power supply equipment (TPSE) is almost
based on completely observable equipment state. However, it is unpractical to accurately …

被引用次数：31 相关文章所有 3 个版本

[图书][B] Finite Approximations in discrete-time stochastic control

N Saldi, T Linder, S Yüksel - 2018 - Springer

Control and optimization of dynamical systems in the presence of stochastic uncertainty is a
mature field with a large range of applications. A comprehensive treatment of such problems …

被引用次数：49 相关文章所有 4 个版本

Combating coordinated pricing cyberattack and energy theft in smart home cyber-physical systems

Y Liu, Y Zhou, S Hu - … on Computer-Aided Design of Integrated …, 2017 - ieeexplore.ieee.org

The information exchange between the utility company and the smart community is crucial to
the smart home cyber-physical systems. Yet the interaction between the two parties is …

被引用次数：46 相关文章所有 3 个版本

A Bayesian state‐space approach for invasive species management: the case of spotted wing drosophila

X Fan, MI Gómez, SS Atallah… - American Journal of …, 2020 - Wiley Online Library

Spotted wing drosophila (SWD) is an invasive pest with devastating effects on soft‐skinned
fruit crops. Due to its high economic impacts, current SWD management strategies usually …

被引用次数：25 相关文章所有 7 个版本