A primer on partially observable Markov decision processes (POMDPs)

I Chadès, LV Pascal, S Nicol… - Methods in Ecology …, 2021 - Wiley Online Library
Partially observable Markov decision processes (POMDPs) are a convenient mathematical
model to solve sequential decision‐making problems under imperfect observations. Most …

Optimization methods to solve adaptive management problems

I Chadès, S Nicol, TM Rout, M Péron, Y Dujardin… - Theoretical …, 2017 - Springer
Determining the best management actions is challenging when critical information is
missing. However, urgency and limited resources require that decisions must be made …

Convergence of finite memory Q learning for POMDPs and near optimality of learned policies under filter stability

AD Kara, S Yüksel - Mathematics of Operations Research, 2023 - pubsonline.informs.org
In this paper, for partially observed Markov decision problems (POMDPs), we provide the
convergence of a Q learning algorithm for control policies using a finite history of past …

Near optimality of finite memory feedback policies in partially observed markov decision processes

A Kara, S Yuksel - Journal of Machine Learning Research, 2022 - jmlr.org
In the theory of Partially Observed Markov Decision Processes (POMDPs), existence of
optimal policies have in general been established via converting the original partially …

Optimizing honeypot strategies against dynamic lateral movement using partially observable stochastic games

K Horák, B Bošanský, P Tomášek, C Kiekintveld… - Computers & …, 2019 - Elsevier
Partially observable stochastic games (POSGs) are a general game-theoretic model for
capturing dynamic interactions where players have partial information. The existing …

Maintenance planning using continuous-state partially observable Markov decision processes and non-linear action models

R Schöbi, EN Chatzi - Structure and Infrastructure Engineering, 2016 - Taylor & Francis
The signs of deterioration in worldwide infrastructure and the associated socio-economic
and environmental losses call for sustainable resource management and policy-making. To …

Condition-based maintenance for traction power supply equipment based on partially observable Markov decision process

S Lin, R Fan, D Feng, C Yang… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Actual condition-based maintenance for traction power supply equipment (TPSE) is almost
based on completely observable equipment state. However, it is unpractical to accurately …

[图书][B] Finite Approximations in discrete-time stochastic control

N Saldi, T Linder, S Yüksel - 2018 - Springer
Control and optimization of dynamical systems in the presence of stochastic uncertainty is a
mature field with a large range of applications. A comprehensive treatment of such problems …

Combating coordinated pricing cyberattack and energy theft in smart home cyber-physical systems

Y Liu, Y Zhou, S Hu - … on Computer-Aided Design of Integrated …, 2017 - ieeexplore.ieee.org
The information exchange between the utility company and the smart community is crucial to
the smart home cyber-physical systems. Yet the interaction between the two parties is …

A Bayesian state‐space approach for invasive species management: the case of spotted wing drosophila

X Fan, MI Gómez, SS Atallah… - American Journal of …, 2020 - Wiley Online Library
Spotted wing drosophila (SWD) is an invasive pest with devastating effects on soft‐skinned
fruit crops. Due to its high economic impacts, current SWD management strategies usually …