A primer on partially observable Markov decision processes (POMDPs)
Partially observable Markov decision processes (POMDPs) are a convenient mathematical
model to solve sequential decision‐making problems under imperfect observations. Most …
model to solve sequential decision‐making problems under imperfect observations. Most …
Optimization methods to solve adaptive management problems
Determining the best management actions is challenging when critical information is
missing. However, urgency and limited resources require that decisions must be made …
missing. However, urgency and limited resources require that decisions must be made …
Convergence of finite memory Q learning for POMDPs and near optimality of learned policies under filter stability
In this paper, for partially observed Markov decision problems (POMDPs), we provide the
convergence of a Q learning algorithm for control policies using a finite history of past …
convergence of a Q learning algorithm for control policies using a finite history of past …
Near optimality of finite memory feedback policies in partially observed markov decision processes
In the theory of Partially Observed Markov Decision Processes (POMDPs), existence of
optimal policies have in general been established via converting the original partially …
optimal policies have in general been established via converting the original partially …
Optimizing honeypot strategies against dynamic lateral movement using partially observable stochastic games
Partially observable stochastic games (POSGs) are a general game-theoretic model for
capturing dynamic interactions where players have partial information. The existing …
capturing dynamic interactions where players have partial information. The existing …
Maintenance planning using continuous-state partially observable Markov decision processes and non-linear action models
The signs of deterioration in worldwide infrastructure and the associated socio-economic
and environmental losses call for sustainable resource management and policy-making. To …
and environmental losses call for sustainable resource management and policy-making. To …
Condition-based maintenance for traction power supply equipment based on partially observable Markov decision process
S Lin, R Fan, D Feng, C Yang… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Actual condition-based maintenance for traction power supply equipment (TPSE) is almost
based on completely observable equipment state. However, it is unpractical to accurately …
based on completely observable equipment state. However, it is unpractical to accurately …
[图书][B] Finite Approximations in discrete-time stochastic control
Control and optimization of dynamical systems in the presence of stochastic uncertainty is a
mature field with a large range of applications. A comprehensive treatment of such problems …
mature field with a large range of applications. A comprehensive treatment of such problems …
Combating coordinated pricing cyberattack and energy theft in smart home cyber-physical systems
The information exchange between the utility company and the smart community is crucial to
the smart home cyber-physical systems. Yet the interaction between the two parties is …
the smart home cyber-physical systems. Yet the interaction between the two parties is …
A Bayesian state‐space approach for invasive species management: the case of spotted wing drosophila
Spotted wing drosophila (SWD) is an invasive pest with devastating effects on soft‐skinned
fruit crops. Due to its high economic impacts, current SWD management strategies usually …
fruit crops. Due to its high economic impacts, current SWD management strategies usually …