[Book] Continuous-time Markov decision processes
X Guo, O Hernández-Lerma - 2009 - Springer
In Chap. 2, we formally introduce the concepts associated with a continuous-time MDP.
Namely, the basic model of continuous-time MDPs and the concept of a Markov policy are …
Constrained-cost adaptive dynamic programming for optimal control of discrete-time nonlinear systems
Q Wei, T Li - IEEE Transactions on Neural Networks and …, 2023 - ieeexplore.ieee.org
For discrete-time nonlinear systems, this research is concerned with optimal control
problems (OCPs) with constrained cost, and a novel value iteration with constrained cost …
A convex analytic approach to risk-aware Markov decision processes
WB Haskell, R Jain - SIAM Journal on Control and Optimization, 2015 - SIAM
In classical Markov decision process (MDP) theory, we search for a policy that, say,
minimizes the expected infinite horizon discounted cost. Expectation is, of course, a risk …
Continuous time discounted jump Markov decision processes: a discrete-event approach
EA Feinberg - Mathematics of Operations Research, 2004 - pubsonline.informs.org
This paper introduces and develops a new approach to the theory of continuous time jump
Markov decision processes (CTJMDP). This approach reduces discounted CTJMDPs to …
[Book] Finite Approximations in discrete-time stochastic control
Control and optimization of dynamical systems in the presence of stochastic uncertainty is a
mature field with a large range of applications. A comprehensive treatment of such problems …
Constrained average cost Markov control processes in Borel spaces
O Hernández-Lerma, J González-Hernández… - SIAM Journal on Control …, 2003 - SIAM
This paper considers constrained Markov control processes in Borel spaces, with
unbounded costs. The criterion to be minimized is a long-run expected average cost, and …
Maximum Causal Entropy Inverse Reinforcement Learning for Mean-Field Games
In this paper, we introduce the maximum causal entropy Inverse Reinforcement Learning
(IRL) problem for discrete-time mean-field games (MFGs) under an infinite-horizon …
Sample-path optimality and variance-minimization of average cost Markov control processes
O Hernández-Lerma, O Vega-Amaya… - SIAM Journal on Control …, 1999 - SIAM
This paper studies several average-cost criteria for Markov control processes on Borel
spaces with possibly unbounded costs. Under suitable hypotheses we show (i) the …
Discounted continuous-time constrained Markov decision processes in Polish spaces
X Guo, X Song - The Annals of Applied Probability, 2011 - JSTOR
This paper is devoted to studying constrained continuous-time Markov decision processes
(MDPs) in the class of randomized policies depending on state histories. The transition rates …
Splitting randomized stationary policies in total-reward Markov decision processes
EA Feinberg, UG Rothblum - Mathematics of Operations …, 2012 - pubsonline.informs.org
This paper studies a discrete-time total-reward Markov decision process (MDP) with a given
initial state distribution. A (randomized) stationary policy can be split on a given set of states …