Further topics on discrete-time Markov control processes

JB Lasserre - 2009 - books.google.com

Many important applications in global optimization, algebra, probability and statistics,
applied mathematics, control theory, financial mathematics, inverse problems, etc. can be …

Cited by: 1417 · Related articles · All 11 versions

[BOOK][B] Markov decision processes with applications to finance

N Bäuerle, U Rieder - 2011 - books.google.com

The theory of Markov decision processes focuses on controlled Markov chains in discrete
time. The authors establish the theory for general state and action spaces and at the same …

Cited by: 542 · Related articles · All 11 versions

[BOOK][B] Handbook of Markov decision processes: methods and applications

EA Feinberg, A Shwartz - 2012 - books.google.com

This volume deals with the theory of Markov Decision
Processes (MDPs) and their applications. Each chapter was written by a leading expert in …

Cited by: 649 · Related articles · All 4 versions

[BOOK][B] Control techniques for complex networks

S Meyn - 2008 - books.google.com

Power grids, flexible manufacturing, cellular communications: interconnectedness has
consequences. This remarkable book gives the tools and philosophy you need to build …

Cited by: 682 · Related articles · All 13 versions

[BOOK][B] Continuous-time Markov decision processes

X Guo, O Hernández-Lerma, X Guo… - 2009 - Springer

In Chap. 2, we formally introduce the concepts associated to a continuous time MDP.
Namely, the basic model of continuous-time MDPs and the concept of a Markov policy are …

Cited by: 445 · Related articles · All 11 versions

[HTML] Batch policy learning in average reward Markov decision processes

P Liao, Z Qi, R Wan, P Klasnja, SA Murphy - Annals of statistics, 2022 - ncbi.nlm.nih.gov

We consider the batch (off-line) policy learning problem in the infinite horizon Markov
Decision Process. Motivated by mobile health applications, we focus on learning a policy …

Cited by: 82 · Related articles · All 9 versions

Nonlinear optimal control via occupation measures and LMI-relaxations

JB Lasserre, D Henrion, C Prieur, E Trélat - SIAM journal on control and …, 2008 - SIAM

We consider the class of nonlinear optimal control problems (OCPs) with polynomial data,
i.e., the differential equation, state and control constraints, and cost are all described by …

Cited by: 323 · Related articles · All 26 versions

[BOOK][B] Markov chains and invariant probabilities

O Hernández-Lerma, JB Lasserre - 2012 - books.google.com

This book is about discrete-time, time-homogeneous, Markov chains (MCs) and their ergodic
behavior. To this end, most of the material is in fact about stable MCs, by which we mean …

Cited by: 290 · Related articles · All 8 versions

Pre-IPO operational and financial decisions

V Babich, MJ Sobel - Management Science, 2004 - pubsonline.informs.org

Many owners of growing privately held firms make operational and financial decisions in an
effort to maximize the expected present value of the proceeds from an initial public offering …

Cited by: 298 · Related articles · All 9 versions

Performance Bounds in L_p-norm for Approximate Value Iteration

R Munos - SIAM journal on control and optimization, 2007 - SIAM

Approximate value iteration (AVI) is a method for solving large Markov decision problems by
approximating the optimal value function with a sequence of value function representations …

Cited by: 210 · Related articles · All 11 versions