[图书][B] Moments, positive polynomials and their applications

JB Lasserre - 2009 - books.google.com
Many important applications in global optimization, algebra, probability and statistics,
applied mathematics, control theory, financial mathematics, inverse problems, etc. can be …

[图书][B] Markov decision processes with applications to finance

N Bäuerle, U Rieder - 2011 - books.google.com
The theory of Markov decision processes focuses on controlled Markov chains in discrete
time. The authors establish the theory for general state and action spaces and at the same …

[图书][B] Handbook of Markov decision processes: methods and applications

EA Feinberg, A Shwartz - 2012 - books.google.com
Eugene A. Feinberg Adam Shwartz This volume deals with the theory of Markov Decision
Processes (MDPs) and their applications. Each chapter was written by a leading expert in …

[图书][B] Control techniques for complex networks

S Meyn - 2008 - books.google.com
Power grids, flexible manufacturing, cellular communications: interconnectedness has
consequences. This remarkable book gives the tools and philosophy you need to build …

[图书][B] Continuous-time Markov decision processes

X Guo, O Hernández-Lerma, X Guo… - 2009 - Springer
In Chap. 2, we formally introduce the concepts associated to a continuous time MDP.
Namely, the basic model of continuous-time MDPs and the concept of a Markov policy are …

[HTML][HTML] Batch policy learning in average reward markov decision processes

P Liao, Z Qi, R Wan, P Klasnja, SA Murphy - Annals of statistics, 2022 - ncbi.nlm.nih.gov
We consider the batch (off-line) policy learning problem in the infinite horizon Markov
Decision Process. Motivated by mobile health applications, we focus on learning a policy …

Nonlinear optimal control via occupation measures and LMI-relaxations

JB Lasserre, D Henrion, C Prieur, E Trélat - SIAM journal on control and …, 2008 - SIAM
We consider the class of nonlinear optimal control problems (OCPs) with polynomial data,
ie, the differential equation, state and control constraints, and cost are all described by …

[图书][B] Markov chains and invariant probabilities

O Hernández-Lerma, JB Lasserre - 2012 - books.google.com
This book is about discrete-time, time-homogeneous, Markov chains (Mes) and their ergodic
behavior. To this end, most of the material is in fact about stable Mes, by which we mean …

Pre-IPO operational and financial decisions

V Babich, MJ Sobel - Management Science, 2004 - pubsonline.informs.org
Many owners of growing privately held firms make operational and financial decisions in an
effort to maximize the expected present value of the proceeds from an initial public offering …

Performance Bounds in ‐norm for Approximate Value Iteration

R Munos - SIAM journal on control and optimization, 2007 - SIAM
Approximate value iteration (AVI) is a method for solving large Markov decision problems by
approximating the optimal value function with a sequence of value function representations …