[BOOK][B] Moments, positive polynomials and their applications
JB Lasserre - 2009 - books.google.com
Many important applications in global optimization, algebra, probability and statistics,
applied mathematics, control theory, financial mathematics, inverse problems, etc. can be …
[BOOK][B] Markov decision processes with applications to finance
The theory of Markov decision processes focuses on controlled Markov chains in discrete
time. The authors establish the theory for general state and action spaces and at the same …
[BOOK][B] Handbook of Markov decision processes: methods and applications
EA Feinberg, A Shwartz - 2012 - books.google.com
This volume deals with the theory of Markov Decision
Processes (MDPs) and their applications. Each chapter was written by a leading expert in …
[BOOK][B] Control techniques for complex networks
S Meyn - 2008 - books.google.com
Power grids, flexible manufacturing, cellular communications: interconnectedness has
consequences. This remarkable book gives the tools and philosophy you need to build …
[BOOK][B] Continuous-time Markov decision processes
X Guo, O Hernández-Lerma, X Guo… - 2009 - Springer
In Chap. 2, we formally introduce the concepts associated to a continuous time MDP.
Namely, the basic model of continuous-time MDPs and the concept of a Markov policy are …
[HTML][HTML] Batch policy learning in average reward Markov decision processes
We consider the batch (off-line) policy learning problem in the infinite horizon Markov
Decision Process. Motivated by mobile health applications, we focus on learning a policy …
Nonlinear optimal control via occupation measures and LMI-relaxations
We consider the class of nonlinear optimal control problems (OCPs) with polynomial data,
i.e., the differential equation, state and control constraints, and cost are all described by …
[BOOK][B] Markov chains and invariant probabilities
O Hernández-Lerma, JB Lasserre - 2012 - books.google.com
This book is about discrete-time, time-homogeneous, Markov chains (MCs) and their ergodic
behavior. To this end, most of the material is in fact about stable MCs, by which we mean …
Pre-IPO operational and financial decisions
Many owners of growing privately held firms make operational and financial decisions in an
effort to maximize the expected present value of the proceeds from an initial public offering …
Performance Bounds in L_p-norm for Approximate Value Iteration
R Munos - SIAM journal on control and optimization, 2007 - SIAM
Approximate value iteration (AVI) is a method for solving large Markov decision problems by
approximating the optimal value function with a sequence of value function representations …