Hierarchical control and learning for Markov decision processes

Author

Ronald Edward Parr

Publication date

1998

Institution

University of California, Berkeley

Description

This dissertation investigates the use of hierarchy and problem decomposition as a means of solving large, stochastic, sequential decision problems. These problems are framed as Markov decision problems (MDPs). The new technical content of this dissertation begins with a discussion of the concept of temporal abstraction. Temporal abstraction is shown to be equivalent to the transformation of a policy defined over a region of an MDP to an action in a semi-Markov decision problem (SMDP). Several algorithms are presented for performing this transformation efficiently.
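As a rough illustration of the transformation described above, the sketch below evaluates a fixed policy on a small region and summarizes it as a single SMDP-style action: an expected discounted reward collected before leaving the region, plus a discounted distribution over exit states. This is a generic iterative policy evaluation, not necessarily one of the dissertation's algorithms. The function name `policy_to_smdp_action`, its parameters, the fixed sweep count, and the two-state corridor example are all assumptions made for illustration.

```python
def policy_to_smdp_action(p_in, p_out, reward, gamma, sweeps=1000):
    """Summarize a fixed policy on a region as one temporally extended action.

    Hypothetical inputs (all already conditioned on the region's policy):
      p_in[s][t]  : probability of moving from region state s to region state t
      p_out[s][e] : probability of leaving region state s into exit state e
      reward[s]   : expected one-step reward in region state s
    Returns (value, exit_model), where value[s] is the expected discounted
    reward collected before exiting, and exit_model[s][e] is the expected
    value of gamma**steps on the event that the region is exited at e.
    """
    n = len(reward)
    m = len(p_out[0])
    value = [0.0] * n
    exit_model = [[0.0] * m for _ in range(n)]
    # Fixed number of sweeps for simplicity; a tolerance check would also work.
    for _ in range(sweeps):
        value = [
            reward[s] + gamma * sum(p_in[s][t] * value[t] for t in range(n))
            for s in range(n)
        ]
        exit_model = [
            [
                gamma * (p_out[s][e]
                         + sum(p_in[s][t] * exit_model[t][e] for t in range(n)))
                for e in range(m)
            ]
            for s in range(n)
        ]
    return value, exit_model


# Assumed toy region: a two-state corridor whose policy always moves right.
# State 0 -> state 1 -> single exit state, with reward -1 per step.
p_in = [[0.0, 1.0], [0.0, 0.0]]
p_out = [[0.0], [1.0]]
value, exit_model = policy_to_smdp_action(p_in, p_out, [-1.0, -1.0], gamma=0.9)
```

In this toy case the region policy behaves like one action that, from state 0, costs about 1.9 in discounted reward and reaches the exit with discount 0.81 (two steps at 0.9 each). A higher-level planner could then treat the whole corridor as a single decision.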

Total citations

Cited by 402

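Citations per year: 1998: 6, 1999: 4, 2000: 12, 2001: 7, 2002: 23, 2003: 19, 2004: 18, 2005: 27, 2006: 28, 2007: 21, 2008: 15, 2009: 22, 2010: 17, 2011: 19, 2012: 20, 2013: 14, 2014: 17, 2015: 13, 2016: 14, 2017: 6, 2018: 8, 2019: 16, 2020: 15, 2021: 8, 2022: 14, 2023: 13, 2024: 3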
