Author
Ronald Edward Parr
Publication date
1998
Institution
University of California, Berkeley
Description
This dissertation investigates the use of hierarchy and problem decomposition as a means of solving large, stochastic, sequential decision problems. These problems are framed as Markov decision problems (MDPs). The new technical content of this dissertation begins with a discussion of the concept of temporal abstraction. Temporal abstraction is shown to be equivalent to the transformation of a policy defined over a region of an MDP to an action in a semi-Markov decision problem (SMDP). Several algorithms are presented for performing this transformation efficiently.
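The transformation described above, from a policy over a region of an MDP to a single SMDP action, can be illustrated with a small sketch. This is not one of the dissertation's algorithms; it is a plain fixed-point iteration under assumed conventions. The function name `policy_to_smdp_action`, the dictionary layout of `trans` and `reward`, and the toy chain are all hypothetical. For each region state it estimates the expected discounted reward collected before leaving the region, and a discounted distribution over exit states.

```python
# Hypothetical sketch: summarize a fixed policy on a region as one SMDP action.
# trans[s] maps next_state -> probability under the fixed policy (assumed layout).
# reward[s] is the one-step reward for following the policy in s (assumed layout).

def policy_to_smdp_action(region, trans, reward, gamma, sweeps=1000):
    # Exit states are successors that lie outside the region.
    exits = sorted({t for s in region for t in trans[s] if t not in region})
    value = {s: 0.0 for s in region}
    exit_model = {s: {e: 0.0 for e in exits} for s in region}
    for _ in range(sweeps):
        new_value, new_model = {}, {}
        for s in region:
            v = reward[s]
            m = {e: 0.0 for e in exits}
            for t, p in trans[s].items():
                if t in region:
                    # Still inside the region: keep accumulating, discounted.
                    v += gamma * p * value[t]
                    for e in exits:
                        m[e] += gamma * p * exit_model[t][e]
                else:
                    # Leaving the region: record the discounted exit mass.
                    m[t] += gamma * p
            new_value[s], new_model[s] = v, m
        value, exit_model = new_value, new_model
    return value, exit_model

# Toy chain (assumed): states 0, 1, 2 form the region and state 3 is the exit.
# The policy "move right" succeeds with probability 0.8 and stays put otherwise.
region = {0, 1, 2}
trans = {s: {s: 0.2, s + 1: 0.8} for s in region}
reward = {s: -1.0 for s in region}
value, exit_model = policy_to_smdp_action(region, trans, reward, gamma=0.9)
```

In this sketch, `value[s]` plays the role of the SMDP action's reward when started in `s`, and `exit_model[s][e]` plays the role of its discounted transition model to exit state `e`. A planner over the abstract problem could then treat the whole region policy as a single action.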
Total citations
[Chart: citations per year, 1998-2024]