Imitation and transfer learning for LQG control

T Guo, AAR Al Makdah, V Krishnan… - IEEE Control Systems …, 2023 - ieeexplore.ieee.org
IEEE Control Systems Letters, 2023ieeexplore.ieee.org
In this letter we study an imitation and transfer learning setting for Linear Quadratic Gaussian
(LQG) control, where (i) the system dynamics, noise statistics and cost function are unknown
and expert data is provided (that is, sequences of optimal inputs and outputs) to learn the
LQG controller, and (ii) multiple control tasks are performed for the same system but with
different LQG costs. We show that the LQG controller can be learned from a set of expert
trajectories of length (l+ 2)− 1, with and the dimension of the system state and output …
In this letter we study an imitation and transfer learning setting for Linear Quadratic Gaussian (LQG) control, where (i) the system dynamics, noise statistics and cost function are unknown and expert data is provided (that is, sequences of optimal inputs and outputs) to learn the LQG controller, and (ii) multiple control tasks are performed for the same system but with different LQG costs. We show that the LQG controller can be learned from a set of expert trajectories of length (l+2)−1, with and the dimension of the system state and output, respectively. Further, the controller can be decomposed as the product of an estimation matrix, which depends only on the system dynamics, and a control matrix, which depends on the LQG cost. This data-based separation principle allows us to transfer the estimation matrix across different LQG tasks, and to reduce the length of the expert trajectories needed to learn the LQG controller to with the dimension of the inputs (for single-input systems with , this yields approximately a 50% reduction of the required expert data).
ieeexplore.ieee.org
以上显示的是最相近的搜索结果。 查看全部搜索结果