Reinforcement learning for control: Performance, stability, and deep approximators

L Buşoniu, T De Bruin, D Tolić, J Kober… - Annual Reviews in …, 2018 - Elsevier
Reinforcement learning (RL) offers powerful algorithms to search for optimal controllers of
systems with nonlinear, possibly stochastic dynamics that are unknown or highly uncertain …

High-speed finite control set model predictive control for power electronics

B Stellato, T Geyer, PJ Goulart - IEEE Transactions on power …, 2016 - ieeexplore.ieee.org
Common approaches for direct model predictive control (MPC) for current reference tracking
in power electronics suffer from the high computational complexity encountered when …

Proximal point imitation learning

L Viano, A Kamoutsi, G Neu… - Advances in Neural …, 2022 - proceedings.neurips.cc
This work develops new algorithms with rigorous efficiency guarantees for infinite horizon
imitation learning (IL) with linear function approximation without restrictive coherence …

Data-driven optimal control of affine systems: A linear programming perspective

A Martinelli, M Gargiani, M Draskovic… - IEEE Control Systems …, 2022 - ieeexplore.ieee.org
In this letter, we discuss the problem of optimal control for affine systems in the context of
data-driven linear programming. First, we introduce a unified framework for the fixed point …

Data-driven optimal control via linear programming: boundedness guarantees

L Falconi, A Martinelli, J Lygeros - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
The linear programming (LP) approach is, together with value iteration and policy iteration,
one of the three fundamental methods to solve optimal control problems in a dynamic …

Data-driven dynamic multiobjective optimal control: An aspiration-satisfying reinforcement learning approach

M Mazouchi, Y Yang, H Modares - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
This article presents an iterative data-driven algorithm for solving dynamic multiobjective
(MO) optimal control problems arising in control of nonlinear continuous-time systems. It is …

[HTML][HTML] Data-driven optimal control with a relaxed linear program

A Martinelli, M Gargiani, J Lygeros - Automatica, 2022 - Elsevier
The linear programming (LP) approach has a long history in the theory of approximate
dynamic programming. When it comes to computation, however, the LP approach often …

Efficient performance bounds for primal-dual reinforcement learning from demonstrations

A Kamoutsi, G Banjac… - … Conference on Machine …, 2021 - proceedings.mlr.press
We consider large-scale Markov decision processes with an unknown cost function and
address the problem of learning a policy from a finite set of expert demonstrations. We …

A data-driven policy iteration scheme based on linear programming

G Banjac, J Lygeros - 2019 IEEE 58th Conference on Decision …, 2019 - ieeexplore.ieee.org
We consider the problem of learning discounted-cost optimal control policies for unknown
deterministic discrete-time systems with continuous state and action spaces. We show that a …

A Q-learning predictive control scheme with guaranteed stability

L Beckenbach, P Osinenko, S Streif - European Journal of Control, 2020 - Elsevier
Abstract Model-based predictive controllers are used to tackle control tasks in which
constraints on state, input or both need to be satisfied. These controllers commonly optimize …