State-action control barrier functions: Imposing safety on learning-based control with low online computational costs

K He, S Shi, T Boom, B De Schutter - arXiv preprint arXiv:2312.11255, 2023 - arxiv.org
Learning-based control with safety guarantees usually requires real-time safety certification
and modifications of possibly unsafe learning-based policies. The control barrier function …

Accelerated point-wise maximum approach to approximate dynamic programming

PN Beuchat, J Warrington… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
In this article, we describe an approximate dynamic programming (ADP) approach to
compute lower bounds on the optimal value function for a discrete time, continuous space …

Point-wise maximum approach to approximate dynamic programming

PN Beuchat, J Warrington… - 2017 IEEE 56th Annual …, 2017 - ieeexplore.ieee.org
In this paper we study value function approximation techniques that are based on the Linear
Programming formulation of Approximate Dynamic Programming. We propose a point-wise …

An Approximate Quadratic Programming for Efficient Bellman Equation Solution

J Su, H Cheng, H Guo, R Huang, Z Peng - IEEE Access, 2019 - ieeexplore.ieee.org
This paper proposes an efficient algorithm which relies on quadratic programming for
approximately solving the Bellman equation in reinforcement learning problem and …