State-action control barrier functions: Imposing safety on learning-based control with low online computational costs
Learning-based control with safety guarantees usually requires real-time safety certification
and modifications of possibly unsafe learning-based policies. The control barrier function …
and modifications of possibly unsafe learning-based policies. The control barrier function …
Accelerated point-wise maximum approach to approximate dynamic programming
PN Beuchat, J Warrington… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
In this article, we describe an approximate dynamic programming (ADP) approach to
compute lower bounds on the optimal value function for a discrete time, continuous space …
compute lower bounds on the optimal value function for a discrete time, continuous space …
Point-wise maximum approach to approximate dynamic programming
PN Beuchat, J Warrington… - 2017 IEEE 56th Annual …, 2017 - ieeexplore.ieee.org
In this paper we study value function approximation techniques that are based on the Linear
Programming formulation of Approximate Dynamic Programming. We propose a point-wise …
Programming formulation of Approximate Dynamic Programming. We propose a point-wise …
An Approximate Quadratic Programming for Efficient Bellman Equation Solution
This paper proposes an efficient algorithm which relies on quadratic programming for
approximately solving the Bellman equation in reinforcement learning problem and …
approximately solving the Bellman equation in reinforcement learning problem and …