Approximate dynamic programming via penalty functions

Articles citing this work (4 results)

State-action control barrier functions: Imposing safety on learning-based control with low online computational costs

K He, S Shi, T Boom, B De Schutter - arXiv preprint arXiv:2312.11255, 2023 - arxiv.org

Learning-based control with safety guarantees usually requires real-time safety certification
and modifications of possibly unsafe learning-based policies. The control barrier function …

Cited by: 1; versions: 2

Accelerated point-wise maximum approach to approximate dynamic programming

PN Beuchat, J Warrington… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

In this article, we describe an approximate dynamic programming (ADP) approach to
compute lower bounds on the optimal value function for a discrete time, continuous space …

Cited by: 6; versions: 4

Point-wise maximum approach to approximate dynamic programming

PN Beuchat, J Warrington… - 2017 IEEE 56th Annual …, 2017 - ieeexplore.ieee.org

In this paper we study value function approximation techniques that are based on the Linear
Programming formulation of Approximate Dynamic Programming. We propose a point-wise …

Cited by: 7; versions: 4

An Approximate Quadratic Programming for Efficient Bellman Equation Solution

J Su, H Cheng, H Guo, R Huang, Z Peng - IEEE Access, 2019 - ieeexplore.ieee.org

This paper proposes an efficient algorithm which relies on quadratic programming for
approximately solving the Bellman equation in reinforcement learning problem and …

Cited by: 2; versions: 3