Continuous-time fitted value iteration for robust policies

Continuous-time reinforcement learning control: A review of theoretical results, insights on performance, and needs for new designs

BA Wallace, J Si - IEEE Transactions on Neural Networks and …, 2023 - ieeexplore.ieee.org

This exposition discusses continuous-time reinforcement learning (CT-RL) for the control of
affine nonlinear systems. We review four seminal methods that are the centerpieces of the …

被引用次数：9 相关文章所有 4 个版本

[PDF] arxiv.org

Kernel-Based Optimal Control: An Infinitesimal Generator Approach

P Bevanda, N Hosichen, T Wittmann… - arXiv preprint arXiv …, 2024 - arxiv.org

This paper presents a novel approach for optimal control of nonlinear stochastic systems
using infinitesimal generator learning within infinite-dimensional reproducing kernel Hilbert …

被引用次数：2 相关文章所有 3 个版本

[PDF] arxiv.org

Reinforcement twinning: From digital twins to model-based reinforcement learning

L Schena, PA Marques, R Poletti, S Ahizi… - Journal of …, 2024 - Elsevier

The concept of digital twins promises to revolutionize engineering by offering new avenues
for optimization, control, and predictive maintenance. We propose a novel framework for …

被引用次数：3 相关文章所有 3 个版本

[PDF] openreview.net

Generalized Policy Iteration using Tensor Approximation for Hybrid Control

S Shetty, T Xue, S Calinon - The Twelfth International Conference …, 2024 - openreview.net

Control of dynamic systems involving hybrid actions is a challenging task in robotics. To
address this, we present a novel algorithm called Generalized Policy Iteration using Tensor …

被引用次数：6 相关文章所有 3 个版本

[PDF] arxiv.org

Tracking Control for a Spherical Pendulum via Curriculum Reinforcement Learning

P Klink, F Wolf, K Ploeger, J Peters… - arXiv preprint arXiv …, 2023 - arxiv.org

Reinforcement Learning (RL) allows learning non-trivial robot control laws purely from data.
However, many successful applications of RL have relied on ad-hoc regularizations, such as …

被引用次数：1 相关文章所有 4 个版本

[PDF] neurips.cc

Managing temporal resolution in continuous value estimation: A fundamental trade-off

ZV Zhang, J Kirschner, J Zhang… - Advances in …, 2024 - proceedings.neurips.cc

A default assumption in reinforcement learning (RL) and optimal control is that observations
arrive at discrete time points on a fixed clock cycle. Yet, many applications involve …

被引用次数：4 相关文章所有 8 个版本

[PDF] arxiv.org

Continuous-Time Reinforcement Learning: New Design Algorithms With Theoretical Insights and Performance Guarantees

BA Wallace, J Si - IEEE Transactions on Neural Networks and …, 2024 - ieeexplore.ieee.org

Continuous-time reinforcement learning (CT-RL) methods hold great promise in real-world
applications. Adaptive dynamic programming (ADP)-based CT-RL algorithms, especially …

被引用次数：4 相关文章所有 7 个版本

Reinforcement Learning Control of Hypersonic Vehicles and Performance Evaluations

BA Wallace, J Si - Journal of Guidance, Control, and Dynamics, 2024 - arc.aiaa.org

This work presents a new framework for model-based continuous-time reinforcement
learning (CT-RL) control of hypersonic vehicles (HSVs). The predominant classes of CT-RL …

Mitigating the curse of horizon in Monte-Carlo returns

A Ayoub, D Szepesvari, F Zanini, B Chan… - Reinforcement …, 2024 - openreview.net

The standard framework in reinforcement learning (RL) dictates that an agent should use
every observation collected from interactions with the environment when updating its value …

被引用次数：1 相关文章所有 2 个版本

[PDF] mit.edu

Predictive and Prescriptive Trees for Optimization and Control Problems

CW Kim - 2024 - dspace.mit.edu

This thesis introduces novel methods to expedite the solution of a broad range of
optimization and control problems using machine learning, specifically decision tree …