Continuous-time reinforcement learning control: A review of theoretical results, insights on performance, and needs for new designs

BA Wallace, J Si - IEEE Transactions on Neural Networks and …, 2023 - ieeexplore.ieee.org
This exposition discusses continuous-time reinforcement learning (CT-RL) for the control of
affine nonlinear systems. We review four seminal methods that are the centerpieces of the …

Kernel-Based Optimal Control: An Infinitesimal Generator Approach

P Bevanda, N Hosichen, T Wittmann… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper presents a novel approach for optimal control of nonlinear stochastic systems
using infinitesimal generator learning within infinite-dimensional reproducing kernel Hilbert …

Reinforcement twinning: From digital twins to model-based reinforcement learning

L Schena, PA Marques, R Poletti, S Ahizi… - Journal of …, 2024 - Elsevier
The concept of digital twins promises to revolutionize engineering by offering new avenues
for optimization, control, and predictive maintenance. We propose a novel framework for …

Generalized Policy Iteration using Tensor Approximation for Hybrid Control

S Shetty, T Xue, S Calinon - The Twelfth International Conference …, 2024 - openreview.net
Control of dynamic systems involving hybrid actions is a challenging task in robotics. To
address this, we present a novel algorithm called Generalized Policy Iteration using Tensor …

Tracking Control for a Spherical Pendulum via Curriculum Reinforcement Learning

P Klink, F Wolf, K Ploeger, J Peters… - arXiv preprint arXiv …, 2023 - arxiv.org
Reinforcement Learning (RL) allows learning non-trivial robot control laws purely from data.
However, many successful applications of RL have relied on ad-hoc regularizations, such as …

Managing temporal resolution in continuous value estimation: A fundamental trade-off

ZV Zhang, J Kirschner, J Zhang… - Advances in …, 2024 - proceedings.neurips.cc
A default assumption in reinforcement learning (RL) and optimal control is that observations
arrive at discrete time points on a fixed clock cycle. Yet, many applications involve …

Continuous-Time Reinforcement Learning: New Design Algorithms With Theoretical Insights and Performance Guarantees

BA Wallace, J Si - IEEE Transactions on Neural Networks and …, 2024 - ieeexplore.ieee.org
Continuous-time reinforcement learning (CT-RL) methods hold great promise in real-world
applications. Adaptive dynamic programming (ADP)-based CT-RL algorithms, especially …

Reinforcement Learning Control of Hypersonic Vehicles and Performance Evaluations

BA Wallace, J Si - Journal of Guidance, Control, and Dynamics, 2024 - arc.aiaa.org
This work presents a new framework for model-based continuous-time reinforcement
learning (CT-RL) control of hypersonic vehicles (HSVs). The predominant classes of CT-RL …

Mitigating the curse of horizon in Monte-Carlo returns

A Ayoub, D Szepesvari, F Zanini, B Chan… - Reinforcement …, 2024 - openreview.net
The standard framework in reinforcement learning (RL) dictates that an agent should use
every observation collected from interactions with the environment when updating its value …

Predictive and Prescriptive Trees for Optimization and Control Problems

CW Kim - 2024 - dspace.mit.edu
This thesis introduces novel methods to expedite the solution of a broad range of
optimization and control problems using machine learning, specifically decision tree …