Continuous-time reinforcement learning control: A review of theoretical results, insights on performance, and needs for new designs
BA Wallace, J Si - IEEE Transactions on Neural Networks and …, 2023 - ieeexplore.ieee.org
This exposition discusses continuous-time reinforcement learning (CT-RL) for the control of
affine nonlinear systems. We review four seminal methods that are the centerpieces of the …
affine nonlinear systems. We review four seminal methods that are the centerpieces of the …
Kernel-Based Optimal Control: An Infinitesimal Generator Approach
P Bevanda, N Hosichen, T Wittmann… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper presents a novel approach for optimal control of nonlinear stochastic systems
using infinitesimal generator learning within infinite-dimensional reproducing kernel Hilbert …
using infinitesimal generator learning within infinite-dimensional reproducing kernel Hilbert …
Reinforcement twinning: From digital twins to model-based reinforcement learning
L Schena, PA Marques, R Poletti, S Ahizi… - Journal of …, 2024 - Elsevier
The concept of digital twins promises to revolutionize engineering by offering new avenues
for optimization, control, and predictive maintenance. We propose a novel framework for …
for optimization, control, and predictive maintenance. We propose a novel framework for …
Generalized Policy Iteration using Tensor Approximation for Hybrid Control
Control of dynamic systems involving hybrid actions is a challenging task in robotics. To
address this, we present a novel algorithm called Generalized Policy Iteration using Tensor …
address this, we present a novel algorithm called Generalized Policy Iteration using Tensor …
Tracking Control for a Spherical Pendulum via Curriculum Reinforcement Learning
Reinforcement Learning (RL) allows learning non-trivial robot control laws purely from data.
However, many successful applications of RL have relied on ad-hoc regularizations, such as …
However, many successful applications of RL have relied on ad-hoc regularizations, such as …
Managing temporal resolution in continuous value estimation: A fundamental trade-off
A default assumption in reinforcement learning (RL) and optimal control is that observations
arrive at discrete time points on a fixed clock cycle. Yet, many applications involve …
arrive at discrete time points on a fixed clock cycle. Yet, many applications involve …
Continuous-Time Reinforcement Learning: New Design Algorithms With Theoretical Insights and Performance Guarantees
BA Wallace, J Si - IEEE Transactions on Neural Networks and …, 2024 - ieeexplore.ieee.org
Continuous-time reinforcement learning (CT-RL) methods hold great promise in real-world
applications. Adaptive dynamic programming (ADP)-based CT-RL algorithms, especially …
applications. Adaptive dynamic programming (ADP)-based CT-RL algorithms, especially …
Reinforcement Learning Control of Hypersonic Vehicles and Performance Evaluations
BA Wallace, J Si - Journal of Guidance, Control, and Dynamics, 2024 - arc.aiaa.org
This work presents a new framework for model-based continuous-time reinforcement
learning (CT-RL) control of hypersonic vehicles (HSVs). The predominant classes of CT-RL …
learning (CT-RL) control of hypersonic vehicles (HSVs). The predominant classes of CT-RL …
Mitigating the curse of horizon in Monte-Carlo returns
The standard framework in reinforcement learning (RL) dictates that an agent should use
every observation collected from interactions with the environment when updating its value …
every observation collected from interactions with the environment when updating its value …
Predictive and Prescriptive Trees for Optimization and Control Problems
CW Kim - 2024 - dspace.mit.edu
This thesis introduces novel methods to expedite the solution of a broad range of
optimization and control problems using machine learning, specifically decision tree …
optimization and control problems using machine learning, specifically decision tree …