[PDF][PDF] Scaling Learning based Policy Optimization for Temporal Logic Tasks by Controller Network Dropout
This paper introduces a model-based approach for training feedback controllers for an
autonomous agent operating in a highly nonlinear (albeit deterministic) environment. We …
autonomous agent operating in a highly nonlinear (albeit deterministic) environment. We …
Concurrent learning of control policy and unknown safety specifications in reinforcement learning
L Yifru, A Baheri - IEEE Open Journal of Control Systems, 2024 - ieeexplore.ieee.org
Reinforcement learning (RL) has revolutionized decision-making across a wide range of
domains over the past few decades. Yet, deploying RL policies in real-world scenarios …
domains over the past few decades. Yet, deploying RL policies in real-world scenarios …
Exploration in reward machines with low regret
H Bourel, A Jonsson, OA Maillard… - International …, 2023 - proceedings.mlr.press
We study reinforcement learning (RL) for decision processes with non-Markovian reward, in
which high-level knowledge in the form of reward machines is available to the learner …
which high-level knowledge in the form of reward machines is available to the learner …
Scaling Learning-based Policy Optimization for Temporal Logic Tasks by Controller Network Dropout
This article introduces a model-based approach for training feedback controllers for an
autonomous agent operating in a highly non-linear (albeit deterministic) environment. We …
autonomous agent operating in a highly non-linear (albeit deterministic) environment. We …
Repairing learning-enabled controllers while preserving what works
Learning-enabled controllers have been adopted in various cyber-physical systems (CPS).
When a learning-enabled controller fails to accomplish its task from a set of initial states …
When a learning-enabled controller fails to accomplish its task from a set of initial states …
Concurrent Learning of Policy and Unknown Safety Constraints in Reinforcement Learning
L Yifru, A Baheri - arXiv preprint arXiv:2402.15893, 2024 - arxiv.org
Reinforcement learning (RL) has revolutionized decision-making across a wide range of
domains over the past few decades. Yet, deploying RL policies in real-world scenarios …
domains over the past few decades. Yet, deploying RL policies in real-world scenarios …
Space Processor Computation Time Analysis for Reinforcement Learning and Run Time Assurance Control Policies
K Dunlap, N Hamilton, F Viramontes… - arXiv preprint arXiv …, 2024 - arxiv.org
As the number of spacecraft on orbit continues to grow, it is challenging for human operators
to constantly monitor and plan for all missions. Autonomous control methods such as …
to constantly monitor and plan for all missions. Autonomous control methods such as …
[PDF][PDF] RMLGym: a Formal Reward Machine Framework for Reinforcement Learning.
Reinforcement learning (RL) is a powerful technique for learning optimal policies from trial
and error. However, designing a reward function that captures the desired behavior of an …
and error. However, designing a reward function that captures the desired behavior of an …
Online control synthesis for uncertain systems under signal temporal logic specifications
Signal temporal logic (STL) formulas have been widely used as a formal language to
express complex robotic specifications, thanks to their rich expressiveness and explicit time …
express complex robotic specifications, thanks to their rich expressiveness and explicit time …
Space Processor Computation Time Analysis for Reinforcement Learning and Run Time Assurance Control Policies
NP Hamilton, K Dunlap, F Viramontes… - AIAA SCITECH 2025 …, 2025 - arc.aiaa.org
As the number of spacecraft on orbit continues to grow, it is challenging for human operators
to constantly monitor and plan for all missions. Autonomous control methods such as …
to constantly monitor and plan for all missions. Autonomous control methods such as …