[PDF][PDF] Scaling Learning based Policy Optimization for Temporal Logic Tasks by Controller Network Dropout

N Hashemi, B Hoxha, D Prokhorov… - arXiv preprint arXiv …, 2024 - researchgate.net
This paper introduces a model-based approach for training feedback controllers for an
autonomous agent operating in a highly nonlinear (albeit deterministic) environment. We …

Concurrent learning of control policy and unknown safety specifications in reinforcement learning

L Yifru, A Baheri - IEEE Open Journal of Control Systems, 2024 - ieeexplore.ieee.org
Reinforcement learning (RL) has revolutionized decision-making across a wide range of
domains over the past few decades. Yet, deploying RL policies in real-world scenarios …

Exploration in reward machines with low regret

H Bourel, A Jonsson, OA Maillard… - International …, 2023 - proceedings.mlr.press
We study reinforcement learning (RL) for decision processes with non-Markovian reward, in
which high-level knowledge in the form of reward machines is available to the learner …

Scaling Learning-based Policy Optimization for Temporal Logic Tasks by Controller Network Dropout

N Hashemi, B Hoxha, D Prokhorov… - ACM Transactions on …, 2024 - dl.acm.org
This article introduces a model-based approach for training feedback controllers for an
autonomous agent operating in a highly non-linear (albeit deterministic) environment. We …

Repairing learning-enabled controllers while preserving what works

P Lu, M Cleaveland, O Sokolsky, I Lee… - 2024 ACM/IEEE 15th …, 2024 - ieeexplore.ieee.org
Learning-enabled controllers have been adopted in various cyber-physical systems (CPS).
When a learning-enabled controller fails to accomplish its task from a set of initial states …

Concurrent Learning of Policy and Unknown Safety Constraints in Reinforcement Learning

L Yifru, A Baheri - arXiv preprint arXiv:2402.15893, 2024 - arxiv.org
Reinforcement learning (RL) has revolutionized decision-making across a wide range of
domains over the past few decades. Yet, deploying RL policies in real-world scenarios …

Space Processor Computation Time Analysis for Reinforcement Learning and Run Time Assurance Control Policies

K Dunlap, N Hamilton, F Viramontes… - arXiv preprint arXiv …, 2024 - arxiv.org
As the number of spacecraft on orbit continues to grow, it is challenging for human operators
to constantly monitor and plan for all missions. Autonomous control methods such as …

[PDF][PDF] RMLGym: a Formal Reward Machine Framework for Reinforcement Learning.

H Unniyankal, F Belardinelli, A Ferrando, V Malvone - WOA, 2023 - ceur-ws.org
Reinforcement learning (RL) is a powerful technique for learning optimal policies from trial
and error. However, designing a reward function that captures the desired behavior of an …

Online control synthesis for uncertain systems under signal temporal logic specifications

P Yu, Y Gao, FJ Jiang, KH Johansson… - … Journal of Robotics …, 2024 - journals.sagepub.com
Signal temporal logic (STL) formulas have been widely used as a formal language to
express complex robotic specifications, thanks to their rich expressiveness and explicit time …

Space Processor Computation Time Analysis for Reinforcement Learning and Run Time Assurance Control Policies

NP Hamilton, K Dunlap, F Viramontes… - AIAA SCITECH 2025 …, 2025 - arc.aiaa.org
As the number of spacecraft on orbit continues to grow, it is challenging for human operators
to constantly monitor and plan for all missions. Autonomous control methods such as …