Reinforcement Learning of Action and Query Policies with LTL Instructions under Uncertain Event Detector

W Hatanaka, R Yamashina… - IEEE Robotics and …, 2023 - ieeexplore.ieee.org
Reinforcement learning (RL) with linear temporal logic (LTL) objectives can allow robots to
carry out symbolic event plans in unknown environments. Most existing methods assume …

Hierarchies of reward machines

D Furelos-Blanco, M Law, A Jonsson… - International …, 2023 - proceedings.mlr.press
Reward machines (RMs) are a recent formalism for representing the reward function of a
reinforcement learning task through a finite-state machine whose edges encode subgoals of …

Generalization of temporal logic tasks via future dependent options

D Xu, F Fekri - Machine Learning, 2024 - Springer
Temporal logic (TL) tasks consist of complex and temporally extended subgoals and they
are common for many real-world applications, such as service and navigation robots …

Learning belief representations for partially observable deep RL

A Wang, AC Li, TQ Klassen, RT Icarte… - International …, 2023 - proceedings.mlr.press
Many important real-world Reinforcement Learning (RL) problems involve partial
observability and require policies with memory. Unfortunately, standard deep RL algorithms …

Grounding LTLf specifications in image sequences

E Umili, R Capobianco… - Proceedings of the …, 2023 - proceedings.kr.org
A critical challenge in neuro-symbolic (NeSy) approaches is to handle the symbol grounding
problem without direct supervision. That is mapping high-dimensional raw data into an …

Learning robust reward machines from noisy labels

R Parac, L Nodari, L Ardon, D Furelos-Blanco… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper presents PROB-IRM, an approach that learns robust reward machines (RMs) for
reinforcement learning (RL) agents from noisy execution traces. The key aspect of RM …

Contributions to Data-Driven Combinatorial Solvers

P Vaezipoor - 2023 - search.proquest.com
Discrete optimization problems appear in everyday life, with a wide range of applications
including airline scheduling, telecommunications network design and healthcare. Modern …

[PDF][PDF] Learning and Exploiting Reward Machines for Reinforcement Learning

DF Blanco - 2023 - danielfurelos.com
Reinforcement learning (RL) with non-Markovian rewards requires agents to learn
historydependent policies, which is particularly challenging in long-horizon or sparse reward …

Discovering logical knowledge in non-symbolic domains

E Umili - 2023 - iris.uniroma1.it
Deep learning and symbolic artificial intelligence remain the two main paradigms in Artificial
Intelligence (AI), each presenting their own strengths and weaknesses. Artificial agents …

[PDF][PDF] Generalization of Temporal Logic Tasks via Future Depen-dent Options

PS Apple, G Banana - rlbrew-workshop.github.io
Temporal logic (TL) tasks consist of complex and temporally extended subgoals and they
are common for many real-world applications, such as service and navigation robots …