Noisy symbolic abstractions for deep RL: A case study with reward machines

Reinforcement Learning of Action and Query Policies with LTL Instructions under Uncertain Event Detector

W Hatanaka, R Yamashina… - IEEE Robotics and …, 2023 - ieeexplore.ieee.org

Reinforcement learning (RL) with linear temporal logic (LTL) objectives can allow robots to
carry out symbolic event plans in unknown environments. Most existing methods assume …

被引用次数：3 相关文章所有 3 个版本

[PDF] mlr.press

Hierarchies of reward machines

D Furelos-Blanco, M Law, A Jonsson… - International …, 2023 - proceedings.mlr.press

Reward machines (RMs) are a recent formalism for representing the reward function of a
reinforcement learning task through a finite-state machine whose edges encode subgoals of …

被引用次数：14 相关文章所有 9 个版本

[PDF] easychair.org

Generalization of temporal logic tasks via future dependent options

D Xu, F Fekri - Machine Learning, 2024 - Springer

Temporal logic (TL) tasks consist of complex and temporally extended subgoals and they
are common for many real-world applications, such as service and navigation robots …

被引用次数：2 相关文章所有 2 个版本

[PDF] mlr.press

Learning belief representations for partially observable deep RL

A Wang, AC Li, TQ Klassen, RT Icarte… - International …, 2023 - proceedings.mlr.press

Many important real-world Reinforcement Learning (RL) problems involve partial
observability and require policies with memory. Unfortunately, standard deep RL algorithms …

被引用次数：9 相关文章所有 6 个版本

[PDF] kr.org

Grounding LTLf specifications in image sequences

E Umili, R Capobianco… - Proceedings of the …, 2023 - proceedings.kr.org

A critical challenge in neuro-symbolic (NeSy) approaches is to handle the symbol grounding
problem without direct supervision. That is mapping high-dimensional raw data into an …

被引用次数：7 相关文章所有 9 个版本

[PDF] arxiv.org

Learning robust reward machines from noisy labels

R Parac, L Nodari, L Ardon, D Furelos-Blanco… - arXiv preprint arXiv …, 2024 - arxiv.org

This paper presents PROB-IRM, an approach that learns robust reward machines (RMs) for
reinforcement learning (RL) agents from noisy execution traces. The key aspect of RM …

Contributions to Data-Driven Combinatorial Solvers

P Vaezipoor - 2023 - search.proquest.com

Discrete optimization problems appear in everyday life, with a wide range of applications
including airline scheduling, telecommunications network design and healthcare. Modern …

[PDF][PDF] Learning and Exploiting Reward Machines for Reinforcement Learning

DF Blanco - 2023 - danielfurelos.com

Reinforcement learning (RL) with non-Markovian rewards requires agents to learn
historydependent policies, which is particularly challenging in long-horizon or sparse reward …

[PDF] uniroma1.it

Discovering logical knowledge in non-symbolic domains

E Umili - 2023 - iris.uniroma1.it

Deep learning and symbolic artificial intelligence remain the two main paradigms in Artificial
Intelligence (AI), each presenting their own strengths and weaknesses. Artificial agents …

[PDF][PDF] Generalization of Temporal Logic Tasks via Future Depen-dent Options

PS Apple, G Banana - rlbrew-workshop.github.io

Temporal logic (TL) tasks consist of complex and temporally extended subgoals and they
are common for many real-world applications, such as service and navigation robots …