Reinforcement Learning of Action and Query Policies with LTL Instructions under Uncertain Event Detector
W Hatanaka, R Yamashina… - IEEE Robotics and …, 2023 - ieeexplore.ieee.org
Reinforcement learning (RL) with linear temporal logic (LTL) objectives can allow robots to
carry out symbolic event plans in unknown environments. Most existing methods assume …
carry out symbolic event plans in unknown environments. Most existing methods assume …
Hierarchies of reward machines
Reward machines (RMs) are a recent formalism for representing the reward function of a
reinforcement learning task through a finite-state machine whose edges encode subgoals of …
reinforcement learning task through a finite-state machine whose edges encode subgoals of …
Generalization of temporal logic tasks via future dependent options
Temporal logic (TL) tasks consist of complex and temporally extended subgoals and they
are common for many real-world applications, such as service and navigation robots …
are common for many real-world applications, such as service and navigation robots …
Learning belief representations for partially observable deep RL
Many important real-world Reinforcement Learning (RL) problems involve partial
observability and require policies with memory. Unfortunately, standard deep RL algorithms …
observability and require policies with memory. Unfortunately, standard deep RL algorithms …
Grounding LTLf specifications in image sequences
E Umili, R Capobianco… - Proceedings of the …, 2023 - proceedings.kr.org
A critical challenge in neuro-symbolic (NeSy) approaches is to handle the symbol grounding
problem without direct supervision. That is mapping high-dimensional raw data into an …
problem without direct supervision. That is mapping high-dimensional raw data into an …
Learning robust reward machines from noisy labels
This paper presents PROB-IRM, an approach that learns robust reward machines (RMs) for
reinforcement learning (RL) agents from noisy execution traces. The key aspect of RM …
reinforcement learning (RL) agents from noisy execution traces. The key aspect of RM …
Contributions to Data-Driven Combinatorial Solvers
P Vaezipoor - 2023 - search.proquest.com
Discrete optimization problems appear in everyday life, with a wide range of applications
including airline scheduling, telecommunications network design and healthcare. Modern …
including airline scheduling, telecommunications network design and healthcare. Modern …
[PDF][PDF] Learning and Exploiting Reward Machines for Reinforcement Learning
DF Blanco - 2023 - danielfurelos.com
Reinforcement learning (RL) with non-Markovian rewards requires agents to learn
historydependent policies, which is particularly challenging in long-horizon or sparse reward …
historydependent policies, which is particularly challenging in long-horizon or sparse reward …
Discovering logical knowledge in non-symbolic domains
E Umili - 2023 - iris.uniroma1.it
Deep learning and symbolic artificial intelligence remain the two main paradigms in Artificial
Intelligence (AI), each presenting their own strengths and weaknesses. Artificial agents …
Intelligence (AI), each presenting their own strengths and weaknesses. Artificial agents …
[PDF][PDF] Generalization of Temporal Logic Tasks via Future Depen-dent Options
PS Apple, G Banana - rlbrew-workshop.github.io
Temporal logic (TL) tasks consist of complex and temporally extended subgoals and they
are common for many real-world applications, such as service and navigation robots …
are common for many real-world applications, such as service and navigation robots …