Hiql: Offline goal-conditioned rl with latent states as actions
Unsupervised pre-training has recently become the bedrock for computer vision and natural
language processing. In reinforcement learning (RL), goal-conditioned RL can potentially …
language processing. In reinforcement learning (RL), goal-conditioned RL can potentially …
Subgoal search for complex reasoning tasks
K Czechowski, T Odrzygóźdź… - Advances in …, 2021 - proceedings.neurips.cc
Humans excel in solving complex reasoning tasks through a mental process of moving from
one idea to a related one. Inspired by this, we propose Subgoal Search (kSubS) method. Its …
one idea to a related one. Inspired by this, we propose Subgoal Search (kSubS) method. Its …
DHRL: a graph-based approach for long-horizon and sparse hierarchical reinforcement learning
Abstract Hierarchical Reinforcement Learning (HRL) has made notable progress in complex
control tasks by leveraging temporal abstraction. However, previous HRL algorithms often …
control tasks by leveraging temporal abstraction. However, previous HRL algorithms often …
Planning irregular object packing via hierarchical reinforcement learning
Object packing by autonomous robots is an important challenge in warehouses and logistics
industry. Most conventional data-driven packing planning approaches focus on regular …
industry. Most conventional data-driven packing planning approaches focus on regular …
Hierarchical planning and learning for robots in stochastic settings using zero-shot option invention
N Shah, S Srivastava - Proceedings of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
This paper addresses the problem of inventing and using hierarchical representations for
stochastic robot-planning problems. Rather than using hand-coded state or action …
stochastic robot-planning problems. Rather than using hand-coded state or action …
Machine learning and information theory concepts towards an AI Mathematician
The current state of the art in artificial intelligence is impressive, especially in terms of
mastery of language, but not so much in terms of mathematical reasoning. What could be …
mastery of language, but not so much in terms of mathematical reasoning. What could be …
Learning graph-enhanced commander-executor for multi-agent navigation
This paper investigates the multi-agent navigation problem, which requires multiple agents
to reach the target goals in a limited time. Multi-agent reinforcement learning (MARL) has …
to reach the target goals in a limited time. Multi-agent reinforcement learning (MARL) has …
Goal-Conditioned Hierarchical Reinforcement Learning With High-Level Model Approximation
Hierarchical reinforcement learning (HRL) exhibits remarkable potential in addressing large-
scale and long-horizon complex tasks. However, a fundamental challenge, which arises …
scale and long-horizon complex tasks. However, a fundamental challenge, which arises …
Imitating graph-based planning with goal-conditioned policies
Recently, graph-based planning algorithms have gained much attention to solve goal-
conditioned reinforcement learning (RL) tasks: they provide a sequence of subgoals to …
conditioned reinforcement learning (RL) tasks: they provide a sequence of subgoals to …
Hybrid search for efficient planning with completeness guarantees
Solving complex planning problems has been a long-standing challenge in computer
science. Learning-based subgoal search methods have shown promise in tackling these …
science. Learning-based subgoal search methods have shown promise in tackling these …