Hiql: Offline goal-conditioned rl with latent states as actions

S Park, D Ghosh, B Eysenbach… - Advances in Neural …, 2024 - proceedings.neurips.cc
Unsupervised pre-training has recently become the bedrock for computer vision and natural
language processing. In reinforcement learning (RL), goal-conditioned RL can potentially …

Subgoal search for complex reasoning tasks

K Czechowski, T Odrzygóźdź… - Advances in …, 2021 - proceedings.neurips.cc
Humans excel in solving complex reasoning tasks through a mental process of moving from
one idea to a related one. Inspired by this, we propose Subgoal Search (kSubS) method. Its …

DHRL: a graph-based approach for long-horizon and sparse hierarchical reinforcement learning

S Lee, J Kim, I Jang, HJ Kim - Advances in Neural …, 2022 - proceedings.neurips.cc
Abstract Hierarchical Reinforcement Learning (HRL) has made notable progress in complex
control tasks by leveraging temporal abstraction. However, previous HRL algorithms often …

Planning irregular object packing via hierarchical reinforcement learning

S Huang, Z Wang, J Zhou, J Lu - IEEE Robotics and …, 2022 - ieeexplore.ieee.org
Object packing by autonomous robots is an important challenge in warehouses and logistics
industry. Most conventional data-driven packing planning approaches focus on regular …

Hierarchical planning and learning for robots in stochastic settings using zero-shot option invention

N Shah, S Srivastava - Proceedings of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
This paper addresses the problem of inventing and using hierarchical representations for
stochastic robot-planning problems. Rather than using hand-coded state or action …

Machine learning and information theory concepts towards an AI Mathematician

Y Bengio, N Malkin - Bulletin of the American Mathematical Society, 2024 - ams.org
The current state of the art in artificial intelligence is impressive, especially in terms of
mastery of language, but not so much in terms of mathematical reasoning. What could be …

Learning graph-enhanced commander-executor for multi-agent navigation

X Yang, S Huang, Y Sun, Y Yang, C Yu, WW Tu… - arXiv preprint arXiv …, 2023 - arxiv.org
This paper investigates the multi-agent navigation problem, which requires multiple agents
to reach the target goals in a limited time. Multi-agent reinforcement learning (MARL) has …

Goal-Conditioned Hierarchical Reinforcement Learning With High-Level Model Approximation

Y Luo, T Ji, F Sun, H Liu, J Zhang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Hierarchical reinforcement learning (HRL) exhibits remarkable potential in addressing large-
scale and long-horizon complex tasks. However, a fundamental challenge, which arises …

Imitating graph-based planning with goal-conditioned policies

J Kim, Y Seo, S Ahn, K Son, J Shin - arXiv preprint arXiv:2303.11166, 2023 - arxiv.org
Recently, graph-based planning algorithms have gained much attention to solve goal-
conditioned reinforcement learning (RL) tasks: they provide a sequence of subgoals to …

Hybrid search for efficient planning with completeness guarantees

K Kujanpää, J Pajarinen, A Ilin - Advances in Neural …, 2024 - proceedings.neurips.cc
Solving complex planning problems has been a long-standing challenge in computer
science. Learning-based subgoal search methods have shown promise in tackling these …