Landmark-guided subgoal generation in hierarchical reinforcement learning

S Park, D Ghosh, B Eysenbach… - Advances in Neural …, 2024 - proceedings.neurips.cc

Unsupervised pre-training has recently become the bedrock for computer vision and natural
language processing. In reinforcement learning (RL), goal-conditioned RL can potentially …

Cited by 37 · Related articles · All 6 versions

Subgoal search for complex reasoning tasks

K Czechowski, T Odrzygóźdź… - Advances in …, 2021 - proceedings.neurips.cc

Humans excel in solving complex reasoning tasks through a mental process of moving from
one idea to a related one. Inspired by this, we propose Subgoal Search (kSubS) method. Its …

Cited by 29 · Related articles · All 8 versions

DHRL: a graph-based approach for long-horizon and sparse hierarchical reinforcement learning

S Lee, J Kim, I Jang, HJ Kim - Advances in Neural …, 2022 - proceedings.neurips.cc

Hierarchical Reinforcement Learning (HRL) has made notable progress in complex
control tasks by leveraging temporal abstraction. However, previous HRL algorithms often …

Cited by 16 · Related articles · All 5 versions

Planning irregular object packing via hierarchical reinforcement learning

S Huang, Z Wang, J Zhou, J Lu - IEEE Robotics and …, 2022 - ieeexplore.ieee.org

Object packing by autonomous robots is an important challenge in warehouses and logistics
industry. Most conventional data-driven packing planning approaches focus on regular …

Cited by 19 · Related articles · All 4 versions

Hierarchical planning and learning for robots in stochastic settings using zero-shot option invention

N Shah, S Srivastava - Proceedings of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org

This paper addresses the problem of inventing and using hierarchical representations for
stochastic robot-planning problems. Rather than using hand-coded state or action …

Cited by 4 · Related articles · All 2 versions

Machine learning and information theory concepts towards an AI Mathematician

Y Bengio, N Malkin - Bulletin of the American Mathematical Society, 2024 - ams.org

The current state of the art in artificial intelligence is impressive, especially in terms of
mastery of language, but not so much in terms of mathematical reasoning. What could be …

Cited by 10 · Related articles · All 4 versions

Learning graph-enhanced commander-executor for multi-agent navigation

X Yang, S Huang, Y Sun, Y Yang, C Yu, WW Tu… - arXiv preprint arXiv …, 2023 - arxiv.org

This paper investigates the multi-agent navigation problem, which requires multiple agents
to reach the target goals in a limited time. Multi-agent reinforcement learning (MARL) has …

Cited by 6 · Related articles · All 7 versions

Goal-Conditioned Hierarchical Reinforcement Learning With High-Level Model Approximation

Y Luo, T Ji, F Sun, H Liu, J Zhang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Hierarchical reinforcement learning (HRL) exhibits remarkable potential in addressing
large-scale and long-horizon complex tasks. However, a fundamental challenge, which arises …

Cited by 5 · Related articles · All 3 versions

Imitating graph-based planning with goal-conditioned policies

J Kim, Y Seo, S Ahn, K Son, J Shin - arXiv preprint arXiv:2303.11166, 2023 - arxiv.org

Recently, graph-based planning algorithms have gained much attention to solve
goal-conditioned reinforcement learning (RL) tasks: they provide a sequence of subgoals to …

Cited by 13 · Related articles · All 3 versions

Hybrid search for efficient planning with completeness guarantees

K Kujanpää, J Pajarinen, A Ilin - Advances in Neural …, 2024 - proceedings.neurips.cc

Solving complex planning problems has been a long-standing challenge in computer
science. Learning-based subgoal search methods have shown promise in tackling these …

Cited by 1 · Related articles · All 6 versions