- 学术资源搜索

Contrastive learning as goal-conditioned reinforcement learning

B Eysenbach, T Zhang, S Levine… - Advances in Neural …, 2022 - proceedings.neurips.cc

In reinforcement learning (RL), it is easier to solve a task if given a good representation.
While deep RL should automatically acquire such good representations, prior work often …

被引用次数：102 相关文章所有 6 个版本

[PDF] neurips.cc

Hiql: Offline goal-conditioned rl with latent states as actions

S Park, D Ghosh, B Eysenbach… - Advances in Neural …, 2024 - proceedings.neurips.cc

Unsupervised pre-training has recently become the bedrock for computer vision and natural
language processing. In reinforcement learning (RL), goal-conditioned RL can potentially …

被引用次数：21 相关文章所有 6 个版本

[PDF] neurips.cc

Compositional generalization from first principles

T Wiedemer, P Mayilvahanan… - Advances in Neural …, 2024 - proceedings.neurips.cc

Leveraging the compositional nature of our world to expedite learning and facilitate
generalization is a hallmark of human perception. In machine learning, on the other hand …

被引用次数：18 相关文章所有 6 个版本

[PDF] mlr.press

What is essential for unseen goal generalization of offline goal-conditioned rl?

R Yang, L Yong, X Ma, H Hu… - … on Machine Learning, 2023 - proceedings.mlr.press

Offline goal-conditioned RL (GCRL) offers a way to train general-purpose agents from fully
offline datasets. In addition to being conservative within the dataset, the generalization …

被引用次数：18 相关文章所有 7 个版本

[PDF] academia.edu

[PDF][PDF] Structure in reinforcement learning: A survey and open problems

A Mohan, A Zhang, M Lindauer - arXiv preprint arXiv:2306.16021, 2023 - academia.edu

Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural
Networks (DNNs) for function approximation, has demonstrated considerable success in …

被引用次数：11 相关文章所有 2 个版本

[PDF] arxiv.org

Learning to extrapolate: A transductive approach

A Netanyahu, A Gupta, M Simchowitz, K Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org

Machine learning systems, especially with overparameterized deep neural networks, can
generalize to novel test instances drawn from the same distribution as the training data …

被引用次数：13 相关文章所有 4 个版本

[PDF] ieee.org

Drone Landing and Reinforcement Learning: State-of-art, Challenges and Opportunities

J Amendola, LR Cenkeramaddi… - IEEE Open Journal of …, 2024 - ieeexplore.ieee.org

Unmanned aerial vehicles, and special multirotor drones, have shown great relevance in a
plethora of missions that require high affordance, field of view, and precision. Their limited …

[PDF] neurips.cc

Constrained gpi for zero-shot transfer in reinforcement learning

J Kim, S Park, G Kim - Advances in Neural Information …, 2022 - proceedings.neurips.cc

For zero-shot transfer in reinforcement learning where the reward function varies between
different tasks, the successor features framework has been one of the popular approaches …

被引用次数：5 相关文章所有 4 个版本

[PDF] aaai.org

Metric Residual Network for Sample Efficient Goal-Conditioned Reinforcement Learning

B Liu, Y Feng, Q Liu, P Stone - Proceedings of the AAAI Conference on …, 2023 - ojs.aaai.org

Goal-conditioned reinforcement learning (GCRL) has a wide range of potential real-world
applications, including manipulation and navigation problems in robotics. Especially in such …

被引用次数：8 相关文章所有 8 个版本

[PDF] jair.org Full View

Structure in Deep Reinforcement Learning: A Survey and Open Problems

A Mohan, A Zhang, M Lindauer - Journal of Artificial Intelligence Research, 2024 - jair.org

Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural
Networks (DNNs) for function approximation, has demonstrated considerable success in …