Independent skill transfer for deep reinforcement learning

Y Lai, J Liu, Z Tang, B Wang, J Hao… - … on Machine Learning, 2023 - proceedings.mlr.press

Placement is a critical step in modern chip design, aiming to determine the positions of
circuit modules on the chip canvas. Recent works have shown that reinforcement learning …

被引用次数：36 相关文章所有 7 个版本

[PDF] arxiv.org

Behavior proximal policy optimization

Z Zhuang, K Lei, J Liu, D Wang, Y Guo - arXiv preprint arXiv:2302.11312, 2023 - arxiv.org

Offline reinforcement learning (RL) is a challenging setting where existing off-policy actor-
critic methods perform poorly due to the overestimation of out-of-distribution state-action …

被引用次数：38 相关文章所有 3 个版本

[PDF] mlr.press

Clue: Calibrated latent guidance for offline reinforcement learning

J Liu, L Zu, L He, D Wang - Conference on Robot Learning, 2023 - proceedings.mlr.press

Offline reinforcement learning (RL) aims to learn an optimal policy from pre-collected and
labeled datasets, which eliminates the time-consuming data collection in online RL …

被引用次数：7 相关文章所有 4 个版本

[PDF] neurips.cc

Unsupervised domain adaptation with dynamics-aware rewards in reinforcement learning

J Liu, H Shen, D Wang, Y Kang… - Advances in Neural …, 2021 - proceedings.neurips.cc

Unsupervised reinforcement learning aims to acquire skills without prior goal
representations, where an agent automatically explores an open-ended environment to …

被引用次数：21 相关文章所有 6 个版本

[PDF] arxiv.org

DIDI: Diffusion-Guided Diversity for Offline Behavioral Generation

J Liu, X Guo, Z Zhuang, D Wang - arXiv preprint arXiv:2405.14790, 2024 - arxiv.org

In this paper, we propose a novel approach called DIffusion-guided DIversity (DIDI) for
offline behavioral generation. The goal of DIDI is to learn a diverse set of skills from a …

被引用次数：1 相关文章所有 4 个版本

[PDF] researchgate.net

A methodical interpretation of adaptive robotics: Study and reformulation

AMS Enayati, Z Zhang, H Najjaran - Neurocomputing, 2022 - Elsevier

The recent development of industrial manufacturing and social services has witnessed a
significant trend of automation and intelligentization due to the wide application of robots …

被引用次数：7 相关文章所有 2 个版本

Hierarchical reinforcement learning with unlimited option scheduling for sparse rewards in continuous spaces

Z Huang, Q Liu, F Zhu, L Zhang, L Wu - Expert Systems with Applications, 2024 - Elsevier

The fundamental concept behind option-based hierarchical reinforcement learning (O-HRL)
is to obtain temporal coarse-grained actions and abstract complex situations. Although O …

被引用次数：5 相关文章所有 2 个版本

[PDF] ieee.org

Discovering and Exploiting Skills in Hierarchical Reinforcement Learning

Z Huang - IEEE Access, 2024 - ieeexplore.ieee.org

Humans can perform infinite diverse skills. These skills typically represent abstract
knowledge that is highly correlated with time series. To behave more like a human, we take …

Terrain-Aware Risk-Assessment-Network-Aided Deep Reinforcement Learning for Quadrupedal Locomotion in Tough Terrain

H Zhang, J Wang, Z Wu, Y Wang… - 2021 IEEE/RSJ …, 2021 - ieeexplore.ieee.org

When it comes to the control system of quadruped robots, deep reinforcement learning
(DRL) is considered to be a promising solution. Despite years of development in this field …

被引用次数：5 相关文章所有 2 个版本

[PDF] arxiv.org

KSG: Knowledge and skill graph

F Zhao, Z Zhang, D Wang - Proceedings of the 31st ACM International …, 2022 - dl.acm.org

The knowledge graph (KG) is an essential form of knowledge representation that has grown
in prominence in recent years. Because it concentrates on nominal entities and their …

被引用次数：1 相关文章所有 3 个版本