Chipformer: Transferable chip placement via offline decision transformer

Y Lai, J Liu, Z Tang, B Wang, J Hao… - … on Machine Learning, 2023 - proceedings.mlr.press
Placement is a critical step in modern chip design, aiming to determine the positions of
circuit modules on the chip canvas. Recent works have shown that reinforcement learning …

Behavior proximal policy optimization

Z Zhuang, K Lei, J Liu, D Wang, Y Guo - arXiv preprint arXiv:2302.11312, 2023 - arxiv.org
Offline reinforcement learning (RL) is a challenging setting where existing off-policy actor-
critic methods perform poorly due to the overestimation of out-of-distribution state-action …

Clue: Calibrated latent guidance for offline reinforcement learning

J Liu, L Zu, L He, D Wang - Conference on Robot Learning, 2023 - proceedings.mlr.press
Offline reinforcement learning (RL) aims to learn an optimal policy from pre-collected and
labeled datasets, which eliminates the time-consuming data collection in online RL …

Unsupervised domain adaptation with dynamics-aware rewards in reinforcement learning

J Liu, H Shen, D Wang, Y Kang… - Advances in Neural …, 2021 - proceedings.neurips.cc
Unsupervised reinforcement learning aims to acquire skills without prior goal
representations, where an agent automatically explores an open-ended environment to …

DIDI: Diffusion-Guided Diversity for Offline Behavioral Generation

J Liu, X Guo, Z Zhuang, D Wang - arXiv preprint arXiv:2405.14790, 2024 - arxiv.org
In this paper, we propose a novel approach called DIffusion-guided DIversity (DIDI) for
offline behavioral generation. The goal of DIDI is to learn a diverse set of skills from a …

A methodical interpretation of adaptive robotics: Study and reformulation

AMS Enayati, Z Zhang, H Najjaran - Neurocomputing, 2022 - Elsevier
The recent development of industrial manufacturing and social services has witnessed a
significant trend of automation and intelligentization due to the wide application of robots …

Hierarchical reinforcement learning with unlimited option scheduling for sparse rewards in continuous spaces

Z Huang, Q Liu, F Zhu, L Zhang, L Wu - Expert Systems with Applications, 2024 - Elsevier
The fundamental concept behind option-based hierarchical reinforcement learning (O-HRL)
is to obtain temporal coarse-grained actions and abstract complex situations. Although O …

Discovering and Exploiting Skills in Hierarchical Reinforcement Learning

Z Huang - IEEE Access, 2024 - ieeexplore.ieee.org
Humans can perform infinite diverse skills. These skills typically represent abstract
knowledge that is highly correlated with time series. To behave more like a human, we take …

Terrain-Aware Risk-Assessment-Network-Aided Deep Reinforcement Learning for Quadrupedal Locomotion in Tough Terrain

H Zhang, J Wang, Z Wu, Y Wang… - 2021 IEEE/RSJ …, 2021 - ieeexplore.ieee.org
When it comes to the control system of quadruped robots, deep reinforcement learning
(DRL) is considered to be a promising solution. Despite years of development in this field …

KSG: Knowledge and skill graph

F Zhao, Z Zhang, D Wang - Proceedings of the 31st ACM International …, 2022 - dl.acm.org
The knowledge graph (KG) is an essential form of knowledge representation that has grown
in prominence in recent years. Because it concentrates on nominal entities and their …