Chipformer: Transferable chip placement via offline decision transformer
Placement is a critical step in modern chip design, aiming to determine the positions of
circuit modules on the chip canvas. Recent works have shown that reinforcement learning …
circuit modules on the chip canvas. Recent works have shown that reinforcement learning …
Behavior proximal policy optimization
Offline reinforcement learning (RL) is a challenging setting where existing off-policy actor-
critic methods perform poorly due to the overestimation of out-of-distribution state-action …
critic methods perform poorly due to the overestimation of out-of-distribution state-action …
Clue: Calibrated latent guidance for offline reinforcement learning
Offline reinforcement learning (RL) aims to learn an optimal policy from pre-collected and
labeled datasets, which eliminates the time-consuming data collection in online RL …
labeled datasets, which eliminates the time-consuming data collection in online RL …
Unsupervised domain adaptation with dynamics-aware rewards in reinforcement learning
Unsupervised reinforcement learning aims to acquire skills without prior goal
representations, where an agent automatically explores an open-ended environment to …
representations, where an agent automatically explores an open-ended environment to …
DIDI: Diffusion-Guided Diversity for Offline Behavioral Generation
In this paper, we propose a novel approach called DIffusion-guided DIversity (DIDI) for
offline behavioral generation. The goal of DIDI is to learn a diverse set of skills from a …
offline behavioral generation. The goal of DIDI is to learn a diverse set of skills from a …
A methodical interpretation of adaptive robotics: Study and reformulation
The recent development of industrial manufacturing and social services has witnessed a
significant trend of automation and intelligentization due to the wide application of robots …
significant trend of automation and intelligentization due to the wide application of robots …
Hierarchical reinforcement learning with unlimited option scheduling for sparse rewards in continuous spaces
Z Huang, Q Liu, F Zhu, L Zhang, L Wu - Expert Systems with Applications, 2024 - Elsevier
The fundamental concept behind option-based hierarchical reinforcement learning (O-HRL)
is to obtain temporal coarse-grained actions and abstract complex situations. Although O …
is to obtain temporal coarse-grained actions and abstract complex situations. Although O …
Discovering and Exploiting Skills in Hierarchical Reinforcement Learning
Z Huang - IEEE Access, 2024 - ieeexplore.ieee.org
Humans can perform infinite diverse skills. These skills typically represent abstract
knowledge that is highly correlated with time series. To behave more like a human, we take …
knowledge that is highly correlated with time series. To behave more like a human, we take …
Terrain-Aware Risk-Assessment-Network-Aided Deep Reinforcement Learning for Quadrupedal Locomotion in Tough Terrain
When it comes to the control system of quadruped robots, deep reinforcement learning
(DRL) is considered to be a promising solution. Despite years of development in this field …
(DRL) is considered to be a promising solution. Despite years of development in this field …
KSG: Knowledge and skill graph
The knowledge graph (KG) is an essential form of knowledge representation that has grown
in prominence in recent years. Because it concentrates on nominal entities and their …
in prominence in recent years. Because it concentrates on nominal entities and their …