Clue: Calibrated latent guidance for offline reinforcement learning

J Liu, L Zu, L He, D Wang - Conference on Robot Learning, 2023 - proceedings.mlr.press
Offline reinforcement learning (RL) aims to learn an optimal policy from pre-collected and
labeled datasets, which eliminates the time-consuming data collection in online RL …

Computational Sensing, Understanding, and Reasoning: An Artificial Intelligence Approach to Physics-Informed World Modeling

B Moya, A Badías, D González, F Chinesta… - … Methods in Engineering, 2024 - Springer
This work offers a discussion on how computational mechanics and physics-informed
machine learning can be integrated into the process of sensing, understanding, and …

DIDI: Diffusion-Guided Diversity for Offline Behavioral Generation

J Liu, X Guo, Z Zhuang, D Wang - arXiv preprint arXiv:2405.14790, 2024 - arxiv.org
In this paper, we propose a novel approach called DIffusion-guided DIversity (DIDI) for
offline behavioral generation. The goal of DIDI is to learn a diverse set of skills from a …

Reinformer: Max-return sequence modeling for offline rl

Z Zhuang, D Peng, Z Zhang, D Wang - arXiv preprint arXiv:2405.08740, 2024 - arxiv.org
As a data-driven paradigm, offline reinforcement learning (RL) has been formulated as
sequence modeling that conditions on the hindsight information including returns, goal or …

[HTML][HTML] Continual Reinforcement Learning for Quadruped Robot Locomotion

S Gai, S Lyu, H Zhang, D Wang - Entropy, 2024 - mdpi.com
The ability to learn continuously is crucial for a robot to achieve a high level of intelligence
and autonomy. In this paper, we consider continual reinforcement learning (RL) for …

Causal prompting model-based offline reinforcement learning

X Yu, Y Guan, R Shen, X Li, C Tang, J Jiang - arXiv preprint arXiv …, 2024 - arxiv.org
Model-based offline Reinforcement Learning (RL) allows agents to fully utilise pre-collected
datasets without requiring additional or unethical explorations. However, applying model …

SERA: Sample Efficient Reward Augmentation in offline-to-online Reinforcement Learning

Z Zhang, X Xiong, Z Zhuang, J Liu, D Wang - arXiv preprint arXiv …, 2023 - arxiv.org
A prospective application of offline reinforcement learning (RL) involves initializing a pre-
trained policy using existing static datasets for subsequent online fine-tuning. However …

Scalable Particle Generation for Granular Shape Study

Y Zhao, J Liu, X Gao, S Torres, SZ Li - NeurIPS 2023 AI for Science …, 2023 - openreview.net
The shape of granular matter (particle) is crucial for understanding their properties and
assembly behavior. Existing studies often rely on intuitive or machine-derived shape …