Design from policies: Conservative test-time adaptation for offline policy optimization

J Liu, L Zu, L He, D Wang - Conference on Robot Learning, 2023 - proceedings.mlr.press

Offline reinforcement learning (RL) aims to learn an optimal policy from pre-collected and
labeled datasets, which eliminates the time-consuming data collection in online RL …

Cited by 6 · Related articles · All 4 versions

Computational Sensing, Understanding, and Reasoning: An Artificial Intelligence Approach to Physics-Informed World Modeling

B Moya, A Badías, D González, F Chinesta… - … Methods in Engineering, 2024 - Springer

This work offers a discussion on how computational mechanics and physics-informed
machine learning can be integrated into the process of sensing, understanding, and …

Cited by 1 · Related articles · All 2 versions

DIDI: Diffusion-Guided Diversity for Offline Behavioral Generation

J Liu, X Guo, Z Zhuang, D Wang - arXiv preprint arXiv:2405.14790, 2024 - arxiv.org

In this paper, we propose a novel approach called DIffusion-guided DIversity (DIDI) for
offline behavioral generation. The goal of DIDI is to learn a diverse set of skills from a …

Cited by 1 · Related articles · All 4 versions

Reinformer: Max-return sequence modeling for offline RL

Z Zhuang, D Peng, Z Zhang, D Wang - arXiv preprint arXiv:2405.08740, 2024 - arxiv.org

As a data-driven paradigm, offline reinforcement learning (RL) has been formulated as
sequence modeling that conditions on the hindsight information including returns, goal or …

Cited by 1 · Related articles · All 3 versions

Continual Reinforcement Learning for Quadruped Robot Locomotion

S Gai, S Lyu, H Zhang, D Wang - Entropy, 2024 - mdpi.com

The ability to learn continuously is crucial for a robot to achieve a high level of intelligence
and autonomy. In this paper, we consider continual reinforcement learning (RL) for …

Cited by 1 · Related articles · All 7 versions
