Clue: Calibrated latent guidance for offline reinforcement learning
Offline reinforcement learning (RL) aims to learn an optimal policy from pre-collected and
labeled datasets, which eliminates the time-consuming data collection in online RL …
labeled datasets, which eliminates the time-consuming data collection in online RL …
Computational Sensing, Understanding, and Reasoning: An Artificial Intelligence Approach to Physics-Informed World Modeling
This work offers a discussion on how computational mechanics and physics-informed
machine learning can be integrated into the process of sensing, understanding, and …
machine learning can be integrated into the process of sensing, understanding, and …
DIDI: Diffusion-Guided Diversity for Offline Behavioral Generation
In this paper, we propose a novel approach called DIffusion-guided DIversity (DIDI) for
offline behavioral generation. The goal of DIDI is to learn a diverse set of skills from a …
offline behavioral generation. The goal of DIDI is to learn a diverse set of skills from a …
Reinformer: Max-return sequence modeling for offline rl
As a data-driven paradigm, offline reinforcement learning (RL) has been formulated as
sequence modeling that conditions on the hindsight information including returns, goal or …
sequence modeling that conditions on the hindsight information including returns, goal or …
[HTML][HTML] Continual Reinforcement Learning for Quadruped Robot Locomotion
S Gai, S Lyu, H Zhang, D Wang - Entropy, 2024 - mdpi.com
The ability to learn continuously is crucial for a robot to achieve a high level of intelligence
and autonomy. In this paper, we consider continual reinforcement learning (RL) for …
and autonomy. In this paper, we consider continual reinforcement learning (RL) for …
Causal prompting model-based offline reinforcement learning
Model-based offline Reinforcement Learning (RL) allows agents to fully utilise pre-collected
datasets without requiring additional or unethical explorations. However, applying model …
datasets without requiring additional or unethical explorations. However, applying model …
SERA: Sample Efficient Reward Augmentation in offline-to-online Reinforcement Learning
A prospective application of offline reinforcement learning (RL) involves initializing a pre-
trained policy using existing static datasets for subsequent online fine-tuning. However …
trained policy using existing static datasets for subsequent online fine-tuning. However …
Scalable Particle Generation for Granular Shape Study
The shape of granular matter (particle) is crucial for understanding their properties and
assembly behavior. Existing studies often rely on intuitive or machine-derived shape …
assembly behavior. Existing studies often rely on intuitive or machine-derived shape …