Enhancing Deep Reinforcement Learning: A Tutorial on Generative Diffusion Models in Network Optimization

H Du, R Zhang, Y Liu, J Wang, Y Lin… - … Surveys & Tutorials, 2024 - ieeexplore.ieee.org
Generative Diffusion Models (GDMs) have emerged as a transformative force in the realm of
Generative Artificial Intelligence (GenAI), demonstrating their versatility and efficacy across …

One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation

Z Wang, Z Li, A Mandlekar, Z Xu, J Fan… - arXiv preprint arXiv …, 2024 - arxiv.org
Diffusion models, praised for their success in generative tasks, are increasingly being
applied to robotics, demonstrating exceptional performance in behavior cloning. However …

Integrating Reinforcement Learning with Foundation Models for Autonomous Robotics: Methods and Perspectives

A Moroncelli, V Soni, AA Shahid, M Maccarini… - arXiv preprint arXiv …, 2024 - arxiv.org
Foundation models (FMs), large deep learning models pre-trained on vast, unlabeled
datasets, exhibit powerful capabilities in understanding complex patterns and generating …

Diffcps: Diffusion model based constrained policy search for offline reinforcement learning

L He, L Shen, L Zhang, J Tan, X Wang - arXiv preprint arXiv:2310.05333, 2023 - arxiv.org
Constrained policy search (CPS) is a fundamental problem in offline reinforcement learning,
which is generally solved by advantage weighted regression (AWR). However, previous …

Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control

H Chen, K Zheng, H Su, J Zhu - arXiv preprint arXiv:2407.09024, 2024 - arxiv.org
Drawing upon recent advances in language model alignment, we formulate offline
Reinforcement Learning as a two-stage optimization problem: First pretraining expressive …

Diffusion Spectral Representation for Reinforcement Learning

D Shribak, CX Gao, Y Li, C Xiao, B Dai - arXiv preprint arXiv:2406.16121, 2024 - arxiv.org
Diffusion-based models have achieved notable empirical successes in reinforcement
learning (RL) due to their expressiveness in modeling complex distributions. Despite …

Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning

Y Shan, Z Zhu, T Long, Q Liang, Y Chang… - arXiv preprint arXiv …, 2024 - arxiv.org
Applying diffusion models in reinforcement learning for long-term planning has gained much
attention recently. Several diffusion-based methods have successfully leveraged the …

UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function in Offline Reinforcement Learning

Y Zhang, R Yu, Z Yao, W Zhang, J Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
The Mean Square Error (MSE) is commonly utilized to estimate the solution of the optimal
value function in the vast majority of offline reinforcement learning (RL) models and has …

Diffusion Policies creating a Trust Region for Offline Reinforcement Learning

T Chen, Z Wang, M Zhou - arXiv preprint arXiv:2405.19690, 2024 - arxiv.org
Offline reinforcement learning (RL) leverages pre-collected datasets to train optimal policies.
Diffusion Q-Learning (DQL), introducing diffusion models as a powerful and expressive …

[引用][C] Real-Time Legged Locomotion Control with Diffusion from Offline Datasets

R Wang - 2024