Enhancing Deep Reinforcement Learning: A Tutorial on Generative Diffusion Models in Network Optimization
Generative Diffusion Models (GDMs) have emerged as a transformative force in the realm of
Generative Artificial Intelligence (GenAI), demonstrating their versatility and efficacy across …
Generative Artificial Intelligence (GenAI), demonstrating their versatility and efficacy across …
One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation
Diffusion models, praised for their success in generative tasks, are increasingly being
applied to robotics, demonstrating exceptional performance in behavior cloning. However …
applied to robotics, demonstrating exceptional performance in behavior cloning. However …
Integrating Reinforcement Learning with Foundation Models for Autonomous Robotics: Methods and Perspectives
Foundation models (FMs), large deep learning models pre-trained on vast, unlabeled
datasets, exhibit powerful capabilities in understanding complex patterns and generating …
datasets, exhibit powerful capabilities in understanding complex patterns and generating …
Diffcps: Diffusion model based constrained policy search for offline reinforcement learning
Constrained policy search (CPS) is a fundamental problem in offline reinforcement learning,
which is generally solved by advantage weighted regression (AWR). However, previous …
which is generally solved by advantage weighted regression (AWR). However, previous …
Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control
Drawing upon recent advances in language model alignment, we formulate offline
Reinforcement Learning as a two-stage optimization problem: First pretraining expressive …
Reinforcement Learning as a two-stage optimization problem: First pretraining expressive …
Diffusion Spectral Representation for Reinforcement Learning
Diffusion-based models have achieved notable empirical successes in reinforcement
learning (RL) due to their expressiveness in modeling complex distributions. Despite …
learning (RL) due to their expressiveness in modeling complex distributions. Despite …
Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning
Applying diffusion models in reinforcement learning for long-term planning has gained much
attention recently. Several diffusion-based methods have successfully leveraged the …
attention recently. Several diffusion-based methods have successfully leveraged the …
UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function in Offline Reinforcement Learning
Y Zhang, R Yu, Z Yao, W Zhang, J Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
The Mean Square Error (MSE) is commonly utilized to estimate the solution of the optimal
value function in the vast majority of offline reinforcement learning (RL) models and has …
value function in the vast majority of offline reinforcement learning (RL) models and has …
Diffusion Policies creating a Trust Region for Offline Reinforcement Learning
Offline reinforcement learning (RL) leverages pre-collected datasets to train optimal policies.
Diffusion Q-Learning (DQL), introducing diffusion models as a powerful and expressive …
Diffusion Q-Learning (DQL), introducing diffusion models as a powerful and expressive …