Distributed distributional deterministic policy gradients

TT Nguyen, VJ Reddi - IEEE Transactions on Neural Networks …, 2021 - ieeexplore.ieee.org

The scale of Internet-connected systems has increased considerably, and these systems are
being exposed to cyberattacks more than ever. The complexity and dynamics of …

被引用次数：426 相关文章所有 16 个版本

Reinforcement learning-based physical cross-layer security and privacy in 6G

X Lu, L Xiao, P Li, X Ji, C Xu, S Yu… - … Surveys & Tutorials, 2022 - ieeexplore.ieee.org

Sixth-generation (6G) cellular systems will have an inherent vulnerability to physical (PHY)-
layer attacks and privacy leakage, due to the large-scale heterogeneous networks with …

被引用次数：42 相关文章所有 3 个版本

[PDF] arxiv.org

Mastering diverse domains through world models

D Hafner, J Pasukonis, J Ba, T Lillicrap - arXiv preprint arXiv:2301.04104, 2023 - arxiv.org

Developing a general algorithm that learns to solve tasks across a wide range of
applications has been a fundamental challenge in artificial intelligence. Although current …

被引用次数：318 相关文章所有 2 个版本

[PDF] arxiv.org

A generalist agent

S Reed, K Zolna, E Parisotto, SG Colmenarejo… - arXiv preprint arXiv …, 2022 - arxiv.org

Inspired by progress in large-scale language modeling, we apply a similar approach
towards building a single generalist agent beyond the realm of text outputs. The agent …

被引用次数：763 相关文章所有 4 个版本

[PDF] arxiv.org

Mastering visual continuous control: Improved data-augmented reinforcement learning

D Yarats, R Fergus, A Lazaric, L Pinto - arXiv preprint arXiv:2107.09645, 2021 - arxiv.org

We present DrQ-v2, a model-free reinforcement learning (RL) algorithm for visual
continuous control. DrQ-v2 builds on DrQ, an off-policy actor-critic approach that uses data …

被引用次数：253 相关文章所有 4 个版本

[PDF] openreview.net

Image augmentation is all you need: Regularizing deep reinforcement learning from pixels

D Yarats, I Kostrikov, R Fergus - International conference on …, 2021 - openreview.net

We propose a simple data augmentation technique that can be applied to standard model-
free reinforcement learning algorithms, enabling robust learning directly from pixels without …

被引用次数：381 相关文章所有 6 个版本

[PDF] arxiv.org

Solving rubik's cube with a robot hand

I Akkaya, M Andrychowicz, M Chociej, M Litwin… - arXiv preprint arXiv …, 2019 - arxiv.org

We demonstrate that models trained only in simulation can be used to solve a manipulation
problem of unprecedented complexity on a real robot. This is made possible by two key …

被引用次数：1098 相关文章所有 7 个版本

[PDF] neurips.cc

Critic regularized regression

Z Wang, A Novikov, K Zolna, JS Merel… - Advances in …, 2020 - proceedings.neurips.cc

Offline reinforcement learning (RL), also known as batch RL, offers the prospect of policy
optimization from large pre-recorded datasets without online environment interaction. It …

被引用次数：301 相关文章所有 9 个版本

[PDF] mlr.press

Learning latent dynamics for planning from pixels

D Hafner, T Lillicrap, I Fischer… - International …, 2019 - proceedings.mlr.press

Planning has been very successful for control tasks with known environment dynamics. To
leverage planning in unknown environments, the agent needs to learn the dynamics from …

被引用次数：1484 相关文章所有 10 个版本

[PDF] arxiv.org

Image augmentation is all you need: Regularizing deep reinforcement learning from pixels

I Kostrikov, D Yarats, R Fergus - arXiv preprint arXiv:2004.13649, 2020 - arxiv.org

We propose a simple data augmentation technique that can be applied to standard model-
free reinforcement learning algorithms, enabling robust learning directly from pixels without …

被引用次数：371 相关文章所有 3 个版本