Deep reinforcement learning for cyber security

TT Nguyen, VJ Reddi - IEEE Transactions on Neural Networks …, 2021 - ieeexplore.ieee.org
The scale of Internet-connected systems has increased considerably, and these systems are
being exposed to cyberattacks more than ever. The complexity and dynamics of …

Reinforcement learning-based physical cross-layer security and privacy in 6G

X Lu, L Xiao, P Li, X Ji, C Xu, S Yu… - … Surveys & Tutorials, 2022 - ieeexplore.ieee.org
Sixth-generation (6G) cellular systems will have an inherent vulnerability to physical (PHY)-
layer attacks and privacy leakage, due to the large-scale heterogeneous networks with …

Mastering diverse domains through world models

D Hafner, J Pasukonis, J Ba, T Lillicrap - arXiv preprint arXiv:2301.04104, 2023 - arxiv.org
Developing a general algorithm that learns to solve tasks across a wide range of
applications has been a fundamental challenge in artificial intelligence. Although current …

A generalist agent

S Reed, K Zolna, E Parisotto, SG Colmenarejo… - arXiv preprint arXiv …, 2022 - arxiv.org
Inspired by progress in large-scale language modeling, we apply a similar approach
towards building a single generalist agent beyond the realm of text outputs. The agent …

Mastering visual continuous control: Improved data-augmented reinforcement learning

D Yarats, R Fergus, A Lazaric, L Pinto - arXiv preprint arXiv:2107.09645, 2021 - arxiv.org
We present DrQ-v2, a model-free reinforcement learning (RL) algorithm for visual
continuous control. DrQ-v2 builds on DrQ, an off-policy actor-critic approach that uses data …

Image augmentation is all you need: Regularizing deep reinforcement learning from pixels

D Yarats, I Kostrikov, R Fergus - International conference on …, 2021 - openreview.net
We propose a simple data augmentation technique that can be applied to standard model-
free reinforcement learning algorithms, enabling robust learning directly from pixels without …

Solving rubik's cube with a robot hand

I Akkaya, M Andrychowicz, M Chociej, M Litwin… - arXiv preprint arXiv …, 2019 - arxiv.org
We demonstrate that models trained only in simulation can be used to solve a manipulation
problem of unprecedented complexity on a real robot. This is made possible by two key …

Critic regularized regression

Z Wang, A Novikov, K Zolna, JS Merel… - Advances in …, 2020 - proceedings.neurips.cc
Offline reinforcement learning (RL), also known as batch RL, offers the prospect of policy
optimization from large pre-recorded datasets without online environment interaction. It …

Learning latent dynamics for planning from pixels

D Hafner, T Lillicrap, I Fischer… - International …, 2019 - proceedings.mlr.press
Planning has been very successful for control tasks with known environment dynamics. To
leverage planning in unknown environments, the agent needs to learn the dynamics from …

Image augmentation is all you need: Regularizing deep reinforcement learning from pixels

I Kostrikov, D Yarats, R Fergus - arXiv preprint arXiv:2004.13649, 2020 - arxiv.org
We propose a simple data augmentation technique that can be applied to standard model-
free reinforcement learning algorithms, enabling robust learning directly from pixels without …