A survey of zero-shot generalisation in deep reinforcement learning
The study of zero-shot generalisation (ZSG) in deep Reinforcement Learning (RL) aims to
produce RL algorithms whose policies generalise well to novel unseen situations at …
produce RL algorithms whose policies generalise well to novel unseen situations at …
Why generalization in rl is difficult: Epistemic pomdps and implicit partial observability
Generalization is a central challenge for the deployment of reinforcement learning (RL)
systems in the real world. In this paper, we show that the sequential structure of the RL …
systems in the real world. In this paper, we show that the sequential structure of the RL …
Efficient knowledge distillation from model checkpoints
Abstract Knowledge distillation is an effective approach to learn compact models (students)
with the supervision of large and strong models (teachers). As empirically there exists a …
with the supervision of large and strong models (teachers). As empirically there exists a …
PID-inspired inductive biases for deep reinforcement learning in partially observable control tasks
I Char, J Schneider - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Deep reinforcement learning (RL) has shown immense potential for learning to control
systems through data alone. However, one challenge deep RL faces is that the full state of …
systems through data alone. However, one challenge deep RL faces is that the full state of …
Robust predictable control
B Eysenbach, RR Salakhutdinov… - Advances in Neural …, 2021 - proceedings.neurips.cc
Many of the challenges facing today's reinforcement learning (RL) algorithms, such as
robustness, generalization, transfer, and computational efficiency are closely related to …
robustness, generalization, transfer, and computational efficiency are closely related to …
Selective visual representations improve convergence and generalization for embodied ai
Embodied AI models often employ off the shelf vision backbones like CLIP to encode their
visual observations. Although such general purpose representations encode rich syntactic …
visual observations. Although such general purpose representations encode rich syntactic …
Dynamics generalisation in reinforcement learning via adaptive context-aware policies
While reinforcement learning has achieved remarkable successes in several domains, its
real-world application is limited due to many methods failing to generalise to unfamiliar …
real-world application is limited due to many methods failing to generalise to unfamiliar …
Markov‐GAN: Markov image enhancement method for malicious encrypted traffic classification
Z Tang, J Wang, B Yuan, H Li, J Zhang… - IET Information …, 2022 - Wiley Online Library
The rapidly growing encrypted traffic hides a large number of malicious behaviours. The
difficulty of collecting and labelling encrypted traffic makes the class distribution of dataset …
difficulty of collecting and labelling encrypted traffic makes the class distribution of dataset …
A deep residual reinforcement learning algorithm based on Soft Actor-Critic for autonomous navigation
S Wen, Y Shu, A Rad, Z Wen, Z Guo, S Gong - Expert Systems with …, 2025 - Elsevier
The problem of autonomous navigation has attracted significant attention from robotics
research community in the last few decades. In this paper, we address the problem of low …
research community in the last few decades. In this paper, we address the problem of low …
Learn goal-conditioned policy with intrinsic motivation for deep reinforcement learning
It is of significance for an agent to autonomously explore the environment and learn a widely
applicable and general-purpose goal-conditioned policy that can achieve diverse goals …
applicable and general-purpose goal-conditioned policy that can achieve diverse goals …