Reinforcement learning in game industry—review, prospects and challenges

K Souchleris, GK Sidiropoulos, GA Papakostas - Applied Sciences, 2023 - mdpi.com
This article focuses on the recent advances in the field of reinforcement learning (RL) as well
as the present state–of–the–art applications in games. First, we give a general panorama of …

Why generalization in rl is difficult: Epistemic pomdps and implicit partial observability

D Ghosh, J Rahme, A Kumar, A Zhang… - Advances in neural …, 2021 - proceedings.neurips.cc
Generalization is a central challenge for the deployment of reinforcement learning (RL)
systems in the real world. In this paper, we show that the sequential structure of the RL …

Decoupling value and policy for generalization in reinforcement learning

R Raileanu, R Fergus - International Conference on …, 2021 - proceedings.mlr.press
Standard deep reinforcement learning algorithms use a shared representation for the policy
and value function, especially when training directly from images. However, we argue that …

Automatic data augmentation for generalization in reinforcement learning

R Raileanu, M Goldstein, D Yarats… - Advances in …, 2021 - proceedings.neurips.cc
Deep reinforcement learning (RL) agents often fail to generalize beyond their training
environments. To alleviate this problem, recent work has proposed the use of data …

On the importance of exploration for generalization in reinforcement learning

Y Jiang, JZ Kolter, R Raileanu - Advances in Neural …, 2024 - proceedings.neurips.cc
Existing approaches for improving generalization in deep reinforcement learning (RL) have
mostly focused on representation learning, neglecting RL-specific aspects such as …

Secant: Self-expert cloning for zero-shot generalization of visual policies

L Fan, G Wang, DA Huang, Z Yu, L Fei-Fei… - arXiv preprint arXiv …, 2021 - arxiv.org
Generalization has been a long-standing challenge for reinforcement learning (RL). Visual
RL, in particular, can be easily distracted by irrelevant factors in high-dimensional …

Transformer Decoder-Based Enhanced Exploration Method to Alleviate Initial Exploration Problems in Reinforcement Learning

D Kyoung, Y Sung - Sensors, 2023 - mdpi.com
In reinforcement learning, the epsilon (ε)-greedy strategy is commonly employed as an
exploration technique This method, however, leads to extensive initial exploration and …

The Generalization Gap in Offline Reinforcement Learning

I Mediratta, Q You, M Jiang, R Raileanu - arXiv preprint arXiv:2312.05742, 2023 - arxiv.org
Despite recent progress in offline learning, these methods are still trained and tested on the
same environment. In this paper, we compare the generalization abilities of widely used …

Continuous Autonomous Ship Learning Framework for Human Policies on Simulation

J Kim, J Park, K Cho - Applied Sciences, 2022 - mdpi.com
Considering autonomous navigation in busy marine traffic environments (including harbors
and coasts), major study issues to be solved for autonomous ships are avoidance of static …

A Study of Generalization in Offline Reinforcement Learning

I Mediratta, Q You, M Jiang… - NeurIPS 2023 Workshop …, 2023 - openreview.net
Despite the recent progress in offline reinforcement learning (RL) algorithms, agents are
usually trained and tested on the same environment. In this paper, we perform an in-depth …