Applications of multi-agent reinforcement learning in future internet: A comprehensive survey

T Li, K Zhu, NC Luong, D Niyato, Q Wu… - … Surveys & Tutorials, 2022 - ieeexplore.ieee.org
Future Internet involves several emerging technologies such as 5G and beyond 5G
networks, vehicular networks, unmanned aerial vehicle (UAV) networks, and Internet of …

Sunrise: A simple unified framework for ensemble learning in deep reinforcement learning

K Lee, M Laskin, A Srinivas… - … Conference on Machine …, 2021 - proceedings.mlr.press
Off-policy deep reinforcement learning (RL) has been successful in a range of challenging
domains. However, standard off-policy RL algorithms can suffer from several issues, such as …

Survey on Machine Learning-Enabled Network Slicing: Covering the Entire Life Cycle

A Donatti, SL Correa, JSB Martins… - … on Network and …, 2023 - ieeexplore.ieee.org
Network slicing (NS) is becoming an essential element of service management and
orchestration in communication networks, starting from mobile cellular networks and …

Dealing with sparse rewards in reinforcement learning

J Hare - arXiv preprint arXiv:1910.09281, 2019 - arxiv.org
Successfully navigating a complex environment to obtain a desired outcome is a difficult
task, that up to recently was believed to be capable only by humans. This perception has …

Reducing variance in temporal-difference value estimation via ensemble of deep networks

L Liang, Y Xu, S McAleer, D Hu, A Ihler… - International …, 2022 - proceedings.mlr.press
In temporal-difference reinforcement learning algorithms, variance in value estimation can
cause instability and overestimation of the maximal target value. Many algorithms have been …

Empirical design in reinforcement learning

A Patterson, S Neumann, M White, A White - arXiv preprint arXiv …, 2023 - arxiv.org
Empirical design in reinforcement learning is no small task. Running good experiments
requires attention to detail and at times significant computational resources. While compute …

Understanding Deep Neural Function Approximation in Reinforcement Learning via -Greedy Exploration

F Liu, L Viano, V Cevher - Advances in Neural Information …, 2022 - proceedings.neurips.cc
This paper provides a theoretical study of deep neural function approximation in
reinforcement learning (RL) with the $\epsilon $-greedy exploration under the online setting …

Constrained deep q-learning gradually approaching ordinary q-learning

S Ohnishi, E Uchibe, Y Yamaguchi… - Frontiers in …, 2019 - frontiersin.org
A deep Q network (DQN)(Mnih et al.,) is an extension of Q learning, which is a typical deep
reinforcement learning method. In DQN, a Q function expresses all action values under all …

Double sparse deep reinforcement learning via multilayer sparse coding and nonconvex regularized pruning

H Zhao, J Wu, Z Li, W Chen… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Deep reinforcement learning (DRL), which highly depends on the data representation, has
shown its potential in many practical decision-making problems. However, the process of …

T-soft update of target network for deep reinforcement learning

T Kobayashi, WEL Ilboudo - Neural Networks, 2021 - Elsevier
This paper proposes a new robust update rule of target network for deep reinforcement
learning (DRL), to replace the conventional update rule, given as an exponential moving …