Applications of multi-agent reinforcement learning in future internet: A comprehensive survey
Future Internet involves several emerging technologies such as 5G and beyond 5G
networks, vehicular networks, unmanned aerial vehicle (UAV) networks, and Internet of …
networks, vehicular networks, unmanned aerial vehicle (UAV) networks, and Internet of …
Sunrise: A simple unified framework for ensemble learning in deep reinforcement learning
Off-policy deep reinforcement learning (RL) has been successful in a range of challenging
domains. However, standard off-policy RL algorithms can suffer from several issues, such as …
domains. However, standard off-policy RL algorithms can suffer from several issues, such as …
Survey on Machine Learning-Enabled Network Slicing: Covering the Entire Life Cycle
A Donatti, SL Correa, JSB Martins… - … on Network and …, 2023 - ieeexplore.ieee.org
Network slicing (NS) is becoming an essential element of service management and
orchestration in communication networks, starting from mobile cellular networks and …
orchestration in communication networks, starting from mobile cellular networks and …
Dealing with sparse rewards in reinforcement learning
J Hare - arXiv preprint arXiv:1910.09281, 2019 - arxiv.org
Successfully navigating a complex environment to obtain a desired outcome is a difficult
task, that up to recently was believed to be capable only by humans. This perception has …
task, that up to recently was believed to be capable only by humans. This perception has …
Reducing variance in temporal-difference value estimation via ensemble of deep networks
In temporal-difference reinforcement learning algorithms, variance in value estimation can
cause instability and overestimation of the maximal target value. Many algorithms have been …
cause instability and overestimation of the maximal target value. Many algorithms have been …
Empirical design in reinforcement learning
Empirical design in reinforcement learning is no small task. Running good experiments
requires attention to detail and at times significant computational resources. While compute …
requires attention to detail and at times significant computational resources. While compute …
Understanding Deep Neural Function Approximation in Reinforcement Learning via -Greedy Exploration
This paper provides a theoretical study of deep neural function approximation in
reinforcement learning (RL) with the $\epsilon $-greedy exploration under the online setting …
reinforcement learning (RL) with the $\epsilon $-greedy exploration under the online setting …
Constrained deep q-learning gradually approaching ordinary q-learning
S Ohnishi, E Uchibe, Y Yamaguchi… - Frontiers in …, 2019 - frontiersin.org
A deep Q network (DQN)(Mnih et al.,) is an extension of Q learning, which is a typical deep
reinforcement learning method. In DQN, a Q function expresses all action values under all …
reinforcement learning method. In DQN, a Q function expresses all action values under all …
Double sparse deep reinforcement learning via multilayer sparse coding and nonconvex regularized pruning
H Zhao, J Wu, Z Li, W Chen… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Deep reinforcement learning (DRL), which highly depends on the data representation, has
shown its potential in many practical decision-making problems. However, the process of …
shown its potential in many practical decision-making problems. However, the process of …
T-soft update of target network for deep reinforcement learning
T Kobayashi, WEL Ilboudo - Neural Networks, 2021 - Elsevier
This paper proposes a new robust update rule of target network for deep reinforcement
learning (DRL), to replace the conventional update rule, given as an exponential moving …
learning (DRL), to replace the conventional update rule, given as an exponential moving …