- 学术资源搜索

Applications of multi-agent reinforcement learning in future internet: A comprehensive survey

T Li, K Zhu, NC Luong, D Niyato, Q Wu… - … Surveys & Tutorials, 2022 - ieeexplore.ieee.org

Future Internet involves several emerging technologies such as 5G and beyond 5G
networks, vehicular networks, unmanned aerial vehicle (UAV) networks, and Internet of …

被引用次数：141 相关文章所有 7 个版本

[PDF] mlr.press

Sunrise: A simple unified framework for ensemble learning in deep reinforcement learning

K Lee, M Laskin, A Srinivas… - … Conference on Machine …, 2021 - proceedings.mlr.press

Off-policy deep reinforcement learning (RL) has been successful in a range of challenging
domains. However, standard off-policy RL algorithms can suffer from several issues, such as …

被引用次数：256 相关文章所有 6 个版本

[PDF] usp.br

Survey on Machine Learning-Enabled Network Slicing: Covering the Entire Life Cycle

A Donatti, SL Correa, JSB Martins… - … on Network and …, 2023 - ieeexplore.ieee.org

Network slicing (NS) is becoming an essential element of service management and
orchestration in communication networks, starting from mobile cellular networks and …

被引用次数：18 相关文章所有 6 个版本

[PDF] arxiv.org

Dealing with sparse rewards in reinforcement learning

J Hare - arXiv preprint arXiv:1910.09281, 2019 - arxiv.org

Successfully navigating a complex environment to obtain a desired outcome is a difficult
task, that up to recently was believed to be capable only by humans. This perception has …

被引用次数：101 相关文章所有 3 个版本

[PDF] mlr.press

Reducing variance in temporal-difference value estimation via ensemble of deep networks

L Liang, Y Xu, S McAleer, D Hu, A Ihler… - International …, 2022 - proceedings.mlr.press

In temporal-difference reinforcement learning algorithms, variance in value estimation can
cause instability and overestimation of the maximal target value. Many algorithms have been …

被引用次数：22 相关文章所有 9 个版本

[PDF] arxiv.org

Empirical design in reinforcement learning

A Patterson, S Neumann, M White, A White - arXiv preprint arXiv …, 2023 - arxiv.org

Empirical design in reinforcement learning is no small task. Running good experiments
requires attention to detail and at times significant computational resources. While compute …

被引用次数：35 相关文章所有 2 个版本

[PDF] neurips.cc

Understanding Deep Neural Function Approximation in Reinforcement Learning via -Greedy Exploration

F Liu, L Viano, V Cevher - Advances in Neural Information …, 2022 - proceedings.neurips.cc

This paper provides a theoretical study of deep neural function approximation in
reinforcement learning (RL) with the $\epsilon $-greedy exploration under the online setting …

被引用次数：15 相关文章所有 11 个版本

[PDF] frontiersin.org

Constrained deep q-learning gradually approaching ordinary q-learning

S Ohnishi, E Uchibe, Y Yamaguchi… - Frontiers in …, 2019 - frontiersin.org

A deep Q network (DQN)(Mnih et al.,) is an extension of Q learning, which is a typical deep
reinforcement learning method. In DQN, a Q function expresses all action values under all …

被引用次数：71 相关文章所有 8 个版本

Double sparse deep reinforcement learning via multilayer sparse coding and nonconvex regularized pruning

H Zhao, J Wu, Z Li, W Chen… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Deep reinforcement learning (DRL), which highly depends on the data representation, has
shown its potential in many practical decision-making problems. However, the process of …

被引用次数：26 相关文章所有 4 个版本

[PDF] arxiv.org

T-soft update of target network for deep reinforcement learning

T Kobayashi, WEL Ilboudo - Neural Networks, 2021 - Elsevier

This paper proposes a new robust update rule of target network for deep reinforcement
learning (DRL), to replace the conventional update rule, given as an exponential moving …

被引用次数：49 相关文章所有 6 个版本