An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning

文章

学术资源搜索

获得 5 条结果（用时0.04秒）

我的图书馆

An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning

在引用文章中搜索

[PDF] arxiv.org

Dataset condensation for recommendation

J Wu, W Fan, S Liu, Q Liu, R He, Q Li… - arXiv preprint arXiv …, 2023 - arxiv.org

Training recommendation models on large datasets often requires significant time and
computational resources. Consequently, an emergent imperative has arisen to construct …

被引用次数：6 相关文章所有 2 个版本

[PDF] arxiv.org

Generalized data distribution iteration

J Fan, C Xiao - arXiv preprint arXiv:2206.03192, 2022 - arxiv.org

To obtain higher sample efficiency and superior final performance simultaneously has been
one of the major challenges for deep reinforcement learning (DRL). Previous work could …

被引用次数：13 相关文章所有 8 个版本

[PDF] arxiv.org

A review for deep reinforcement learning in atari: Benchmarks, challenges, and solutions

J Fan - arXiv preprint arXiv:2112.04145, 2021 - arxiv.org

The Arcade Learning Environment (ALE) is proposed as an evaluation platform for
empirically assessing the generality of agents across dozens of Atari 2600 games. ALE …

被引用次数：15 相关文章所有 6 个版本

[PDF] arxiv.org

Differentiable arbitrating in zero-sum markov games

J Wang, M Song, F Gao, B Liu, Z Wang… - arXiv preprint arXiv …, 2023 - arxiv.org

We initiate the study of how to perturb the reward in a zero-sum Markov game with two
players to induce a desirable Nash equilibrium, namely arbitrating. Such a problem admits a …

被引用次数：2 相关文章所有 7 个版本

Optimization in Deep Learning: Loss Landscape, Optimizer Dynamics, and Bi-Level Settings

J Wang - 2024 - search.proquest.com

Abstract Machine learning has witnessed remarkable advancements in recent years,
transforming various industries and domains. Central to the success of machine learning …