The role of regularization in overparameterized neural networks

S Cayci, N He, R Srikant - SIAM Journal on Optimization, 2024 - SIAM

Natural policy gradient (NPG) methods, equipped with function approximation and entropy
regularization, achieve impressive empirical success in reinforcement learning problems …

Cited by 34 · Related articles · All 5 versions

Finite-time analysis of entropy-regularized neural natural actor-critic algorithm

S Cayci, N He, R Srikant - arXiv preprint arXiv:2206.00833, 2022 - arxiv.org

Natural actor-critic (NAC) and its variants, equipped with the representation power of neural
networks, have demonstrated impressive empirical success in solving Markov decision …

Cited by 19 · Related articles · All 3 versions

Sample complexity and overparameterization bounds for temporal-difference learning with neural network approximation

S Cayci, S Satpathi, N He… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

In this article, we study the dynamics of temporal-difference (TD) learning with neural
network-based value function approximation over a general state space, namely, neural TD …

Cited by 8 · Related articles · All 7 versions

Investigating overparameterization for non-negative matrix factorization in collaborative filtering

Y Kawakami, M Sugiyama - Proceedings of the 15th ACM Conference …, 2021 - dl.acm.org

Overparameterization is one of the key techniques in modern machine learning, where a
model with the higher complexity can generalize better on test data against the common …

Cited by 5 · Related articles · All 3 versions

TOPS: Transition-Based Volatility-Reduced Policy Search

X Liangliang, L Daoming, P Yangchen - Lecture notes in computer …, 2022 - par.nsf.gov

Existing risk-averse reinforcement learning approaches still face several challenges,
including the lack of global optimality guarantee and the necessity of learning from long-term …
