Wenhao Yang 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	2636	2632
h 指数	7	7
i10 指数	7	7

880

440

220

660

20192020202120222023202417 146 366 649 862 589

开放获取的出版物数量

查看全部

7 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Zhihua ZhangProfessor of Computer Science, Shanghai Jiao Tong University在 zju.edu.cn 的电子邮件经过验证
Shusen WangMeta在 meta.com 的电子邮件经过验证
Xiang LiUniversity of Pennsylvania在 upenn.edu 的电子邮件经过验证
Liangyu ZhangPhD student at Peking University在 pku.edu.cn 的电子邮件经过验证
Tadashi KozunoOMRON SINIC X在 alumni.oist.jp 的电子邮件经过验证
Hao JinPeking University在 pku.edu.cn 的电子邮件经过验证
Michael I. JordanProfessor of Electrical Engineering and Computer Sciences and Professor of Statistics, UC Berkeley在 cs.berkeley.edu 的电子邮件经过验证
Jiadong Liangpeking university在 pku.edu.cn 的电子邮件经过验证
Scott M. JordanPostdoctoral Fellow, University of Alberta在 ualberta.ca 的电子邮件经过验证
Jose BlanchetStanford University在 stanford.edu 的电子邮件经过验证

关注

Wenhao Yang

Stanford University

在 stanford.edu 的电子邮件经过验证 - 首页

Reinforcement Learning Optimization Statistics


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
On the Convergence of FedAvg on Non-IID Data X Li, K Huang, W Yang, S Wang, Z Zhang arXiv preprint arXiv:1907.02189, 2019	2324	2019
Communication-efficient local decentralized SGD methods X Li, W Yang, S Wang, Z Zhang arXiv preprint arXiv:1910.09126, 2019	112*	2019
Toward theoretical understandings of robust Markov decision processes: Sample complexity and asymptotics W Yang, L Zhang, Z Zhang The Annals of Statistics 50 (6), 3223-3248, 2022	59	2022
Federated Reinforcement Learning with Environment Heterogeneity H Jin, Y Peng, W Yang, S Wang, Z Zhang International Conference on Artificial Intelligence and Statistics, 18-37, 2022	56	2022
A regularized approach to sparse optimal policy in reinforcement learning W Yang, X Li, Z Zhang Advances in Neural Information Processing Systems 32, 2019	36*	2019
A Statistical Analysis of Polyak-Ruppert Averaged Q-Learning X Li, W Yang, J Liang, Z Zhang, MI Jordan International Conference on Artificial Intelligence and Statistics, 2207-2261, 2023	18*	2023
Robust Markov Decision Processes without Model Estimation W Yang, H Wang, T Kozuno, SM Jordan, Z Zhang arXiv preprint arXiv:2302.01248, 2023	10*	2023
KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal T Kozuno, W Yang, N Vieillard, T Kitamura, Y Tang, J Mei, P Ménard, ... arXiv preprint arXiv:2205.14211, 2022	7	2022
Semiparametrically efficient off-policy evaluation in linear Markov decision processes C Xie, W Yang, Z Zhang International Conference on Machine Learning, 38227-38257, 2023	4	2023
Finding the Near Optimal Policy via Adaptive Reduced Regularization in MDPs W Yang, X Li, G Xie, Z Zhang arXiv preprint arXiv:2011.00213, 2020	3	2020
Regularization and variance-weighted regression achieves minimax optimality in linear MDPs: theory and practice T Kitamura, T Kozuno, Y Tang, N Vieillard, M Valko, W Yang, J Mei, ... International Conference on Machine Learning, 17135-17175, 2023	2	2023
Semi-infinitely Constrained Markov Decision Processes L Zhang, Y Peng, W Yang, Z Zhang Advances in Neural Information Processing Systems 35, 16808-16820, 2022	2	2022
Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach M Lu, W Yang, L Zhang, Z Zhang arXiv preprint arXiv:2209.05186, 2022	2	2022
Estimation and Inference in Distributional Reinforcement Learning L Zhang, Y Peng, J Liang, W Yang, Z Zhang arXiv preprint arXiv:2309.17262, 2023	1	2023
Distributionally Robust Optimization as a Scalable Framework to Characterize Extreme Value Distributions PK Kuiper, A Hasan, W Yang, J Blanchet, V Tarokh, Y Ng, H Bidkhori The 40th Conference on Uncertainty in Artificial Intelligence, 2024		2024
Semi-Infinitely Constrained Markov Decision Processes and Provably Efficient Reinforcement Learning L Zhang, Y Peng, W Yang, Z Zhang IEEE Transactions on Pattern Analysis & Machine Intelligence, 1-14, 2023		2023

系统目前无法执行此操作，请稍后再试。

文章 1–16

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用