Yu Bai 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	2569	2508
h 指数	25	25
i10 指数	40	40

740

370

185

555

2017201820192020202120222023202417 44 83 170 352 480 739 682

开放获取的出版物数量

查看全部

16 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Song MeiAssistant Professor at UC Berkeley在 berkeley.edu 的电子邮件经过验证
Huan WangSalesforce Research在 yale.edu 的电子邮件经过验证
Caiming XiongSalesforce Research在 salesforce.com 的电子邮件经过验证
Chi JinAssistant Professor, Princeton University在 princeton.edu 的电子邮件经过验证
Yu-Xiang WangAssociate Professor @ UC San Diego在 ucsd.edu 的电子邮件经过验证
Tiancheng YuTwo Sigma在 mit.edu 的电子邮件经过验证
Nan JiangAssistant Professor of Computer Science, UIUC在 illinois.edu 的电子邮件经过验证
Jason D. LeeAssociate Professor of Electrical Engineering and Computer Science, Princeton University在 princeton.edu 的电子邮件经过验证
Tengyang XieAssistant Professor of Computer Science, University of Wisconsin-Madison在 cs.wisc.edu 的电子邮件经过验证
Minshuo ChenPrinceton University在 princeton.edu 的电子邮件经过验证
Andrea MontanariProfessor of Statistics and Mathematics, Stanford University在 stanford.edu 的电子邮件经过验证
Qinghua LiuPrinceton University在 princeton.edu 的电子邮件经过验证
Ming YinPrinceton University在 princeton.edu 的电子邮件经过验证
Ziang SongStanford University在 stanford.edu 的电子邮件经过验证
Sham M KakadeHarvard University在 seas.harvard.edu 的电子邮件经过验证
Tuo ZhaoAssociate Professor, Georgia Tech在 gatech.edu 的电子邮件经过验证
Mengdi WangCenter for Statistics & Machine Learning, ECE, Princeton University在 princeton.edu 的电子邮件经过验证
Edo Libertypinecone.io在 edoliberty.com 的电子邮件经过验证
Tianyu GuoPh.D. student, UC Berkeley在 berkeley.edu 的电子邮件经过验证
Aadyot BhatnagarMachine Learning Scientist, Profluent Bio在 profluent.bio 的电子邮件经过验证

关注

Yu Bai

OpenAI

在 salesforce.com 的电子邮件经过验证 - 首页

Machine Learning Statistics


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
The landscape of empirical risk for nonconvex losses S Mei, Y Bai, A Montanari The Annals of Statistics 46 (6A), 2747-2774, 2018	354	2018
Provable self-play algorithms for competitive reinforcement learning Y Bai, C Jin International conference on machine learning, 551-560, 2020	171	2020
Policy finetuning: Bridging sample-efficient offline and online reinforcement learning T Xie, N Jiang, H Wang, C Xiong, Y Bai Advances in neural information processing systems 34, 27395-27407, 2021	158	2021
Near-Optimal Reinforcement Learning with Self-Play Y Bai, C Jin, T Yu Advances in Neural Information Processing Systems, 2020, 2020	144	2020
A sharp analysis of model-based reinforcement learning with self-play Q Liu, T Yu, Y Bai, C Jin International Conference on Machine Learning, 7001-7010, 2021	142	2021
Beyond linearization: On quadratic and higher-order approximation of wide neural networks Y Bai, JD Lee International Conference on Learning Representations (ICLR) 2020, 2019	131	2019
Proxquant: Quantized neural networks via proximal operators Y Bai, YX Wang, E Liberty International Conference on Learning Representations (ICLR) 2019, 2018	120	2018
Provably Efficient Q-Learning with Low Switching Cost Y Bai, T Xie, N Jiang, YX Wang Advances in Neural Information Processing Systems, 2019, 2019	104	2019
Transformers as statisticians: Provable in-context learning with in-context algorithm selection Y Bai, F Chen, H Wang, C Xiong, S Mei Advances in neural information processing systems 36, 2024	100	2024
When can we learn general-sum Markov games with a large number of players sample-efficiently? Z Song, S Mei, Y Bai International Conference on Learning Representations (ICLR) 2022, 2021	99	2021
Near-optimal provable uniform convergence in offline policy evaluation for reinforcement learning M Yin, Y Bai, YX Wang International Conference on Artificial Intelligence and Statistics, 1567-1575, 2021	90*	2021
Approximability of discriminators implies diversity in GANs Y Bai, T Ma, A Risteski International Conference on Learning Representations (ICLR) 2019, 2018	85	2018
How important is the train-validation split in meta-learning? Y Bai, M Chen, P Zhou, T Zhao, J Lee, S Kakade, H Wang, C Xiong International Conference on Machine Learning, 543-553, 2021	78	2021
Sample-efficient learning of stackelberg equilibria in general-sum games Y Bai, C Jin, H Wang, C Xiong Advances in Neural Information Processing Systems 34, 25799-25811, 2021	71	2021
Near-optimal offline reinforcement learning via double variance reduction M Yin, Y Bai, YX Wang Advances in neural information processing systems 34, 7677-7688, 2021	66	2021
Subgradient descent learns orthogonal dictionaries Y Bai, Q Jiang, J Sun International Conference on Learning Representations (ICLR) 2019, 2018	59	2018
The role of coverage in online reinforcement learning T Xie, DJ Foster, Y Bai, N Jiang, SM Kakade arXiv preprint arXiv:2210.04157, 2022	57	2022
Towards understanding hierarchical learning: Benefits of neural representations M Chen, Y Bai, JD Lee, T Zhao, H Wang, C Xiong, R Socher Advances in Neural Information Processing Systems, 2020, 2020	48	2020
Don't Just Blame Over-parametrization for Over-confidence: Theoretical Analysis of Calibration in Binary Classification Y Bai, S Mei, H Wang, C Xiong International Conference on Machine Learning, 566-576, 2021	46	2021
Unified algorithms for rl with decision-estimation coefficients: No-regret, pac, and reward-free learning F Chen, S Mei, Y Bai arXiv preprint arXiv:2209.11745, 2022	34	2022

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用