Kefan Dong 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	392	392
h 指数	9	9
i10 指数	8	8

120

2019202020212022202320245 43 80 91 106 67

开放获取的出版物数量

查看全部

6 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Tengyu MAStanford University在 stanford.edu 的电子邮件经过验证
Yuan ZhouDepartment of ISE, University of Illinois Urbana-Champaign在 illinois.edu 的电子邮件经过验证
Jian PengHelixon在 helixon.com 的电子邮件经过验证
Liwei WangProfessor, Peking University在 cis.pku.edu.cn 的电子邮件经过验证
Xiaoyu ChenPeking University在 pku.edu.cn 的电子邮件经过验证
Yuanhao WangPrinceton University在 princeton.edu 的电子邮件经过验证
Zhizhou RenUniversity of Illinois at Urbana-Champaign在 illinois.edu 的电子邮件经过验证
Qiang LiuAssistant Professor of Computer Science, UT Austin在 cs.utexas.edu 的电子邮件经过验证
Yuping LuoComputer Science Department, Princeton University在 cs.princeton.edu 的电子邮件经过验证
Yingkai LiYale University在 yale.edu 的电子邮件经过验证

关注

Kefan Dong

Stanford University

在 stanford.edu 的电子邮件经过验证 - 首页

Reinforcement Learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Q-learning with ucb exploration is sample efficient for infinite-horizon mdp K Dong, Y Wang, X Chen, L Wang International Conference on Learning Representations, 2019	112	2019
Exploration via hindsight goal generation Z Ren, K Dong, Y Zhou, Q Liu, J Peng Advances in Neural Information Processing Systems 32, 2019	85	2019
Root-n-regret for learning in markov decision processes with function approximation and low bellman rank K Dong, J Peng, Y Wang, Y Zhou Conference on Learning Theory, 1554-1557, 2020	46	2020
Provable model-based nonlinear bandit and reinforcement learning: Shelve optimism, embrace virtual curvature K Dong, J Yang, T Ma Advances in neural information processing systems 34, 26168-26182, 2021	40	2021
On the expressivity of neural networks for deep reinforcement learning K Dong, Y Luo, T Yu, C Finn, T Ma International conference on machine learning, 2627-2637, 2020	34	2020
Design of experiments for stochastic contextual linear bandits A Zanette, K Dong, JN Lee, E Brunskill Advances in Neural Information Processing Systems 34, 22720-22731, 2021	23	2021
Multinomial logit bandit with low switching cost K Dong, Y Li, Q Zhang, Y Zhou International Conference on Machine Learning, 2607-2615, 2020	19	2020
First steps toward understanding the extrapolation of nonlinear models to unseen domains K Dong, T Ma arXiv preprint arXiv:2211.11719, 2022	11	2022
Asymptotic instance-optimal algorithms for interactive decision making K Dong, T Ma arXiv preprint arXiv:2206.02326, 2022	9	2022
Beyond ntk with vanilla gradient descent: A mean-field analysis of neural networks with polynomial width, samples, and time A Mahankali, H Zhang, K Dong, M Glasgow, T Ma Advances in Neural Information Processing Systems 36, 2024	8	2024
Toward L_∞ Recovery of Nonlinear Functions: A Polynomial Sample Complexity Bound for Gaussian Random Fields K Dong, T Ma The Thirty Sixth Annual Conference on Learning Theory, 2877-2918, 2023	2	2023
Refined analysis of fpl for adversarial markov decision processes Y Wang, K Dong arXiv preprint arXiv:2008.09251, 2020	2	2020
Model-based offline reinforcement learning with local misspecification K Dong, Y Flet-Berliac, A Nie, E Brunskill Proceedings of the AAAI Conference on Artificial Intelligence 37 (6), 7423-7431, 2023	1	2023

系统目前无法执行此操作，请稍后再试。

文章 1–13

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用