Qi Cai 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	1133	1131
h 指数	11	11
i10 指数	11	11

0

280

140

70

210

20192020202120222023202417 130 243 266 264 211

开放获取的出版物数量

1 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

Qi Cai

Qi Cai

Northwestern University

在 u.northwestern.edu 的电子邮件经过验证

reinforcement learning optimization machine learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Provably efficient exploration in policy optimization Q Cai, Z Yang, C Jin, Z Wang International Conference on Machine Learning, 1283-1294, 2020	305	2020
Neural policy gradient methods: Global optimality and rates of convergence L Wang, Q Cai, Z Yang, Z Wang International Conference on Learning Representations 2020, 2019	255	2019
Neural Trust Region/Proximal Policy Optimization Attains Globally Optimal Policy B Liu, Q Cai, Z Yang, Z Wang Advances in Neural Information Processing Systems, 10564-10575, 2019	213	2019
Neural temporal-difference learning converges to global optima Q Cai, Z Yang, JD Lee, Z Wang Advances in Neural Information Processing Systems 32, 2019	146*	2019
On the Global Optimality of Model-Agnostic Meta-Learning: Reinforcement Learning and Supervised Learning L Wang, Q Cai, Z Yang, Z Wang International Conference on Machine Learning, 9837-9846, 2020	45	2020
On the global convergence of imitation learning: A case for linear quadratic regulator Q Cai, M Hong, Y Chen, Z Wang arXiv preprint arXiv:1901.03674, 2019	37	2019
Generative adversarial imitation learning with neural network parameterization: Global optimality and convergence rate Y Zhang, Q Cai, Z Yang, Z Wang International conference on machine learning, 11044-11054, 2020	32*	2020
Reinforcement learning from partial observation: Linear function approximation with provable sample efficiency Q Cai, Z Yang, Z Wang International Conference on Machine Learning, 2485-2522, 2022	29*	2022
Represent to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency L Wang, Q Cai, Z Yang, Z Wang The Eleventh International Conference on Learning Representations, 0	24*
Provably efficient offline reinforcement learning for partially observable markov decision processes H Guo, Q Cai, Y Zhang, Z Yang, Z Wang International Conference on Machine Learning, 8016-8038, 2022	16	2022
Can Temporal-Diﬀerence and Q-Learning Learn Representation? A Mean-Field Theory Y Zhang, Q Cai, Z Yang, Y Chen, Z Wang Advances in Neural Information Processing Systems 33, 19680-19692, 2020	15	2020
Neural temporal difference and q learning provably converge to global optima Q Cai, Z Yang, JD Lee, Z Wang Mathematics of Operations Research 49 (1), 619-651, 2024	7	2024
An analysis of attention via the lens of exchangeability and latent variable models Y Zhang, B Liu, Q Cai, L Wang, Z Wang arXiv preprint arXiv:2212.14852, 2022	7	2022
Optimistic Policy Optimization with General Function Approximations Q Cai, Z Yang, C Szepesvari, Z Wang	2	2021
Provably Efficient Reinforcement Learning Q Cai Northwestern University, 2022		2022
BooVI: provably efficient bootstrapped value iteration B Liu, Q Cai, Z Yang, Z Wang Advances in Neural Information Processing Systems 34, 7041-7053, 2021		2021

系统目前无法执行此操作，请稍后再试。

文章 1–16

共建清朗的网络空间,如遇有害信息,请举报。
本站数据皆整合自互联网公开资源索引,方便科研学术方面查询,并不存储相关数据资源;如对此有异议,请联系我们解决.
© 2023 学术资源搜索 @联系我们 | 申请短期会员 | 数据源提交