关注
Gaurav Mahajan
Gaurav Mahajan
Postdoctoral Associate, Yale University
在 yale.edu 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
On the theory of policy gradient methods: Optimality, approximation, and distribution shift
A Agarwal, SM Kakade, JD Lee, G Mahajan
Journal of Machine Learning Research 22 (98), 1-76, 2021
4152021
Optimality and approximation with policy gradient methods in markov decision processes
A Agarwal, SM Kakade, JD Lee, G Mahajan
Conference on Learning Theory, 64-66, 2020
3822020
Bilinear classes: A structural framework for provable generalization in rl
S Du, S Kakade, J Lee, S Lovett, G Mahajan, W Sun, R Wang
International Conference on Machine Learning, 2826-2836, 2021
2212021
Agnostic -learning with Function Approximation in Deterministic Systems: Near-Optimal Bounds on Approximation Error and Sample Complexity
SS Du, JD Lee, G Mahajan, R Wang
Advances in Neural Information Processing Systems 33, 22327-22337, 2020
63*2020
Noise-tolerant, reliable active classification with comparison queries
M Hopkins, D Kane, S Lovett, G Mahajan
Conference on Learning Theory, 1957-2006, 2020
232020
Realizable learning is all you need
M Hopkins, DM Kane, S Lovett, G Mahajan
Conference on Learning Theory, 3015-3069, 2022
222022
Computational-statistical gap in reinforcement learning
D Kane, S Liu, S Lovett, G Mahajan
Conference on Learning Theory, 1282-1302, 2022
212022
Point location and active learning: Learning halfspaces almost optimally
M Hopkins, D Kane, S Lovett, G Mahajan
2020 IEEE 61st Annual Symposium on Foundations of Computer Science (FOCS …, 2020
192020
Exponential hardness of reinforcement learning with linear function approximation
S Liu, G Mahajan, D Kane, S Lovett, G Weisz, C Szepesvári
The Thirty Sixth Annual Conference on Learning Theory, 1588-1617, 2023
5*2023
Do PAC-Learners Learn the Marginal Distribution?
M Hopkins, DM Kane, S Lovett, G Mahajan
arXiv preprint arXiv:2302.06285, 2023
42023
Convergence of online k-means
G So, G Mahajan, S Dasgupta
International Conference on Artificial Intelligence and Statistics, 8534-8569, 2022
42022
Learning Hidden Markov Models Using Conditional Samples
G Mahajan, S Kakade, A Krishnamurthy, C Zhang
The Thirty Sixth Annual Conference on Learning Theory, 2014-2066, 2023
32023
Learning what to remember
R Bhattacharjee, G Mahajan
International Conference on Algorithmic Learning Theory, 70-89, 2022
22022
Improved classical shadows from local symmetries in the Schur basis
D Grier, S Liu, G Mahajan
arXiv preprint arXiv:2405.09525, 2024
12024
Learning hidden markov models using conditional samples
SM Kakade, A Krishnamurthy, G Mahajan, C Zhang
arXiv preprint arXiv:2302.14753, 2023
12023
Computational and Statistical Complexity of Learning in Sequential Models
G Mahajan
University of California, San Diego, 2023
2023
Convergence of online -means
S Dasgupta, G Mahajan, G So
arXiv preprint arXiv:2202.10640, 2022
2022
系统目前无法执行此操作,请稍后再试。
文章 1–17