Double reinforcement learning for efficient off-policy evaluation in Markov decision processes. N. Kallus, M. Uehara. Journal of Machine Learning Research 21(167):1–63, 2020. Cited by 194.
Minimax weight and Q-function learning for off-policy evaluation. M. Uehara, J. Huang, N. Jiang. International Conference on Machine Learning, pp. 9659–9668, 2020. Cited by 181.
Pessimistic model-based offline reinforcement learning under partial coverage. M. Uehara, W. Sun. International Conference on Learning Representations, 2022. Cited by 139.
Representation learning for online and offline RL in low-rank MDPs. M. Uehara, X. Zhang, W. Sun. International Conference on Learning Representations, 2022. Cited by 131.
Generative adversarial nets from a density ratio estimation perspective. M. Uehara, I. Sato, M. Suzuki, K. Nakayama, Y. Matsuo. arXiv preprint arXiv:1610.02920, 2016. Cited by 103.
Efficiently breaking the curse of horizon in off-policy evaluation with double reinforcement learning. N. Kallus, M. Uehara. Operations Research 70(6):3282–3302, 2022. Cited by 97*.
Mitigating covariate shift in imitation learning via offline data with partial coverage. J. Chang, M. Uehara, D. Sreenivas, R. Kidambi, W. Sun. Advances in Neural Information Processing Systems 34, pp. 965–979, 2021. Cited by 82.
Efficient reinforcement learning in block MDPs: A model-free representation learning approach. X. Zhang, Y. Song, M. Uehara, M. Wang, A. Agarwal, W. Sun. International Conference on Machine Learning, pp. 26517–26547, 2022. Cited by 61.
Causal inference under unmeasured confounding with negative controls: A minimax learning approach. N. Kallus, X. Mao, M. Uehara. arXiv preprint arXiv:2103.14029, 2021. Cited by 61.
Finite sample analysis of minimax offline reinforcement learning: Completeness, fast rates and first-order efficiency. M. Uehara, M. Imaizumi, N. Jiang, N. Kallus, W. Sun, T. Xie. arXiv preprint arXiv:2102.02981, 2021. Cited by 60.
Intrinsically efficient, stable, and bounded off-policy evaluation for reinforcement learning. N. Kallus, M. Uehara. Advances in Neural Information Processing Systems 32, 2019. Cited by 54.
A review of off-policy evaluation in reinforcement learning. M. Uehara, C. Shi, N. Kallus. arXiv preprint arXiv:2212.06355, 2022. Cited by 45.
Off-policy evaluation and learning for external validity under a covariate shift. M. Uehara, M. Kato, S. Yasui. Advances in Neural Information Processing Systems 33, pp. 49–61, 2020. Cited by 45*.
Statistically efficient off-policy policy gradients. N. Kallus, M. Uehara. International Conference on Machine Learning, pp. 5089–5100, 2020. Cited by 42.
PAC reinforcement learning for predictive state representations. W. Zhan, M. Uehara, W. Sun, J. D. Lee. International Conference on Learning Representations, 2023. Cited by 38.
A minimax learning approach to off-policy evaluation in confounded partially observable Markov decision processes. C. Shi, M. Uehara, J. Huang, N. Jiang. International Conference on Machine Learning, pp. 20057–20094, 2022. Cited by 36.
Provably efficient reinforcement learning in partially observable dynamical systems. M. Uehara, A. Sekhari, J. D. Lee, N. Kallus, W. Sun. Advances in Neural Information Processing Systems 35, pp. 578–592, 2022. Cited by 34.
Localized debiased machine learning: Efficient inference on quantile treatment effects and beyond. N. Kallus, X. Mao, M. Uehara. Journal of Machine Learning Research 25(16):1–59, 2024. Cited by 30*.
Optimal off-policy evaluation from multiple logging policies. N. Kallus, Y. Saito, M. Uehara. International Conference on Machine Learning, pp. 5247–5256, 2021. Cited by 29.
Provable offline reinforcement learning with human feedback. W. Zhan, M. Uehara, N. Kallus, J. D. Lee, W. Sun. ICML 2023 Workshop: The Many Facets of Preference-Based Learning, 2023. Cited by 25.