Yan Duan 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	17374	14818
h 指数	22	20
i10 指数	23	22

3000

1500

750

2250

201520162017201820192020202120222023202451 172 683 1564 2190 2597 2993 2863 2794 1379

开放获取的出版物数量

查看全部

8 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Pieter AbbeelUC Berkeley | Covariant在 cs.berkeley.edu 的电子邮件经过验证
(Peter) Xi Chencovariant.ai | UC Berkeley在 berkeley.edu 的电子邮件经过验证
John SchulmanResearch Scientist, OpenAI在 openai.com 的电子邮件经过验证
Rein HouthooftNetflix Research在 netflix.com 的电子邮件经过验证
Ilya SutskeverCo-Founder and Chief Scientist of OpenAI在 openai.com 的电子邮件经过验证
Haoran TangPhD student in Applied Mathematics; University of California, Berkeley在 math.berkeley.edu 的电子邮件经过验证
Jonathan Ho在 berkeley.edu 的电子邮件经过验证
Ken GoldbergProfessor, UC Berkeley and UCSF在 berkeley.edu 的电子邮件经过验证
Sachin PatilNvidia在 nvidia.com 的电子邮件经过验证
Carlos FlorensaPhD from University of California at Berkeley在 berkeley.edu 的电子邮件经过验证
Ian GoodfellowDeepMind在 deepmind.com 的电子邮件经过验证
Nicolas PapernotUniversity of Toronto and Vector Institute在 utoronto.ca 的电子邮件经过验证
Alex X. LeeResearch Scientist, Google DeepMind在 google.com 的电子邮件经过验证
Sergey LevineUC Berkeley, Physical Intelligence在 eecs.berkeley.edu 的电子邮件经过验证
Trevor DarrellProfessor of Computer Science, U.C. Berkeley在 eecs.berkeley.edu 的电子邮件经过验证
Peter BartlettProfessor, EECS and Statistics, UC Berkeley在 cs.berkeley.edu 的电子邮件经过验证
Jia PanComputer Science, The University of Hong Kong在 cs.hku.hk 的电子邮件经过验证
Ibrahim AwwalPhD Student in Electrical and Computer Engineering, UC San Diego在 eng.ucsd.edu 的电子邮件经过验证
Diederik P. KingmaResearch Scientist, Google Brain在 google.com 的电子邮件经过验证
Prafulla DhariwalResearcher, OpenAI在 openai.com 的电子邮件经过验证

关注

Yan Duan

Covariant.AI

在 covariant.ai 的电子邮件经过验证 - 首页

Robotics Machine Learning Reinforcement Learning Meta Learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
InfoGAN: Interpretable representation learning by information maximizing generative adversarial nets X Chen, Y Duan, R Houthooft, J Schulman, I Sutskever, P Abbeel Advances in Neural Information Processing Systems, 2172-2180, 2016	5334	2016
Benchmarking deep reinforcement learning for continuous control Y Duan, X Chen, R Houthooft, J Schulman, P Abbeel International conference on machine learning, 1329-1338, 2016	2022	2016
RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning Y Duan, J Schulman, X Chen, PL Bartlett, I Sutskever, P Abbeel arXiv preprint arXiv:1611.02779, 2016	1117	2016
Adversarial attacks on neural network policies S Huang, N Papernot, I Goodfellow, Y Duan, P Abbeel arXiv preprint arXiv:1702.02284, 2017	982	2017
Vime: Variational information maximizing exploration R Houthooft, X Chen, Y Duan, J Schulman, F De Turck, P Abbeel Advances in neural information processing systems 29, 2016	931	2016
Motion planning with sequential convex optimization and convex collision checking J Schulman, Y Duan, J Ho, A Lee, I Awwal, H Bradlow, J Pan, S Patil, ... The International Journal of Robotics Research 33 (9), 1251-1270, 2014	858	2014
Evaluating protein transfer learning with TAPE R Rao, N Bhattacharya, N Thomas, Y Duan, P Chen, J Canny, P Abbeel, ... Advances in neural information processing systems 32, 2019	805	2019
Variational lossy autoencoder X Chen, DP Kingma, T Salimans, Y Duan, P Dhariwal, J Schulman, ... arXiv preprint arXiv:1611.02731, 2016	780	2016
One-shot imitation learning Y Duan, M Andrychowicz, B Stadie, OAI Jonathan Ho, J Schneider, ... Advances in neural information processing systems 30, 2017	771	2017
Deep Spatial Autoencoders for Visuomotor Learning C Finn, XY Tan, Y Duan, T Darrell, S Levine, P Abbeel International Conference on Robotics and Automation (ICRA), 2016	708*	2016
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning H Tang, R Houthooft, D Foote, A Stooke, X Chen, Y Duan, J Schulman, ... arXiv preprint arXiv:1611.04717, 2016	679	2016
Model-ensemble trust-region policy optimization T Kurutach, I Clavera, Y Duan, A Tamar, P Abbeel arXiv preprint arXiv:1802.10592, 2018	522	2018
Flow++: Improving flow-based generative models with variational dequantization and architecture design J Ho, X Chen, A Srinivas, Y Duan, P Abbeel International conference on machine learning, 2722-2730, 2019	472	2019
Stochastic neural networks for hierarchical reinforcement learning C Florensa, Y Duan, P Abbeel arXiv preprint arXiv:1704.03012, 2017	417	2017
Deep unsupervised cardinality estimation Z Yang, E Liang, A Kamsetty, C Wu, Y Duan, X Chen, P Abbeel, ... arXiv preprint arXiv:1905.04278, 2019	228	2019
Variance reduction for policy gradient with action-dependent factorized baselines C Wu, A Rajeswaran, Y Duan, V Kumar, AM Bayen, S Kakade, I Mordatch, ... arXiv preprint arXiv:1803.07246, 2018	170	2018
NeuroCard: one cardinality estimator for all tables Z Yang, A Kamsetty, S Luan, E Liang, Y Duan, X Chen, I Stoica arXiv preprint arXiv:2006.08109, 2020	169	2020
The Importance of Sampling in Meta-Reinforcement Learning B Stadie, G Yang, R Houthooft, P Chen, Y Duan, Y Wu, P Abbeel, ... Advances in Neural Information Processing Systems, 9299-9309, 2018	167*	2018
Attacking machine learning with adversarial examples I Goodfellow, N Papernot, S Huang, Y Duan, P Abbeel, J Clark OpenAI Blog 24, 1, 2017	80	2017
Sigma hulls for gaussian belief space planning for imprecise articulated robots amid obstacles A Lee, Y Duan, S Patil, J Schulman, Z McCarthy, J Van Den Berg, ... 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2013	46	2013

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用