Alborz Geramifard 个人学术档案

引用次数

	总计	2019 年至今
引用	1971	1134
h 指数	22	16
i10 指数	39	27

260

130

195

20072008200920102011201220132014201520162017201820192020202120222023202412 12 18 15 47 88 85 102 106 91 102 138 138 160 227 241 259 107

合著作者

Jonathan P. HowRichard C. Maclaurin Professor of Aerospace Engineering, Massachusetts Institute of Technology在 mit.edu 的电子邮件经过验证
Nicholas RoyMIT在 csail.mit.edu 的电子邮件经过验证
Satwik KotturResearch Scientist, Facebook AI在 fb.com 的电子邮件经过验证
Seungwhan MoonFacebook, Carnegie Mellon University在 fb.com 的电子邮件经过验证
Ahmad BeiramiGoogle DeepMind在 google.com 的电子邮件经过验证
Michael BowlingUniversity of Alberta在 ualberta.ca 的电子邮件经过验证
Paul A CrookResearch Scientist, Meta Platforms, Inc.在 fb.com 的电子邮件经过验证
Nazim Kemal UreIstanbul Technical University在 itu.edu.tr 的电子邮件经过验证
Richard S. SuttonKeen, Amii, and University of Alberta在 richsutton.com 的电子邮件经过验证
Rajen SubbaGoogle在 google.com 的电子邮件经过验证
Girish ChowdharyAssociate Professor在 illinois.edu 的电子邮件经过验证
Chinnadhurai SankarResearch Lead, SliceX AI | ex-Meta AI在 fb.com 的电子邮件经过验证
Ankita DeFacebook在 fb.com 的电子邮件经过验证
Thomas J. WalshSony AI在 sony.com 的电子邮件经过验证
Babak DamavandiMeta Reality Labs在 fb.com 的电子邮件经过验证
Csaba SzepesvariDeepMind & University of Alberta在 cs.ualberta.ca 的电子邮件经过验证
David WhitneyMeta在 meta.com 的电子邮件经过验证
Christoph DannResearch Scientist, Google在 google.com 的电子邮件经过验证
Stefanie TellexBrown University在 cs.brown.edu 的电子邮件经过验证
Will DabneyDeepMind在 google.com 的电子邮件经过验证

关注

Alborz Geramifard

Research Scientist Director at Meta

在 meta.com 的电子邮件经过验证 - 首页

Reinforcement Learning Conversational AI Planning Brain and Cognitive Sciences


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Dyna-style planning with linear function approximation and prioritized sweeping RS Sutton, C Szepesvári, A Geramifard, MP Bowling arXiv preprint arXiv:1206.3285, 2012	231	2012
A tutorial on linear function approximators for dynamic programming and reinforcement learning A Geramifard, TJ Walsh, S Tellex, G Chowdhary, N Roy, JP How Foundations and Trends® in Machine Learning 6 (4), 375-451, 2013	163	2013
Decentralized control of partially observable Markov decision processes C Amato, G Chowdhary, A Geramifard, NK Üre, MJ Kochenderfer 52nd IEEE Conference on Decision and Control, 2398-2405, 2013	150	2013
Cooperative mission planning for multi-UAV teams SS Ponda, LB Johnson, A Geramifard, JP How Handbook of unmanned aerial vehicles 2, 1447-1490, 2015	98	2015
Incremental least-squares temporal difference learning A Geramifard, M Bowling, RS Sutton Proceedings of the 21st national conference on Artificial intelligence …, 2006	92	2006
RLPy: a value-function-based reinforcement learning framework for education and research. A Geramifard, C Dann, RH Klein, W Dabney, JP How J. Mach. Learn. Res. 16 (1), 1573-1578, 2015	91	2015
Online Discovery of Feature Dependencies. A Geramifard, F Doshi, J Redding, N Roy, JP How ICML, 881-888, 2011	82	2011
SIMMC 2.0: A task-oriented dialog dataset for immersive multimodal conversations S Kottur, S Moon, A Geramifard, B Damavandi arXiv preprint arXiv:2104.08667, 2021	79	2021
Situated and interactive multimodal conversations S Moon, S Kottur, PA Crook, A De, S Poddar, T Levin, D Whitney, ... arXiv preprint arXiv:2006.01460, 2020	79	2020
Overview of the ninth dialog system technology challenge: Dstc9 C Gunasekara, S Kim, LF D'Haro, A Rastogi, YN Chen, M Eric, ... arXiv preprint arXiv:2011.06486, 2020	69	2020
iLSTD: Eligibility traces and convergence analysis A Geramifard, M Bowling, M Zinkevich, RS Sutton Advances in Neural Information Processing Systems 19, 2006	65	2006
On the design and use of a micro air vehicle to track and avoid adversaries R He, A Bachrach, M Achtelik, A Geramifard, D Gurdan, S Prentice, ... The International Journal of Robotics Research 29 (5), 529-546, 2010	55	2010
Customized movie trailers A Geramifard US Patent App. 14/105,428, 2015	51	2015
Intelligent cooperative control architecture: a framework for performance improvement using safe learning A Geramifard, J Redding, JP How Journal of Intelligent & Robotic Systems 72, 83-103, 2013	50	2013
Reinforcement learning with misspecified model classes J Joseph, A Geramifard, JW Roberts, JP How, N Roy 2013 IEEE International Conference on Robotics and Automation, 939-946, 2013	47	2013
UAV cooperative control with stochastic risk models A Geramifard, J Redding, N Roy, JP How Proceedings of the 2011 american control conference, 3393-3398, 2011	45	2011
Biased cost pathfinding A Geramifard, P Chubak, V Bulitko Proceedings of the AAAI Conference on Artificial Intelligence and …, 2006	43	2006
An intelligent cooperative control architecture J Redding, A Geramifard, A Undurti, HL Choi, JP How Proceedings of the 2010 American control conference, 57-62, 2010	37	2010
Adaptive planning for Markov decision processes with uncertain transition models via incremental feature dependency discovery NK Ure, A Geramifard, G Chowdhary, JP How Machine Learning and Knowledge Discovery in Databases: European Conference …, 2012	32	2012
Annotation inconsistency and entity bias in MultiWOZ K Qian, A Beirami, Z Lin, A De, A Geramifard, Z Yu, C Sankar arXiv preprint arXiv:2105.14150, 2021	29	2021

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用