Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback S Casper, X Davies, C Shi, TK Gilbert, J Scheurer, J Rando, R Freedman, ... arXiv preprint arXiv:2307.15217, 2023 | 239 | 2023 |
Asking Easy Questions: A User-Friendly Approach to Active Reward Learning E Bıyık, M Palan, NC Landolfi, DP Losey, D Sadigh arXiv preprint arXiv:1910.04365, 2019 | 119 | 2019 |
Batch Active Preference-Based Learning of Reward Functions E Bıyık, D Sadigh Proceedings of the 2nd Conference on Robot Learning 87 (Proceedings of …, 2018 | 101 | 2018 |
When humans aren't optimal: Robots that collaborate with risk-aware humans M Kwon, E Biyik, A Talati, K Bhasin, DP Losey, D Sadigh Proceedings of the 2020 ACM/IEEE international conference on human-robot …, 2020 | 100 | 2020 |
Learning reward functions from diverse sources of human feedback: Optimally integrating demonstrations and preferences E Bıyık, DP Losey, M Palan, NC Landolfi, G Shevchuk, D Sadigh The International Journal of Robotics Research 41 (1), 45-67, 2022 | 92 | 2022 |
Active preference-based gaussian process regression for reward learning E Bıyık, N Huynh, MJ Kochenderfer, D Sadigh arXiv preprint arXiv:2005.02575, 2020 | 92 | 2020 |
Reinforcement Learning based Control of Imitative Policies for Near-Accident Driving Z Cao, E Bıyık, WZ Wang, A Raventos, A Gaidon, G Rosman, D Sadigh arXiv preprint arXiv:2007.00178, 2020 | 63 | 2020 |
Batch Active Learning Using Determinantal Point Processes E Bıyık, K Wang, N Anari, D Sadigh arXiv preprint arXiv:1906.07975, 2019 | 57 | 2019 |
Learning how to dynamically route autonomous vehicles on shared roads DA Lazar, E Bıyık, D Sadigh, R Pedarsani Transportation Research Part C: Emerging Technologies 130, 103258, 2021 | 43 | 2021 |
Learning multimodal rewards from rankings V Myers, E Biyik, N Anari, D Sadigh Conference on Robot Learning, 342-352, 2022 | 42 | 2022 |
Active learning of reward dynamics from hierarchical queries C Basu, E Bıyık, Z He, M Singhal, D Sadigh 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2019 | 39 | 2019 |
Profile‐encoding reconstruction for multiple‐acquisition balanced steady‐state free precession imaging E Ilicak, LK Senel, E Biyik, T Çukur Magnetic resonance in medicine 78 (4), 1316-1329, 2017 | 37 | 2017 |
Roial: Region of interest active learning for characterizing exoskeleton gait preference landscapes K Li, M Tucker, E Bıyık, E Novoseller, JW Burdick, Y Sui, D Sadigh, Y Yue, ... 2021 IEEE International Conference on Robotics and Automation (ICRA), 3212-3218, 2021 | 35 | 2021 |
The green choice: Learning and influencing human decisions on shared roads E Bıyık, DA Lazar, D Sadigh, R Pedarsani 2019 IEEE 58th Conference on Decision and Control (CDC), 347-354, 2019 | 34 | 2019 |
Altruistic autonomy: Beating congestion on shared roads E Bıyık, DA Lazar, R Pedarsani, D Sadigh International Workshop on the Algorithmic Foundations of Robotics, 887-904, 2018 | 29 | 2018 |
Reconstruction by calibration over tensors for multi‐coil multi‐acquisition balanced SSFP imaging E Biyik, E Ilicak, T Cukur Magnetic resonance in medicine 79 (5), 2542-2554, 2018 | 29 | 2018 |
Learning Reward Functions from Scale Feedback N Wilde, E Bıyık, D Sadigh, SL Smith arXiv preprint arXiv:2110.00284, 2021 | 28 | 2021 |
APReL: A Library for Active Preference-based Reward Learning Algorithms E Bıyık, A Talati, D Sadigh Proceedings of the 2022 ACM/IEEE International Conference on Human-Robot …, 2022 | 27 | 2022 |
Emergent Prosociality in Multi-Agent Games Through Gifting WZ Wang, M Beliaev, E Bıyık, DA Lazar, R Pedarsani, D Sadigh arXiv preprint arXiv:2105.06593, 2021 | 27 | 2021 |
Real-Time Detection, Tracking and Classification of Multiple Moving Objects in UAV Videos HC Baykara, E Bıyık, G Gül, D Onural, AS Öztürk, İ Yıldız Tools with Artificial Intelligence (ICTAI), 2017 IEEE 29th International …, 2017 | 26 | 2017 |