Visual genome: Connecting language and vision using crowdsourced dense image annotations R Krishna, Y Zhu, O Groth, J Johnson, K Hata, J Kravitz, S Chen, ... URL http://arxiv. org/abs/1602.07332, 2016 | 5524 | 2016 |
Target-driven visual navigation in indoor scenes using deep reinforcement learning Y Zhu, R Mottaghi, E Kolve, JJ Lim, A Gupta, L Fei-Fei, A Farhadi 2017 IEEE international conference on robotics and automation (ICRA), 3357-3364, 2017 | 1807 | 2017 |
Scene graph generation by iterative message passing D Xu, Y Zhu, CB Choy, L Fei-Fei Proceedings of the IEEE conference on computer vision and pattern …, 2017 | 1341 | 2017 |
DenseFusion: 6D object pose estimation by iterative dense fusion C Wang, D Xu, Y Zhu, R Martín-Martín, C Lu, L Fei-Fei, S Savarese Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2019 | 1038 | 2019 |
Visual7W: Grounded question answering in images Y Zhu, O Groth, M Bernstein, L Fei-Fei arXiv preprint arXiv:1511.03416, 2015 | 963 | 2015 |
AI2-THOR: An interactive 3D environment for visual AI E Kolve, R Mottaghi, D Gordon, Y Zhu, A Gupta, A Farhadi arXiv preprint arXiv:1712.05474, 2017 | 812 | 2017 |
Voyager: An open-ended embodied agent with large language models G Wang, Y Xie, Y Jiang, A Mandlekar, C Xiao, Y Zhu, L Fan, ... arXiv preprint arXiv:2305.16291, 2023 | 428 | 2023 |
Making sense of vision and touch: Self-supervised learning of multimodal representations for contact-rich tasks MA Lee, Y Zhu, K Srinivasan, P Shah, S Savarese, L Fei-Fei, A Garg, ... 2019 International Conference on Robotics and Automation (ICRA), 8943-8950, 2019 | 370 | 2019 |
Reinforcement and imitation learning for diverse visuomotor skills Y Zhu, Z Wang, J Merel, A Rusu, T Erez, S Cabi, S Tunyasuvunakool, ... arXiv preprint arXiv:1802.09564, 2018 | 352 | 2018 |
robosuite: A modular simulation framework and benchmark for robot learning Y Zhu, J Wong, A Mandlekar, R Martín-Martín, A Joshi, S Nasiriany, Y Zhu arXiv preprint arXiv:2009.12293, 2020 | 328 | 2020 |
Reasoning about object affordances in a knowledge base representation Y Zhu, A Fathi, L Fei-Fei Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland …, 2014 | 299 | 2014 |
What matters in learning from offline human demonstrations for robot manipulation A Mandlekar, D Xu, J Wong, S Nasiriany, C Wang, R Kulkarni, L Fei-Fei, ... arXiv preprint arXiv:2108.03298, 2021 | 258 | 2021 |
Minedojo: Building open-ended embodied agents with internet-scale knowledge L Fan, G Wang, Y Jiang, A Mandlekar, Y Yang, H Zhu, A Tang, DA Huang, ... Advances in Neural Information Processing Systems 35, 18343-18362, 2022 | 250 | 2022 |
RoboTurk: A crowdsourcing platform for robotic skill learning through imitation A Mandlekar, Y Zhu, A Garg, J Booher, M Spero, A Tung, J Gao, ... arXiv preprint arXiv:1811.02790, 2018 | 230 | 2018 |
Learning task-oriented grasping for tool manipulation from simulated self-supervision K Fang, Y Zhu, A Garg, A Kurenkov, V Mehta, L Fei-Fei, S Savarese The International Journal of Robotics Research 39 (2-3), 202-216, 2020 | 225 | 2020 |
Neural task programming: Learning to generalize across hierarchical tasks D Xu, S Nair, Y Zhu, J Gao, A Garg, L Fei-Fei, S Savarese 2018 IEEE international conference on robotics and automation (ICRA), 3795-3802, 2018 | 222 | 2018 |
Making sense of vision and touch: Learning multimodal representations for contact-rich tasks MA Lee, Y Zhu, P Zachares, M Tan, K Srinivasan, S Savarese, L Fei-Fei, ... IEEE Transactions on Robotics 36 (3), 582-596, 2020 | 195 | 2020 |
Adversarially Robust Policy Learning: Active Construction of Physically-Plausible Perturbations A Mandlekar, Y Zhu, A Garg, L Fei-Fei, S Savarese IEEE Int’l Conf. on Intelligent Robots and Systems (IROS), 2017 | 193 | 2017 |
Surreal: Open-source reinforcement learning framework and robot manipulation benchmark L Fan, Y Zhu, J Zhu, Z Liu, O Zeng, A Gupta, J Creus-Costa, S Savarese, ... Conference on robot learning, 767-782, 2018 | 178 | 2018 |
Visual semantic planning using deep successor representations Y Zhu, D Gordon, E Kolve, D Fox, L Fei-Fei, A Gupta, R Mottaghi, ... Proceedings of the IEEE international conference on computer vision, 483-492, 2017 | 164 | 2017 |