Merlot: Multimodal neural script knowledge models R Zellers, X Lu, J Hessel, Y Yu, JS Park, J Cao, A Farhadi, Y Choi Advances in Neural Information Processing Systems 34, 23634-23651, 2021 | 348 | 2021 |
VisualCOMET: Reasoning about the Dynamic Context of a Still Image JS Park, C Bhagavatula, R Mottaghi, A Farhadi, Y Choi arXiv preprint arXiv:2004.10796, 2020 | 120 | 2020 |
Adversarial inference for multi-sentence video description JS Park, M Rohrbach, T Darrell, A Rohrbach Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 105 | 2019 |
Natural language rationales with full-stack visual reasoning: From pixels to semantic frames to commonsense graphs A Marasović, C Bhagavatula, JS Park, RL Bras, NA Smith, Y Choi arXiv preprint arXiv:2010.07526, 2020 | 55 | 2020 |
Fusing pre-trained language models with multimodal prompts through reinforcement learning Y Yu, J Chung, H Yun, J Hessel, JS Park, X Lu, R Zellers, P Ammanabrolu, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 34* | 2023 |
The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning J Hessel, JD Hwang, JS Park, R Zellers, C Bhagavatula, A Rohrbach, ... European Conference on Computer Vision, 558-575, 2022 | 30 | 2022 |
Exposing the limits of video-text models through contrast sets JS Park, S Shen, A Farhadi, T Darrell, Y Choi, A Rohrbach Proceedings of the 2022 Conference of the North American Chapter of the …, 2022 | 24 | 2022 |
Identity-aware multi-sentence video description JS Park, T Darrell, A Rohrbach Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 23 | 2020 |
Agent ai: Surveying the horizons of multimodal interaction Z Durante, Q Huang, N Wake, R Gong, JS Park, B Sarkar, R Taori, Y Noda, ... arXiv preprint arXiv:2401.03568, 2024 | 21 | 2024 |
Llc: Accurate, multi-purpose learnt low-dimensional binary codes A Kusupati, M Wallingford, V Ramanujan, R Somani, JS Park, K Pillutla, ... Advances in neural information processing systems 34, 23900-23913, 2021 | 11 | 2021 |
Localized symbolic knowledge distillation for visual commonsense models JS Park, J Hessel, K Chandu, PP Liang, X Lu, P West, Y Yu, Q Huang, ... Advances in Neural Information Processing Systems 36, 2024 | 4 | 2024 |
Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass E Shen, A Fan, SM Pratt, JS Park, M Wallingford, SM Kakade, A Holtzman, ... arXiv preprint arXiv:2405.18400, 2024 | | 2024 |