Jihyung Kil
Title
Cited by
Year
GPT-4V(ision) is a Generalist Web Agent, if Grounded
B Zheng, B Gou, J Kil, H Sun, Y Su
International Conference on Machine Learning (ICML), 2024
Cited by 48; 2024
PreSTU: Pre-Training for Scene-Text Understanding
J Kil, S Changpinyo, X Chen, H Hu, S Goodman, WL Chao, R Soricut
IEEE/CVF International Conference on Computer Vision (ICCV), 2023
Cited by 21; 2023
One Step at a Time: Long-Horizon Vision-and-Language Navigation with Milestones
CH Song, J Kil, TY Pan, BM Sadler, WL Chao, Y Su
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022
Cited by 20; 2022
Discovering the Unknown Knowns: Turning Implicit Knowledge in the Dataset into Explicit Training Examples for Visual Question Answering
J Kil, C Zhang, D Xuan, WL Chao
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Cited by 20; 2021
Revisiting Document Representations for Large-Scale Zero-Shot Learning
J Kil, WL Chao
NAACL, 2021
Cited by 6; 2021
Dual-View Visual Contextualization for Web Navigation
J Kil, CH Song, B Zheng, X Deng, Y Su, WL Chao
IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2024
Cited by 1; 2024
ARES: Alternating Reinforcement Learning and Supervised Fine-Tuning for Enhanced Multi-Modal Chain-of-Thought Reasoning Through Diverse AI Feedback
JS Byun, J Chun, J Kil, A Perrault
arXiv preprint arXiv:2407.00087, 2024
2024
II-MMR: Identifying and Improving Multi-modal Multi-hop Reasoning in Visual Question Answering
J Kil, F Tavazoee, D Kang, JK Kim
Annual Meeting of the Association for Computational Linguistics (ACL), Findings, 2024
2024