Mitigating hallucination in large multi-modal models via robust instruction tuning F Liu, K Lin, L Li, J Wang, Y Yacoob, L Wang ICLR 2024, 2023 | 174* | 2023 |
Visual news: Benchmark and challenges in news image captioning F Liu, Y Wang, T Wang, V Ordonez EMNLP 2021, 2021 | 103* | 2021 |
HallusionBench: You see what you think? or you think what you see? an image-context reasoning benchmark challenging for GPT-4V(ision), LLaVA-1.5, and other multi-modality models F Liu*, T Guan*, X Wu, R Xian, Z Li, X Wang, X Liu, Y Yacoob, D Manocha, ... CVPR 2024, 2023 | 97* | 2023 |
MMC: Advancing multimodal chart understanding with large-scale instruction tuning F Liu, X Wang, W Yao, J Chen, K Song, S Cho, Y Yacoob, D Yu NAACL 2024, 2024 | 31 | 2024 |
COVID-VTS: Fact Extraction and Verification on Short Video Platforms F Liu, Y Yacoob, A Shrivastava EACL 2023, 2023 | 27 | 2023 |
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences X Wang, Y Zhou, X Liu, H Lu, Y Xu, F He, J Yoon, T Lu, G Bertasius, F Liu, ... ACL 2024, 2024 | 20 | 2024 |
Towards understanding in-context learning with contrastive demonstrations and saliency maps F Liu*, P Xu*, Z Li* arXiv preprint arXiv:2307.05052, 2023 | 20 | 2023 |
DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents F Liu, H Tan, C Tensmeyer ICPRAI 2024, 2023 | 20 | 2023 |
Large language models and causal inference in collaboration: A comprehensive survey X Liu, P Xu, J Wu, J Yuan, Y Yang, Y Zhou, F Liu, T Guan, H Wang, T Yu, ... arXiv preprint arXiv:2403.09606, 2024 | 10 | 2024 |
On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities X Wu, R Xian, T Guan, J Liang, S Chakraborty, F Liu, B Sadler, ... CVPR 2024 Workshop on Vision and Language for Autonomous Driving and Robotics, 2024 | 4 | 2024 |
Mosaic IT: Enhancing Instruction Tuning with Data Mosaics M Li, P Chen, C Wang, H Zhao, Y Liang, Y Hou, F Liu, T Zhou arXiv preprint arXiv:2405.13326, 2024 | 1 | 2024 |
From Multimodal LLM to Human-level AI: Modality, Instruction, Reasoning, Efficiency and Beyond H Fei, Y Yao, Z Zhang, F Liu, A Zhang, T Chua LREC-COLING 2024, 2024 | | 2024 |