RealCQA: Scientific Chart Question Answering as a Test-Bed for First-Order Logic

Y Yao, T Yu, A Zhang, C Wang, J Cui, H Zhu… - arXiv preprint arXiv …, 2024 - arxiv.org

The recent surge of Multimodal Large Language Models (MLLMs) has fundamentally
reshaped the landscape of AI research and industry, shedding light on a promising path …

Cited by 145 Related articles All 3 versions

DocTabQA: Answering Questions from Long Documents Using Tables

H Wang, K Hu, H Dong, L Gao - International Conference on Document …, 2024 - Springer

We study a new problem setting of question answering (QA), referred to as DocTabQA.
Within this setting, given a long document, the goal is to respond to questions by organizing …

Cited by 1 Related articles All 4 versions

EvoChart: A Benchmark and a Self-Training Approach Towards Real-World Chart Understanding

M Huang, L Han, X Zhang, W Wu, J Ma… - arXiv preprint arXiv …, 2024 - arxiv.org

Chart understanding enables automated data analysis for humans, which requires models
to achieve highly accurate visual comprehension. While existing Visual Language Models …

Cited by 1 Related articles All 2 versions

[BOOK][B] Document Analysis and Recognition-ICDAR 2024: 18th International Conference, Athens, Greece, August 30-September 4, 2024, Proceedings, Part I

EHB Smith - 2024 - books.google.com

This six-volume set LNCS 14804-14809 constitutes the proceedings of the 18th International
Conference on Document Analysis and Recognition, ICDAR 2024, held in Athens, Greece …