InstructOCR: Instruction Boosting Scene Text Spotting

C Duan, Q Jiang, P Fu, J Chen, S Li, Z Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
In the field of scene text spotting, previous OCR methods primarily relied on image encoders
and pre-trained text information, but they often overlooked the advantages of incorporating …

TableVQA-Bench: A Visual Question Answering Benchmark on Multiple Table Domains

Y Kim, M Yim, KY Song - arXiv preprint arXiv:2404.19205, 2024 - arxiv.org
In this paper, we establish a benchmark for table visual question answering, referred to as
the TableVQA-Bench, derived from pre-existing table question-answering (QA) and table …