Knowledge graphs meet multi-modal learning: A comprehensive survey

Z Chen, Y Zhang, Y Fang, Y Geng, L Guo… - arXiv preprint arXiv …, 2024 - arxiv.org
Knowledge Graphs (KGs) play a pivotal role in advancing various AI applications, with the
semantic web community's exploration into multi-modal dimensions unlocking new avenues …

Harnessing GPT-4V(ision) for Insurance: A Preliminary Exploration

C Lin, H Lyu, J Luo, X Xu - arXiv preprint arXiv:2404.09690, 2024 - arxiv.org
The emergence of Large Multimodal Models (LMMs) marks a significant milestone in the
development of artificial intelligence. Insurance, as a vast and complex discipline, involves a …

Joint Visual and Text Prompting for Improved Object-Centric Perception with Multimodal Large Language Models

S Jiang, Y Zhang, C Zhou, Y Jin, Y Feng, J Wu… - arXiv preprint arXiv …, 2024 - arxiv.org
Multimodal Large Language Models (MLLMs) such as GPT-4V and Gemini Pro face
challenges in achieving human-level perception in Visual Question Answering (VQA) …