PLMM: Personal Large Models on Mobile Devices

Y Gong - arXiv preprint arXiv:2309.14726, 2023 - arxiv.org
Inspired by Federated Learning, in this paper, we propose personal large models that are
distilled from traditional large language models but more adaptive to local users' personal …

Collaborative Modality Fusion for Mitigating Language Bias in Visual Question Answering

Q Lu, S Chen, X Zhu - Journal of Imaging, 2024 - mdpi.com
Language bias stands as a noteworthy concern in visual question answering (VQA),
wherein models tend to rely on spurious correlations between questions and answers for …

Overcoming Language Priors in Visual Question Answering with Cumulative Learning Strategy

A Mao, F Chen, Z Ma, K Lin - Available at SSRN 4740502 - papers.ssrn.com
The performance of visual question answering (VQA) has witnessed great progress over the
last few years. However, many current VQA models tend to rely on superficial linguistic …