Revo-lion: Evaluating and refining vision-language instruction tuning datasets

文章

学术资源搜索

获得 3 条结果（用时0.01秒）

我的图书馆

Revo-lion: Evaluating and refining vision-language instruction tuning datasets

在引用文章中搜索

[PDF] thecvf.com

Hallucidoctor: Mitigating hallucinatory toxicity in visual instruction data

Q Yu, J Li, L Wei, L Pang, W Ye, B Qin… - Proceedings of the …, 2024 - openaccess.thecvf.com

Abstract Multi-modal Large Language Models (MLLMs) tuned on machine-generated
instruction-following data have demonstrated remarkable performance in various multimodal …

被引用次数：46 相关文章所有 3 个版本

[PDF] aclanthology.org

A comprehensive survey of hallucination in large language, image, video and audio foundation models

P Sahoo, P Meharia, A Ghosh, S Saha… - Findings of the …, 2024 - aclanthology.org

The rapid advancement of foundation models (FMs) across language, image, audio, and
video domains has shown remarkable capabilities in diverse tasks. However, the …

被引用次数：2 相关文章

[PDF] arxiv.org

Unveiling Hallucination in Text, Image, Video, and Audio Foundation Models: A Comprehensive Survey

P Sahoo, P Meharia, A Ghosh, S Saha, V Jain… - arXiv preprint arXiv …, 2024 - arxiv.org

The rapid advancement of foundation models (FMs) across language, image, audio, and
video domains has shown remarkable capabilities in diverse tasks. However, the …

被引用次数：1 相关文章所有 4 个版本