Holistic evaluation for interleaved text-and-image generation

M Liu, Z Xu, Z Lin, T Ashby, J Rimchala, J Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Interleaved text-and-image generation has been an intriguing research direction, where the
models are required to generate both images and text pieces in an arbitrary order. Despite …

Lateralization LoRA: Interleaved Instruction Tuning with Modality-Specialized Adaptations

Z Xu, M Liu, Y Shen, J Rimchala, J Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent advancements in Vision-Language Models (VLMs) have led to the development of
Vision-Language Generalists (VLGs) capable of understanding and generating interleaved …