Large multilingual models pivot zero-shot multimodal learning across languages

J Hu, Y Yao, C Wang, S Wang, Y Pan, Q Chen… - arXiv preprint arXiv …, 2023 - arxiv.org
Recently there has been a significant surge in multimodal learning in terms of both image-to-
text and text-to-image generation. However, the success is typically limited to English …

Configurable foundation models: Building llms from a modular perspective

C Xiao, Z Zhang, C Song, D Jiang, F Yao, X Han… - arXiv preprint arXiv …, 2024 - arxiv.org
Advancements in LLMs have recently unveiled challenges tied to computational efficiency
and continual scalability due to their requirements of huge parameters, making the …

Pea-diffusion: Parameter-efficient adapter with knowledge distillation in non-english text-to-image generation

J Ma, C Chen, Q Xie, H Lu - European Conference on Computer Vision, 2025 - Springer
Text-to-image diffusion models are well known for their ability to generate realistic images
based on textual prompts. However, the existing works have predominantly focused on …

Text-to-Image Synthesis With Generative Models: Methods, Datasets, Performance Metrics, Challenges, and Future Direction

SK Alhabeeb, AA Al-Shargabi - IEEE Access, 2024 - ieeexplore.ieee.org
Text-to-image synthesis, the process of turning words into images, opens up a world of
creative possibilities, and meets the growing need for engaging visual experiences in a …

Pai-diffusion: Constructing and serving a family of open chinese diffusion models for text-to-image synthesis on the cloud

C Wang, Z Duan, B Liu, X Zou, C Chen, K Jia… - arXiv preprint arXiv …, 2023 - arxiv.org
Text-to-image synthesis for the Chinese language poses unique challenges due to its large
vocabulary size, and intricate character relationships. While existing diffusion models have …

[PDF][PDF] Cross-lingual Transfer in Generative AI-Based Educational Platforms for Equitable and Personalized Learning

N Shoeibi - 2023 - ceur-ws.org
This doctoral thesis explores the integration of Generative AI, specifically Large Language
Models (LLMs) and diffusion models, in educational platforms. Emphasis is placed on cross …