LLMs Meet Multimodal Generation and Editing: A Survey

Y He, Z Liu, J Chen, Z Tian, H Liu, X Chi, R Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
With the recent advancement in large language models (LLMs), there is a growing interest in
combining LLMs with multimodal learning. Previous surveys of multimodal large language …

FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation

X Mu, L Chen, B Chen, S Gu, J Bao, D Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
Recently, the application of modern diffusion-based text-to-image generation models for
creating artistic fonts, traditionally the domain of professional designers, has garnered …

AnyTrans: Translate AnyText in the Image with Large Scale Models

Z Qian, P Zhang, B Yang, K Fan, Y Ma… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper introduces AnyTrans, an all-encompassing framework for the task-Translate
AnyText in the Image (TATI), which includes multilingual text translation and text fusion …

Cross-Domain Image Conversion by CycleDM

S Shimotsumagari, S Takezaki, D Haraguchi… - arXiv preprint arXiv …, 2024 - arxiv.org
The purpose of this paper is to enable the conversion between machine-printed character
images (ie, font images) and handwritten character images through machine learning. For …

WordArt Designer API: User-Driven Artistic Typography Synthesis with Large Language Models on ModelScope

JY He, ZQ Cheng, C Li, J Sun, W Xiang, Y Hu… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper introduces the WordArt Designer API, a novel framework for user-driven artistic
typography synthesis utilizing Large Language Models (LLMs) on ModelScope. We address …

Typographic Text Generation with Off-the-Shelf Diffusion Model

KT Peong, S Uchida, D Haraguchi - arXiv preprint arXiv:2402.14314, 2024 - arxiv.org
Recent diffusion-based generative models show promise in their ability to generate text
images, but limitations in specifying the styles of the generated texts render them insufficient …

[PDF][PDF] MetaDesigner: Advancing Artistic Typography through AI-Driven, User-Centric, and Multilingual WordArt Syn-thesis

K Wang - mercersec.org
MetaDesigner introduces a groundbreaking approach to the synthesis of artistic typography
by utilizing Large Language Models (LLMs). This system, designed to enhance user …