LLMs Meet Multimodal Generation and Editing: A Survey
With the recent advancement in large language models (LLMs), there is a growing interest in
combining LLMs with multimodal learning. Previous surveys of multimodal large language …
combining LLMs with multimodal learning. Previous surveys of multimodal large language …
FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation
Recently, the application of modern diffusion-based text-to-image generation models for
creating artistic fonts, traditionally the domain of professional designers, has garnered …
creating artistic fonts, traditionally the domain of professional designers, has garnered …
AnyTrans: Translate AnyText in the Image with Large Scale Models
This paper introduces AnyTrans, an all-encompassing framework for the task-Translate
AnyText in the Image (TATI), which includes multilingual text translation and text fusion …
AnyText in the Image (TATI), which includes multilingual text translation and text fusion …
Cross-Domain Image Conversion by CycleDM
S Shimotsumagari, S Takezaki, D Haraguchi… - arXiv preprint arXiv …, 2024 - arxiv.org
The purpose of this paper is to enable the conversion between machine-printed character
images (ie, font images) and handwritten character images through machine learning. For …
images (ie, font images) and handwritten character images through machine learning. For …
WordArt Designer API: User-Driven Artistic Typography Synthesis with Large Language Models on ModelScope
This paper introduces the WordArt Designer API, a novel framework for user-driven artistic
typography synthesis utilizing Large Language Models (LLMs) on ModelScope. We address …
typography synthesis utilizing Large Language Models (LLMs) on ModelScope. We address …
Typographic Text Generation with Off-the-Shelf Diffusion Model
KT Peong, S Uchida, D Haraguchi - arXiv preprint arXiv:2402.14314, 2024 - arxiv.org
Recent diffusion-based generative models show promise in their ability to generate text
images, but limitations in specifying the styles of the generated texts render them insufficient …
images, but limitations in specifying the styles of the generated texts render them insufficient …
[PDF][PDF] MetaDesigner: Advancing Artistic Typography through AI-Driven, User-Centric, and Multilingual WordArt Syn-thesis
K Wang - mercersec.org
MetaDesigner introduces a groundbreaking approach to the synthesis of artistic typography
by utilizing Large Language Models (LLMs). This system, designed to enhance user …
by utilizing Large Language Models (LLMs). This system, designed to enhance user …