Glyph-ByT5: A customized text encoder for accurate visual text rendering

Z Liu, W Liang, Z Liang, C Luo, J Li, G Huang… - … on Computer Vision, 2025 - Springer
Visual text rendering poses a fundamental challenge for contemporary text-to-image
generation models, with the core problem lying in text encoder deficiencies. To achieve …

Glyph-ByT5-v2: A strong aesthetic baseline for accurate multilingual visual text rendering

Z Liu, W Liang, Y Zhao, B Chen, L Liang… - arXiv preprint arXiv …, 2024 - arxiv.org
Recently, Glyph-ByT5 has achieved highly accurate visual text rendering performance in
graphic design images. However, it still focuses solely on English and performs relatively …

LLMs Meet Multimodal Generation and Editing: A Survey

Y He, Z Liu, J Chen, Z Tian, H Liu, X Chi, R Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
With the recent advancement in large language models (LLMs), there is a growing interest in
combining LLMs with multimodal learning. Previous surveys of multimodal large language …

Kinetic Typography Diffusion Model

S Park, I Bae, S Shin, HG Jeon - European Conference on Computer …, 2025 - Springer
This paper introduces a method for realistic kinetic typography that generates user-preferred
animatable “text content”. We draw on recent advances in guided video diffusion models to …

Graphic Design with Large Multimodal Model

Y Cheng, Z Zhang, M Yang, H Nie, C Li, X Wu… - arXiv preprint arXiv …, 2024 - arxiv.org
In the field of graphic design, automating the integration of design elements into a cohesive
multi-layered artwork not only boosts productivity but also paves the way for the …

Type-R: Automatically Retouching Typos for Text-to-Image Generation

W Shimoda, N Inoue, D Haraguchi, H Mitani… - arXiv preprint arXiv …, 2024 - arxiv.org
While recent text-to-image models can generate photorealistic images from text prompts that
reflect detailed instructions, they still face significant challenges in accurately rendering …

Design-o-meter: Towards Evaluating and Refining Graphic Designs

S Goyal, A Mahajan, S Mishra, P Udhayanan… - arXiv preprint arXiv …, 2024 - arxiv.org
Graphic designs are an effective medium for visual communication. They range from
greeting cards to corporate flyers and beyond. Of late, machine learning techniques are …

OpenCOLE: Towards Reproducible Automatic Graphic Design Generation

N Inoue, K Masui, W Shimoda, K Yamaguchi - arXiv preprint arXiv …, 2024 - arxiv.org
Automatic generation of graphic designs has recently received considerable attention.
However, the state-of-the-art approaches are complex and rely on proprietary datasets …

Can GPTs Evaluate Graphic Design Based on Design Principles?

D Haraguchi, N Inoue, W Shimoda, H Mitani… - SIGGRAPH Asia 2024 …, 2024 - dl.acm.org
Recent advancements in foundation models show promising capability in graphic design
generation. Several studies have started employing Large Multimodal Models (LMMs) to …

Graphic Design with Creative Coding: Using Abstract Art as Bidimensional Logic in Project Development

ML Bergamo, AL Silva, VHS Valentim - DAT Journal, 2024 - datjournal.emnuvens.com.br
This paper describes an approach to teaching creative coding to design students. It may
respond to a situation where 'apprehension of automation replacement' from artificial …