Large-scale text-to-image generation models for visual artists’ creative works

From Google Gemini to OpenAI Q* (Q-Star): A survey of reshaping the generative artificial intelligence (AI) research landscape

TR McIntosh, T Susnjak, T Liu, P Watters… - arXiv preprint arXiv …, 2023 - arxiv.org

This comprehensive survey explored the evolving landscape of generative Artificial
Intelligence (AI), with a specific focus on the transformative impacts of Mixture of Experts …

Cited by 93 · Related articles · All 3 versions

NeuralLift-360: Lifting an in-the-wild 2D photo to a 3D object with 360° views

D Xu, Y Jiang, P Wang, Z Fan… - Proceedings of the …, 2023 - openaccess.thecvf.com

Virtual reality and augmented reality (XR) bring increasing demand for 3D content
generation. However, creating high-quality 3D content requires tedious work from a human …

Cited by 111 · Related articles · All 5 versions

A survey on multimodal large language models for autonomous driving

C Cui, Y Ma, X Cao, W Ye, Y Zhou… - Proceedings of the …, 2024 - openaccess.thecvf.com

With the emergence of Large Language Models (LLMs) and Vision Foundation Models
(VFMs), multimodal AI systems benefiting from large models have the potential to equally …

Cited by 176 · Related articles · All 7 versions

Promptify: Text-to-image generation through interactive prompt exploration with large language models

S Brade, B Wang, M Sousa, S Oore… - Proceedings of the 36th …, 2023 - dl.acm.org

Text-to-image generative models have demonstrated remarkable capabilities in generating
high-quality images based on textual prompts. However, crafting prompts that accurately …

Cited by 72 · Related articles · All 4 versions

“An Adapt-or-Die Type of Situation”: Perception, Adoption, and Use of Text-to-Image-Generation AI by Game Industry Professionals

V Vimpari, A Kultima, P Hämäläinen… - Proceedings of the …, 2023 - dl.acm.org

Text-to-image generation (TTIG) models, a recent addition to creative AI, can generate
images based on a text description. These models have begun to rival the work of …

Cited by 38 · Related articles · All 7 versions

Navigating text-to-image customization: From LyCORIS fine-tuning to model evaluation

SY Yeh, YG Hsieh, Z Gao, BBW Yang… - The Twelfth …, 2023 - openreview.net

Text-to-image generative models have garnered immense attention for their ability to
produce high-fidelity images from text prompts. Among these, Stable Diffusion distinguishes …

Cited by 25 · Related articles · All 3 versions

GenAssist: Making image generation accessible

M Huh, YH Peng, A Pavel - Proceedings of the 36th Annual ACM …, 2023 - dl.acm.org

Blind and low vision (BLV) creators use images to communicate with sighted audiences.
However, creating or retrieving images is challenging for BLV creators as it is difficult to use …

Cited by 28 · Related articles · All 3 versions

Situating the social issues of image generation models in the model life cycle: a sociotechnical approach

A Katirai, N Garcia, K Ide, Y Nakashima, A Kishimoto - AI and Ethics, 2024 - Springer

The race to develop image generation models is intensifying, with a rapid increase in the
number of text-to-image models available. This is coupled with growing public awareness of …

Cited by 10 · Related articles · All 2 versions

CAM: A large language model-based creative analogy mining framework

B Bhavya, J Xiong, C Zhai - Proceedings of the ACM Web Conference …, 2023 - dl.acm.org

Analogies inspire creative solutions to problems, and facilitate the creative expression of
ideas and the explanation of complex concepts. They have widespread applications in …

Cited by 15 · Related articles · All 4 versions

CreativeConnect: Supporting Reference Recombination for Graphic Design Ideation with Generative AI

DE Choi, S Hong, J Park, JJY Chung… - Proceedings of the CHI …, 2024 - dl.acm.org

Graphic designers often get inspiration through the recombination of references. Our
formative study (N= 6) reveals that graphic designers focus on conceptual keywords during …

Cited by 16 · Related articles · All 4 versions