From google gemini to openai q*(q-star): A survey of reshaping the generative artificial intelligence (ai) research landscape

TR McIntosh, T Susnjak, T Liu, P Watters… - arXiv preprint arXiv …, 2023 - arxiv.org
This comprehensive survey explored the evolving landscape of generative Artificial
Intelligence (AI), with a specific focus on the transformative impacts of Mixture of Experts …

Neurallift-360: Lifting an in-the-wild 2d photo to a 3d object with 360deg views

D Xu, Y Jiang, P Wang, Z Fan… - Proceedings of the …, 2023 - openaccess.thecvf.com
Virtual reality and augmented reality (XR) bring increasing demand for 3D content
generation. However, creating high-quality 3D content requires tedious work from a human …

A survey on multimodal large language models for autonomous driving

C Cui, Y Ma, X Cao, W Ye, Y Zhou… - Proceedings of the …, 2024 - openaccess.thecvf.com
With the emergence of Large Language Models (LLMs) and Vision Foundation Models
(VFMs), multimodal AI systems benefiting from large models have the potential to equally …

Promptify: Text-to-image generation through interactive prompt exploration with large language models

S Brade, B Wang, M Sousa, S Oore… - Proceedings of the 36th …, 2023 - dl.acm.org
Text-to-image generative models have demonstrated remarkable capabilities in generating
high-quality images based on textual prompts. However, crafting prompts that accurately …

“An Adapt-or-Die Type of Situation”: Perception, Adoption, and Use of Text-to-Image-Generation AI by Game Industry Professionals

V Vimpari, A Kultima, P Hämäläinen… - Proceedings of the …, 2023 - dl.acm.org
Text-to-image generation (TTIG) models, a recent addition to creative AI, can generate
images based on a text description. These models have begun to rival the work of …

Navigating text-to-image customization: From lycoris fine-tuning to model evaluation

SY Yeh, YG Hsieh, Z Gao, BBW Yang… - The Twelfth …, 2023 - openreview.net
Text-to-image generative models have garnered immense attention for their ability to
produce high-fidelity images from text prompts. Among these, Stable Diffusion distinguishes …

GenAssist: Making image generation accessible

M Huh, YH Peng, A Pavel - Proceedings of the 36th Annual ACM …, 2023 - dl.acm.org
Blind and low vision (BLV) creators use images to communicate with sighted audiences.
However, creating or retrieving images is challenging for BLV creators as it is difficult to use …

Situating the social issues of image generation models in the model life cycle: a sociotechnical approach

A Katirai, N Garcia, K Ide, Y Nakashima, A Kishimoto - AI and Ethics, 2024 - Springer
The race to develop image generation models is intensifying, with a rapid increase in the
number of text-to-image models available. This is coupled with growing public awareness of …

Cam: A large language model-based creative analogy mining framework

B Bhavya, J Xiong, C Zhai - Proceedings of the ACM Web Conference …, 2023 - dl.acm.org
Analogies inspire creative solutions to problems, and facilitate the creative expression of
ideas and the explanation of complex concepts. They have widespread applications in …

CreativeConnect: Supporting Reference Recombination for Graphic Design Ideation with Generative AI

DE Choi, S Hong, J Park, JJY Chung… - Proceedings of the CHI …, 2024 - dl.acm.org
Graphic designers often get inspiration through the recombination of references. Our
formative study (N= 6) reveals that graphic designers focus on conceptual keywords during …