From Google Gemini to OpenAI Q* (Q-Star): A survey of reshaping the generative artificial intelligence (AI) research landscape
This comprehensive survey explored the evolving landscape of generative Artificial
Intelligence (AI), with a specific focus on the transformative impacts of Mixture of Experts …
NeuralLift-360: Lifting an in-the-wild 2D photo to a 3D object with 360° views
Virtual reality and augmented reality (XR) bring increasing demand for 3D content
generation. However, creating high-quality 3D content requires tedious work from a human …
A survey on multimodal large language models for autonomous driving
With the emergence of Large Language Models (LLMs) and Vision Foundation Models
(VFMs), multimodal AI systems benefiting from large models have the potential to equally …
Promptify: Text-to-image generation through interactive prompt exploration with large language models
Text-to-image generative models have demonstrated remarkable capabilities in generating
high-quality images based on textual prompts. However, crafting prompts that accurately …
“An Adapt-or-Die Type of Situation”: Perception, Adoption, and Use of Text-to-Image-Generation AI by Game Industry Professionals
V Vimpari, A Kultima, P Hämäläinen… - Proceedings of the …, 2023 - dl.acm.org
Text-to-image generation (TTIG) models, a recent addition to creative AI, can generate
images based on a text description. These models have begun to rival the work of …
Navigating text-to-image customization: From LyCORIS fine-tuning to model evaluation
Text-to-image generative models have garnered immense attention for their ability to
produce high-fidelity images from text prompts. Among these, Stable Diffusion distinguishes …
GenAssist: Making image generation accessible
Blind and low vision (BLV) creators use images to communicate with sighted audiences.
However, creating or retrieving images is challenging for BLV creators as it is difficult to use …
Situating the social issues of image generation models in the model life cycle: a sociotechnical approach
The race to develop image generation models is intensifying, with a rapid increase in the
number of text-to-image models available. This is coupled with growing public awareness of …
CAM: A large language model-based creative analogy mining framework
Analogies inspire creative solutions to problems, and facilitate the creative expression of
ideas and the explanation of complex concepts. They have widespread applications in …
CreativeConnect: Supporting Reference Recombination for Graphic Design Ideation with Generative AI
Graphic designers often get inspiration through the recombination of references. Our
formative study (N = 6) reveals that graphic designers focus on conceptual keywords during …