Multimodal image synthesis and editing: A survey and taxonomy
As information exists in various modalities in real world, effective interaction and fusion
among multimodal information plays a key role for the creation and perception of multimodal …
among multimodal information plays a key role for the creation and perception of multimodal …
T2i-adapter: Learning adapters to dig out more controllable ability for text-to-image diffusion models
The incredible generative ability of large-scale text-to-image (T2I) models has demonstrated
strong power of learning complex structures and meaningful semantics. However, relying …
strong power of learning complex structures and meaningful semantics. However, relying …
Layoutllm-t2i: Eliciting layout guidance from llm for text-to-image generation
In the text-to-image generation field, recent remarkable progress in Stable Diffusion makes it
possible to generate rich kinds of novel photorealistic images. However, current models still …
possible to generate rich kinds of novel photorealistic images. However, current models still …
[HTML][HTML] Generative ai for visualization: State of the art and future directions
Generative AI (GenAI) has witnessed remarkable progress in recent years and
demonstrated impressive performance in various generation tasks in different domains such …
demonstrated impressive performance in various generation tasks in different domains such …
Panacea: Panoramic and controllable video generation for autonomous driving
The field of autonomous driving increasingly demands high-quality annotated training data.
In this paper we propose Panacea an innovative approach to generate panoramic and …
In this paper we propose Panacea an innovative approach to generate panoramic and …
A unified framework for guiding generative ai with wireless perception in resource constrained mobile edge networks
With the significant advancements in artificial intelligence (AI) technologies and
computational capabilities, generative AI (GAI) has become a pivotal digital content …
computational capabilities, generative AI (GAI) has become a pivotal digital content …
Bevcontrol: Accurately controlling street-view elements with multi-perspective consistency via bev sketch layout
Using synthesized images to boost the performance of perception models is a long-standing
research challenge in computer vision. It becomes more eminent in visual-centric …
research challenge in computer vision. It becomes more eminent in visual-centric …
FlowDiffuser: Advancing Optical Flow Estimation with Diffusion Models
Optical flow estimation a process of predicting pixel-wise displacement between consecutive
frames has commonly been approached as a regression task in the age of deep learning …
frames has commonly been approached as a regression task in the age of deep learning …
Shadow-Enlightened Image Outpainting
H Yu, R Li, S Xie, J Qiu - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Conventional image outpainting methods usually treat unobserved areas as unknown and
extend the scene only in terms of semantic consistency thus overlooking the hidden …
extend the scene only in terms of semantic consistency thus overlooking the hidden …
Data augmentation for object detection via controllable diffusion models
Data augmentation is vital for object detection tasks that require expensive bounding box
annotations. Recent successes in diffusion models have inspired the use of diffusion-based …
annotations. Recent successes in diffusion models have inspired the use of diffusion-based …