Multimodal image synthesis and editing: A survey and taxonomy
As information exists in various modalities in real world, effective interaction and fusion
among multimodal information plays a key role for the creation and perception of multimodal …
among multimodal information plays a key role for the creation and perception of multimodal …
Adventures in data analysis: A systematic review of Deep Learning techniques for pattern recognition in cyber-physical-social systems
Abstract Machine Learning (ML) and Deep Learning (DL) have achieved high success in
many textual, auditory, medical imaging, and visual recognition patterns. Concerning the …
many textual, auditory, medical imaging, and visual recognition patterns. Concerning the …
Tryondiffusion: A tale of two unets
Given two images depicting a person and a garment worn by another person, our goal is to
generate a visualization of how the garment might look on the input person. A key challenge …
generate a visualization of how the garment might look on the input person. A key challenge …
Animatable neural radiance fields for modeling dynamic human bodies
This paper addresses the challenge of reconstructing an animatable human model from a
multi-view video. Some recent works have proposed to decompose a non-rigidly deforming …
multi-view video. Some recent works have proposed to decompose a non-rigidly deforming …
Text2human: Text-driven controllable human image generation
Generating high-quality and diverse human images is an important yet challenging task in
vision and graphics. However, existing generative models often fall short under the high …
vision and graphics. However, existing generative models often fall short under the high …
Gauhuman: Articulated gaussian splatting from monocular human videos
We present GauHuman a 3D human model with Gaussian Splatting for both fast training (1 2
minutes) and real-time rendering (up to 189 FPS) compared with existing NeRF-based …
minutes) and real-time rendering (up to 189 FPS) compared with existing NeRF-based …
Person image synthesis via denoising diffusion model
The pose-guided person image generation task requires synthesizing photorealistic images
of humans in arbitrary poses. The existing approaches use generative adversarial networks …
of humans in arbitrary poses. The existing approaches use generative adversarial networks …
Unbalanced feature transport for exemplar-based image translation
Despite the great success of GANs in images translation with different conditioned inputs
such as semantic segmentation and edge map, generating high-fidelity images with …
such as semantic segmentation and edge map, generating high-fidelity images with …
Human-art: A versatile human-centric dataset bridging natural and artificial scenes
Humans have long been recorded in a variety of forms since antiquity. For example,
sculptures and paintings were the primary media for depicting human beings before the …
sculptures and paintings were the primary media for depicting human beings before the …
Humansd: A native skeleton-guided diffusion model for human image generation
Controllable human image generation (HIG) has attracted significant attention from
academia and industry for its numerous real-life applications. State-of-the-art solutions, such …
academia and industry for its numerous real-life applications. State-of-the-art solutions, such …