Real-time user-guided image colorization with learned deep priors

J Gui, T Chen, J Zhang, Q Cao, Z Sun… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Deep supervised learning algorithms typically require a large volume of labeled data to
achieve satisfactory performance. However, the process of collecting and labeling such data …

被引用次数：40 相关文章所有 3 个版本

[HTML] springer.com Full View

[HTML][HTML] Transformers in computational visual media: A survey

Y Xu, H Wei, M Lin, Y Deng, K Sheng, M Zhang… - Computational Visual …, 2022 - Springer

Transformers, the dominant architecture for natural language processing, have also recently
attracted much attention from computational visual media researchers due to their capacity …

被引用次数：106 相关文章所有 6 个版本

[PDF] neurips.cc

Openagi: When llm meets domain experts

Y Ge, W Hua, K Mei, J Tan, S Xu… - Advances in Neural …, 2024 - proceedings.neurips.cc

Human Intelligence (HI) excels at combining basic skills to solve complex tasks. This
capability is vital for Artificial Intelligence (AI) and should be embedded in comprehensive AI …

被引用次数：110 相关文章所有 9 个版本

[PDF] arxiv.org

Sdedit: Guided image synthesis and editing with stochastic differential equations

C Meng, Y He, Y Song, J Song, J Wu, JY Zhu… - arXiv preprint arXiv …, 2021 - arxiv.org

Guided image synthesis enables everyday users to create and edit photo-realistic images
with minimum effort. The key challenge is balancing faithfulness to the user input (eg, hand …

被引用次数：918 相关文章所有 3 个版本

[PDF] thecvf.com

Expressive text-to-image generation with rich text

S Ge, T Park, JY Zhu, JB Huang - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Plain text has become a prevalent interface for text-to-image synthesis. However, its limited
customization options hinder users from accurately describing desired outputs. For example …

被引用次数：42 相关文章所有 6 个版本

[PDF] arxiv.org

Image-to-image translation: Methods and applications

Y Pang, J Lin, T Qin, Z Chen - IEEE Transactions on Multimedia, 2021 - ieeexplore.ieee.org

Image-to-image translation (I2I) aims to transfer images from a source domain to a target
domain while preserving the content representations. I2I has drawn increasing attention and …

被引用次数：274 相关文章所有 4 个版本

[PDF] thecvf.com

Editing conditional radiance fields

S Liu, X Zhang, Z Zhang, R Zhang… - Proceedings of the …, 2021 - openaccess.thecvf.com

A neural radiance field (NeRF) is a scene model supporting high-quality view synthesis,
optimized per scene. In this paper, we explore enabling user editing of a category-level …

被引用次数：210 相关文章所有 6 个版本

[PDF] neurips.cc Find it @ Konkuk Glocal

Swapping autoencoder for deep image manipulation

T Park, JY Zhu, O Wang, J Lu… - Advances in …, 2020 - proceedings.neurips.cc

Deep generative models have become increasingly effective at producing realistic images
from randomly sampled seeds, but using such models for controllable manipulation of …

被引用次数：334 相关文章所有 8 个版本

[PDF] arxiv.org

Self-supervised visual feature learning with deep neural networks: A survey

L Jing, Y Tian - IEEE transactions on pattern analysis and …, 2020 - ieeexplore.ieee.org

Large-scale labeled data are generally required to train deep neural networks in order to
obtain better performance in visual feature learning from images or videos for computer …

被引用次数：1932 相关文章所有 7 个版本

[PDF] arxiv.org

U-gat-it: Unsupervised generative attentional networks with adaptive layer-instance normalization for image-to-image translation

J Kim, M Kim, H Kang, K Lee - arXiv preprint arXiv:1907.10830, 2019 - arxiv.org

We propose a novel method for unsupervised image-to-image translation, which
incorporates a new attention module and a new learnable normalization function in an end …

被引用次数：636 相关文章所有 4 个版本