A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends

J Gui, T Chen, J Zhang, Q Cao, Z Sun… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Deep supervised learning algorithms typically require a large volume of labeled data to
achieve satisfactory performance. However, the process of collecting and labeling such data …

[HTML][HTML] Transformers in computational visual media: A survey

Y Xu, H Wei, M Lin, Y Deng, K Sheng, M Zhang… - Computational Visual …, 2022 - Springer
Transformers, the dominant architecture for natural language processing, have also recently
attracted much attention from computational visual media researchers due to their capacity …

Openagi: When llm meets domain experts

Y Ge, W Hua, K Mei, J Tan, S Xu… - Advances in Neural …, 2024 - proceedings.neurips.cc
Human Intelligence (HI) excels at combining basic skills to solve complex tasks. This
capability is vital for Artificial Intelligence (AI) and should be embedded in comprehensive AI …

Sdedit: Guided image synthesis and editing with stochastic differential equations

C Meng, Y He, Y Song, J Song, J Wu, JY Zhu… - arXiv preprint arXiv …, 2021 - arxiv.org
Guided image synthesis enables everyday users to create and edit photo-realistic images
with minimum effort. The key challenge is balancing faithfulness to the user input (eg, hand …

Expressive text-to-image generation with rich text

S Ge, T Park, JY Zhu, JB Huang - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Plain text has become a prevalent interface for text-to-image synthesis. However, its limited
customization options hinder users from accurately describing desired outputs. For example …

Image-to-image translation: Methods and applications

Y Pang, J Lin, T Qin, Z Chen - IEEE Transactions on Multimedia, 2021 - ieeexplore.ieee.org
Image-to-image translation (I2I) aims to transfer images from a source domain to a target
domain while preserving the content representations. I2I has drawn increasing attention and …

Editing conditional radiance fields

S Liu, X Zhang, Z Zhang, R Zhang… - Proceedings of the …, 2021 - openaccess.thecvf.com
A neural radiance field (NeRF) is a scene model supporting high-quality view synthesis,
optimized per scene. In this paper, we explore enabling user editing of a category-level …

Swapping autoencoder for deep image manipulation

T Park, JY Zhu, O Wang, J Lu… - Advances in …, 2020 - proceedings.neurips.cc
Deep generative models have become increasingly effective at producing realistic images
from randomly sampled seeds, but using such models for controllable manipulation of …

Self-supervised visual feature learning with deep neural networks: A survey

L Jing, Y Tian - IEEE transactions on pattern analysis and …, 2020 - ieeexplore.ieee.org
Large-scale labeled data are generally required to train deep neural networks in order to
obtain better performance in visual feature learning from images or videos for computer …

U-gat-it: Unsupervised generative attentional networks with adaptive layer-instance normalization for image-to-image translation

J Kim, M Kim, H Kang, K Lee - arXiv preprint arXiv:1907.10830, 2019 - arxiv.org
We propose a novel method for unsupervised image-to-image translation, which
incorporates a new attention module and a new learnable normalization function in an end …