Virtual-reality interpromotion technology for metaverse: A survey

D Wu, Z Yang, P Zhang, R Wang… - IEEE Internet of Things …, 2023 - ieeexplore.ieee.org
The metaverse aims to build an immersive virtual reality world to support the daily life, work,
and recreation of people. In this survey, the status quo of the metaverse is investigated, and …

Learning generative vision transformer with energy-based latent space for saliency prediction

J Zhang, J Xie, N Barnes, P Li - Advances in Neural …, 2021 - proceedings.neurips.cc
Vision transformer networks have shown superiority in many computer vision tasks. In this
paper, we take a step further by proposing a novel generative vision transformer with latent …

Blind image super-resolution with elaborate degradation modeling on noise and kernel

Z Yue, Q Zhao, J Xie, L Zhang… - Proceedings of the …, 2022 - openaccess.thecvf.com
While researches on model-based blind single image super-resolution (SISR) have
achieved tremendous successes recently, most of them do not consider the image …

Semi-supervised video deraining with dynamical rain generator

Z Yue, J Xie, Q Zhao, D Meng - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
While deep learning (DL)-based video deraining methods have achieved significant
success recently, they still exist two major drawbacks. Firstly, most of them do not sufficiently …

Learning feature-to-feature translator by alternating back-propagation for generative zero-shot learning

Y Zhu, J Xie, B Liu, A Elgammal - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
We investigate learning feature-to-feature translator networks by alternating back-
propagation as a general-purpose solution to zero-shot learning (ZSL) problems. It is a …

Videoflow: A conditional flow-based model for stochastic video generation

M Kumar, M Babaeizadeh, D Erhan, C Finn… - arXiv preprint arXiv …, 2019 - arxiv.org
Generative models that can model and predict sequences of future events can, in principle,
learn to capture complex real-world phenomena, such as physical interactions. However, a …

Stochastic image-to-video synthesis using cinns

M Dorkenwald, T Milbich, A Blattmann… - Proceedings of the …, 2021 - openaccess.thecvf.com
Video understanding calls for a model to learn the characteristic interplay between static
scene content and its dynamics: Given an image, the model must be able to predict a future …

Learning noise-aware encoder-decoder from noisy labels by alternating back-propagation for saliency detection

J Zhang, J Xie, N Barnes - Computer Vision–ECCV 2020: 16th European …, 2020 - Springer
In this paper, we propose a noise-aware encoder-decoder framework to disentangle a clean
saliency predictor from noisy training examples, where the noisy labels are generated by …

Learning multi-layer latent variable model via variational optimization of short run mcmc for approximate inference

E Nijkamp, B Pang, T Han, L Zhou, SC Zhu… - Computer Vision–ECCV …, 2020 - Springer
This paper studies the fundamental problem of learning deep generative models that consist
of multiple layers of latent variables organized in top-down architectures. Such models have …

Deep generative model with hierarchical latent factors for time series anomaly detection

CI Challu, P Jiang, YN Wu… - … Conference on Artificial …, 2022 - proceedings.mlr.press
Multivariate time series anomaly detection has become an active area of research in recent
years, with Deep Learning models outperforming previous approaches on benchmark …