Deep learning modelling techniques: current progress, applications, advantages, and challenges

SF Ahmed, MSB Alam, M Hassan, MR Rozbu… - Artificial Intelligence …, 2023 - Springer
Deep learning (DL) is revolutionizing evidence-based decision-making techniques that can
be applied across various sectors. Specifically, it possesses the ability to utilize two or more …

An overview of deep-learning-based audio-visual speech enhancement and separation

D Michelsanti, ZH Tan, SX Zhang, Y Xu… - … on Audio, Speech …, 2021 - ieeexplore.ieee.org
Speech enhancement and speech separation are two related tasks, whose purpose is to
extract either one or more target speech signals, respectively, from a mixture of sounds …

Image segmentation using text and image prompts

T Lüddecke, A Ecker - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com
Image segmentation is usually addressed by training a model for a fixed set of object
classes. Incorporating additional classes or more complex queries later is expensive as it …

Neural fields in visual computing and beyond

Y Xie, T Takikawa, S Saito, O Litany… - Computer Graphics …, 2022 - Wiley Online Library
Recent advances in machine learning have led to increased interest in solving visual
computing problems using methods that employ coordinate‐based neural networks. These …

Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators

L Lu, P Jin, G Pang, Z Zhang… - Nature machine …, 2021 - nature.com
It is widely known that neural networks (NNs) are universal approximators of continuous
functions. However, a less known but powerful result is that a NN with a single hidden layer …

pi-gan: Periodic implicit generative adversarial networks for 3d-aware image synthesis

ER Chan, M Monteiro, P Kellnhofer… - Proceedings of the …, 2021 - openaccess.thecvf.com
We have witnessed rapid progress on 3D-aware image synthesis, leveraging recent
advances in generative visual models and neural rendering. Existing approaches however …

Tryondiffusion: A tale of two unets

L Zhu, D Yang, T Zhu, F Reda… - Proceedings of the …, 2023 - openaccess.thecvf.com
Given two images depicting a person and a garment worn by another person, our goal is to
generate a visualization of how the garment might look on the input person. A key challenge …

[HTML][HTML] A gentle introduction to graph neural networks

B Sanchez-Lengeling, E Reif, A Pearce, AB Wiltschko - Distill, 2021 - staging.distill.pub
A Gentle Introduction to Graph Neural Networks Distill About Prize Submit A Gentle Introduction
to Graph Neural Networks Neural networks have been adapted to leverage the structure and …

Viewset diffusion:(0-) image-conditioned 3d generative models from 2d data

S Szymanowicz, C Rupprecht… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract We present Viewset Diffusion, a diffusion-based generator that outputs 3D objects
while only using multi-view 2D data for supervision. We note that there exists a one-to-one …

Analyzing and improving the image quality of stylegan

T Karras, S Laine, M Aittala, J Hellsten… - Proceedings of the …, 2020 - openaccess.thecvf.com
The style-based GAN architecture (StyleGAN) yields state-of-the-art results in data-driven
unconditional generative image modeling. We expose and analyze several of its …