Deep variational canonical correlation analysis

A Mohamed, H Lee, L Borgholt… - IEEE Journal of …, 2022 - ieeexplore.ieee.org

Although supervised deep learning has revolutionized speech and audio processing, it has
necessitated the building of specialist models for individual tasks and application scenarios …

被引用次数：283 相关文章所有 10 个版本

[PDF] arxiv.org

A survey of multimodal deep generative models

M Suzuki, Y Matsuo - Advanced Robotics, 2022 - Taylor & Francis

Multimodal learning is a framework for building models that make predictions based on
different types of modalities. Important challenges in multimodal learning are the inference of …

被引用次数：74 相关文章所有 6 个版本

[PDF] arxiv.org

Trusted multi-view classification with dynamic evidential fusion

Z Han, C Zhang, H Fu, JT Zhou - IEEE transactions on pattern …, 2022 - ieeexplore.ieee.org

Existing multi-view classification algorithms focus on promoting accuracy by exploiting
different views, typically integrating them into common representations for follow-up tasks …

被引用次数：264 相关文章所有 9 个版本

[PDF] ieee.org

Deep multimodal representation learning: A survey

W Guo, J Wang, S Wang - Ieee Access, 2019 - ieeexplore.ieee.org

Multimodal representation learning, which aims to narrow the heterogeneity gap among
different modalities, plays an indispensable role in the utilization of ubiquitous multimodal …

被引用次数：442 相关文章所有 4 个版本

[PDF] neurips.cc

Variational mixture-of-experts autoencoders for multi-modal deep generative models

Y Shi, B Paige, P Torr - Advances in neural information …, 2019 - proceedings.neurips.cc

Learning generative models that span multiple data modalities, such as vision and
language, is often motivated by the desire to learn more useful, generalisable …

被引用次数：264 相关文章所有 11 个版本

[PDF] neurips.cc

Multimodal generative models for scalable weakly-supervised learning

M Wu, N Goodman - Advances in neural information …, 2018 - proceedings.neurips.cc

Multiple modalities often co-occur when describing natural phenomena. Learning a joint
representation of these modalities should yield deeper and more useful representations …

被引用次数：400 相关文章所有 7 个版本

[PDF] arxiv.org

Deep partial multi-view learning

C Zhang, Y Cui, Z Han, JT Zhou… - IEEE transactions on …, 2020 - ieeexplore.ieee.org

Although multi-view learning has made significant progress over the past few decades, it is
still challenging due to the difficulty in modeling complex correlations among different views …

被引用次数：194 相关文章所有 7 个版本

[PDF] github.io

Learning modality-specific and-agnostic representations for asynchronous multimodal language sequences

D Yang, H Kuang, S Huang, L Zhang - Proceedings of the 30th ACM …, 2022 - dl.acm.org

Understanding human behaviors and intents from videos is a challenging task. Video flows
usually involve time-series data from different modalities, such as natural language, facial …

被引用次数：48 相关文章所有 3 个版本

[HTML] cell.com Full View

[HTML][HTML] The human tumor atlas network: charting tumor transitions across space and time at single-cell resolution

O Rozenblatt-Rosen, A Regev, P Oberdoerffer, T Nawy… - Cell, 2020 - cell.com

Crucial transitions in cancer—including tumor initiation, local expansion, metastasis, and
therapeutic resistance—involve complex interactions between cells within the dynamic …

被引用次数：398 相关文章所有 32 个版本

[PDF] neurips.cc

Gaussian process prior variational autoencoders

FP Casale, A Dalca, L Saglietti… - Advances in neural …, 2018 - proceedings.neurips.cc

Variational autoencoders (VAE) are a powerful and widely-used class of models to learn
complex data distributions in an unsupervised fashion. One important limitation of VAEs is …

被引用次数：135 相关文章所有 6 个版本