Recent advances and trends in multimodal deep learning: A review

J Summaira, X Li, AM Shoib, S Li, J Abdul - arXiv preprint arXiv …, 2021 - arxiv.org
Deep Learning has implemented a wide range of applications and has become increasingly
popular in recent years. The goal of multimodal deep learning is to create models that can …

Modality competition: What makes joint training of multi-modal network fail in deep learning?(provably)

Y Huang, J Lin, C Zhou, H Yang… - … conference on machine …, 2022 - proceedings.mlr.press
Despite the remarkable success of deep multi-modal learning in practice, it has not been
well-explained in theory. Recently, it has been observed that the best uni-modal network …

Cross-modal subspace learning for fine-grained sketch-based image retrieval

P Xu, Q Yin, Y Huang, YZ Song, Z Ma, L Wang, T Xiang… - Neurocomputing, 2018 - Elsevier
Sketch-based image retrieval (SBIR) is challenging due to the inherent domain-gap
between sketch and photo. Compared with pixel-perfect depictions of photos, sketches are …

Multimodal target detection by sparse coding: Application to paint loss detection in paintings

S Huang, B Cornelis, B Devolder… - … on Image Processing, 2020 - ieeexplore.ieee.org
Sparse representation based methods have demonstrated their superior performance in
target detection tasks compared to more traditional approaches such as matched subspace …

Semantic feature extraction based on subspace learning with temporal constraints for acoustic event recognition

Q Shi, J Han - Digital Signal Processing, 2021 - Elsevier
In acoustic event recognition (AER), it is important to extract semantic features. As two
crucial aspects of semantic features, the essential content and the temporal structure can …

Multimodal sparse Bayesian dictionary learning applied to multimodal data classification

I Fedorov, BD Rao, TQ Nguyen - 2017 IEEE International …, 2017 - ieeexplore.ieee.org
In this paper, we present a novel multimodal sparse dictionary learning algorithm based on
a hierarchical sparse Bayesian framework. The framework allows for enforcing joint sparsity …

Multimodal sparse bayesian dictionary learning

I Fedorov, BD Rao - arXiv preprint arXiv:1804.03740, 2018 - arxiv.org
This paper addresses the problem of learning dictionaries for multimodal datasets, ie
datasets collected from multiple data sources. We present an algorithm called multimodal …

[图书][B] Structured Learning with Scale Mixture Priors

I Fedorov - 2018 - search.proquest.com
Sparsity plays an essential role in a number of modern algorithms. This thesis examines
how we can incorporate additional structural information within the sparsity profile and …