Combining language and vision with a multimodal skip-gram model

C Zhang, Z Yang, X He, L Deng - IEEE Journal of Selected …, 2020 - ieeexplore.ieee.org

Deep learning methods haverevolutionized speech recognition, image recognition, and
natural language processing since 2010. Each of these tasks involves a single modality in …

被引用次数：364 相关文章所有 3 个版本

[HTML] springer.com

[HTML][HTML] Semantic memory: A review of methods, models, and current challenges

AA Kumar - Psychonomic Bulletin & Review, 2021 - Springer

Adult semantic memory has been traditionally conceptualized as a relatively static memory
system that consists of knowledge about the world, concepts, and symbols. Considerable …

被引用次数：209 相关文章所有 8 个版本

[HTML] nih.gov

[HTML][HTML] Multimodal transformer for unaligned multimodal language sequences

YHH Tsai, S Bai, PP Liang, JZ Kolter… - Proceedings of the …, 2019 - ncbi.nlm.nih.gov

Human language is often multimodal, which comprehends a mixture of natural language,
facial gestures, and acoustic behaviors. However, two major challenges in modeling such …

被引用次数：1330 相关文章所有 8 个版本

[PDF] ieee.org

Deep multimodal representation learning: A survey

W Guo, J Wang, S Wang - Ieee Access, 2019 - ieeexplore.ieee.org

Multimodal representation learning, which aims to narrow the heterogeneity gap among
different modalities, plays an indispensable role in the utilization of ubiquitous multimodal …

被引用次数：453 相关文章所有 4 个版本

[PDF] arxiv.org

What artificial neural networks can tell us about human language acquisition

A Warstadt, SR Bowman - Algebraic structures in natural …, 2022 - taylorfrancis.com

Rapid progress in machine learning for natural language processing has the potential to
transform debates about how humans learn language. However, the learning environments …

被引用次数：93 相关文章所有 6 个版本

[PDF] aclanthology.org

The importance of modeling social factors of language: Theory and practice

D Hovy, D Yang - Proceedings of the 2021 Conference of the …, 2021 - aclanthology.org

Natural language processing (NLP) applications are now more powerful and ubiquitous
than ever before. With rapidly developing (neural) models and ever-more available data …

被引用次数：159 相关文章所有 8 个版本

[PDF] aaai.org

Found in translation: Learning robust joint representations by cyclic translations between modalities

H Pham, PP Liang, T Manzini, LP Morency… - Proceedings of the …, 2019 - ojs.aaai.org

Multimodal sentiment analysis is a core research area that studies speaker sentiment
expressed from the language, visual, and acoustic modalities. The central challenge in …

被引用次数：392 相关文章所有 15 个版本

[PDF] unimib.it

Vector-space models of semantic representation from a cognitive perspective: A discussion of common misconceptions

F Günther, L Rinaldi, M Marelli - … on Psychological Science, 2019 - journals.sagepub.com

Models that represent meaning as high-dimensional numerical vectors—such as latent
semantic analysis (LSA), hyperspace analogue to language (HAL), bound encoding of the …

被引用次数：289 相关文章所有 8 个版本

[PDF] jair.org

A survey of cross-lingual word embedding models

S Ruder, I Vulić, A Søgaard - Journal of Artificial Intelligence Research, 2019 - jair.org

Cross-lingual representations of words enable us to reason about word meaning in
multilingual contexts and are a key facilitator of cross-lingual transfer when developing …

被引用次数：739 相关文章所有 16 个版本

[PDF] mlr.press

Characterizing and overcoming the greedy nature of learning in multi-modal deep neural networks

N Wu, S Jastrzebski, K Cho… - … Conference on Machine …, 2022 - proceedings.mlr.press

We hypothesize that due to the greedy nature of learning in multi-modal deep neural
networks, these models tend to rely on just one modality while under-fitting the other …

被引用次数：75 相关文章所有 8 个版本