Probabilistic topic modeling in multilingual settings: An overview of its methodology and applications

I Vulić, W De Smet, J Tang, MF Moens - Information Processing & …, 2015 - Elsevier
Probabilistic topic models are unsupervised generative models which model document
content as a two-step generation process, that is, documents are observed as mixtures of …

A survey of cross-lingual word embedding models

S Ruder, I Vulić, A Søgaard - Journal of Artificial Intelligence Research, 2019 - jair.org
Cross-lingual representations of words enable us to reason about word meaning in
multilingual contexts and are a key facilitator of cross-lingual transfer when developing …

Biomedical term extraction: overview and a new methodology

JA Lossio-Ventura, C Jonquet, M Roche… - Information Retrieval …, 2016 - Springer
Terminology extraction is an essential task in domain knowledge acquisition, as well as for
information retrieval. It is also a mandatory first step aimed at building/enriching …

[PDF][PDF] Bilingual word embeddings from non-parallel document-aligned data applied to bilingual lexicon induction

I Vulic, MF Moens - Proceedings of the 53rd Annual Meeting of …, 2015 - lirias.kuleuven.be
We propose a simple yet effective approach to learning bilingual word embeddings (BWEs)
from non-parallel document-aligned data (based on the omnipresent skip-gram model), and …

Bilingual distributed word representations from document-aligned comparable data

I Vulić, MF Moens - Journal of Artificial Intelligence Research, 2016 - jair.org
We propose a new model for learning bilingual word representations from nonparallel
document-aligned data. Following the recent advances in word representation learning, our …

[图书][B] Cross-lingual word embeddings

A Søgaard, I Vulić, S Ruder, M Faruq - 2019 - Springer
The majority of natural language processing (NLP) is English language processing, and
while there is good language technology support for (standard varieties of) English, support …

Bilingual term alignment from comparable corpora in english discharge summary and chinese discharge summary

Y Xu, L Chen, J Wei, S Ananiadou, Y Fan, Y Qian… - BMC …, 2015 - Springer
Background Electronic medical record (EMR) systems have become widely used throughout
the world to improve the quality of healthcare and the efficiency of hospital services. A …

A comprehensive analysis of bilingual lexicon induction

A Irvine, C Callison-Burch - Computational Linguistics, 2017 - direct.mit.edu
Bilingual lexicon induction is the task of inducing word translations from monolingual
corpora in two languages. In this article we present the most comprehensive analysis of …

[PDF][PDF] Cross-lingual semantic similarity of words as the similarity of their semantic word responses

I Vulic, MF Moens - Proceedings of the 2013 Conference of the …, 2013 - lirias.kuleuven.be
We propose a new approach to identifying semantically similar words across languages.
The approach is based on an idea that two words in different languages are similar if they …

[PDF][PDF] A study on bootstrapping bilingual vector spaces from non-parallel data (and nothing else)

I Vulić, MF Moens - Proceedings of the 2013 conference on …, 2013 - aclanthology.org
We present a new language pair agnostic approach to inducing bilingual vector spaces from
non-parallel data without any other resource in a bootstrapping fashion. The paper …