Probabilistic topic modeling in multilingual settings: An overview of its methodology and applications
Probabilistic topic models are unsupervised generative models which model document
content as a two-step generation process, that is, documents are observed as mixtures of …
content as a two-step generation process, that is, documents are observed as mixtures of …
A survey of cross-lingual word embedding models
Cross-lingual representations of words enable us to reason about word meaning in
multilingual contexts and are a key facilitator of cross-lingual transfer when developing …
multilingual contexts and are a key facilitator of cross-lingual transfer when developing …
Biomedical term extraction: overview and a new methodology
Terminology extraction is an essential task in domain knowledge acquisition, as well as for
information retrieval. It is also a mandatory first step aimed at building/enriching …
information retrieval. It is also a mandatory first step aimed at building/enriching …
[PDF][PDF] Bilingual word embeddings from non-parallel document-aligned data applied to bilingual lexicon induction
We propose a simple yet effective approach to learning bilingual word embeddings (BWEs)
from non-parallel document-aligned data (based on the omnipresent skip-gram model), and …
from non-parallel document-aligned data (based on the omnipresent skip-gram model), and …
Bilingual distributed word representations from document-aligned comparable data
We propose a new model for learning bilingual word representations from nonparallel
document-aligned data. Following the recent advances in word representation learning, our …
document-aligned data. Following the recent advances in word representation learning, our …
Bilingual term alignment from comparable corpora in english discharge summary and chinese discharge summary
Y Xu, L Chen, J Wei, S Ananiadou, Y Fan, Y Qian… - BMC …, 2015 - Springer
Background Electronic medical record (EMR) systems have become widely used throughout
the world to improve the quality of healthcare and the efficiency of hospital services. A …
the world to improve the quality of healthcare and the efficiency of hospital services. A …
A comprehensive analysis of bilingual lexicon induction
A Irvine, C Callison-Burch - Computational Linguistics, 2017 - direct.mit.edu
Bilingual lexicon induction is the task of inducing word translations from monolingual
corpora in two languages. In this article we present the most comprehensive analysis of …
corpora in two languages. In this article we present the most comprehensive analysis of …
[PDF][PDF] Cross-lingual semantic similarity of words as the similarity of their semantic word responses
We propose a new approach to identifying semantically similar words across languages.
The approach is based on an idea that two words in different languages are similar if they …
The approach is based on an idea that two words in different languages are similar if they …
[PDF][PDF] A study on bootstrapping bilingual vector spaces from non-parallel data (and nothing else)
We present a new language pair agnostic approach to inducing bilingual vector spaces from
non-parallel data without any other resource in a bootstrapping fashion. The paper …
non-parallel data without any other resource in a bootstrapping fashion. The paper …