Word translation without parallel data

A Conneau, G Lample, MA Ranzato, L Denoyer… - arXiv preprint arXiv …, 2017 - arxiv.org
State-of-the-art methods for learning cross-lingual word embeddings have relied on
bilingual dictionaries or parallel corpora. Recent studies showed that the need for parallel …

Word translation without parallel data

G Lample, A Conneau, MA Ranzato… - International …, 2018 - openreview.net
State-of-the-art methods for learning cross-lingual word embeddings have relied on
bilingual dictionaries or parallel corpora. Recent studies showed that the need for parallel …

Deep multilingual correlation for improved word embeddings

A Lu, W Wang, M Bansal, K Gimpel… - Proceedings of the 2015 …, 2015 - aclanthology.org
Word embeddings have been found useful for many NLP tasks, including part-of-speech
tagging, named entity recognition, and parsing. Adding multilingual context when learning …

Non-adversarial unsupervised word translation

Y Hoshen, L Wolf - arXiv preprint arXiv:1801.06126, 2018 - arxiv.org
Unsupervised word translation from non-parallel inter-lingual corpora has attracted much
research interest. Very recently, neural network methods trained with adversarial loss …

Combining bilingual and comparable corpora for low resource machine translation

A Irvine, C Callison-Burch - … of the eighth workshop on statistical …, 2013 - aclanthology.org
Statistical machine translation (SMT) performance suffers when models are trained on only
small amounts of parallel data. The learned models typically have both low accuracy …

A comprehensive analysis of bilingual lexicon induction

A Irvine, C Callison-Burch - Computational Linguistics, 2017 - direct.mit.edu
Bilingual lexicon induction is the task of inducing word translations from monolingual
corpora in two languages. In this article we present the most comprehensive analysis of …

A discriminative latent-variable model for bilingual lexicon induction

S Ruder, R Cotterell, Y Kementchedjhieva… - arXiv preprint arXiv …, 2018 - arxiv.org
We introduce a novel discriminative latent-variable model for the task of bilingual lexicon
induction. Our model combines the bipartite matching dictionary prior of Haghighi et …

Bilingual lexicon induction by learning to combine word-level and character-level representations

G Heyman, I Vulić, MF Moens - … of the 15th Conference of the …, 2017 - aclanthology.org
We study the problem of bilingual lexicon induction (BLI) in a setting where some translation
resources are available, but unknown translations are sought for certain, possibly domain …

Improving statistical machine translation with a multilingual paraphrase database

RM Seraj, M Siahbani, A Sarkar - Proceedings of the 2015 …, 2015 - aclanthology.org
The multilingual Paraphrase Database (PPDB) is a freely available automatically
created resource of paraphrases in multiple languages. In statistical machine translation …

End-to-end statistical machine translation with zero or small parallel texts

A Irvine, C Callison-Burch - Natural Language Engineering, 2016 - cambridge.org
We use bilingual lexicon induction techniques, which learn translations from monolingual
texts in two languages, to build an end-to-end statistical machine translation (SMT) system …