OpenNMT: Open-source toolkit for neural machine translation

G Klein, Y Kim, Y Deng, J Senellart… - arXiv preprint arXiv …, 2017 - arxiv.org
We describe an open-source toolkit for neural machine translation (NMT). The toolkit
prioritizes efficiency, modularity, and extensibility with the goal of supporting NMT research …

Recurrent continuous translation models

N Kalchbrenner, P Blunsom - … of the 2013 conference on empirical …, 2013 - aclanthology.org
We introduce a class of probabilistic continuous translation models called Recurrent
Continuous Translation Models that are purely based on continuous representations for …

Findings of the 2014 workshop on statistical machine translation

O Bojar, C Buck, C Federmann, B Haddow… - Proceedings of the …, 2014 - aclanthology.org
This paper presents the results of the WMT14 shared tasks, which included a standard news
translation task, a separate medical translation task, a task for run-time estimation of …

Improving vector space word representations using multilingual correlation

M Faruqui, C Dyer - Proceedings of the 14th Conference of the …, 2014 - aclanthology.org
The distributional hypothesis of Harris (1954), according to which the meaning of words is
evidenced by the contexts they occur in, has motivated several effective techniques for …

Gaussian LDA for topic models with word embeddings

R Das, M Zaheer, C Dyer - … of the 53rd Annual Meeting of the …, 2015 - aclanthology.org
Continuous space word embeddings learned from large, unstructured corpora have been
shown to be effective at capturing semantic regularities in language. In this paper we …

Grid long short-term memory

N Kalchbrenner, I Danihelka, A Graves - arXiv preprint arXiv:1507.01526, 2015 - arxiv.org
This paper introduces Grid Long Short-Term Memory, a network of LSTM cells arranged in a
multidimensional grid that can be applied to vectors, sequences or higher dimensional data …

KenLM: Faster and smaller language model queries

K Heafield - Proceedings of the sixth workshop on statistical …, 2011 - aclanthology.org
We present KenLM, a library that implements two data structures for efficient language
model queries, reducing both time and memory costs. The PROBING data structure uses …

Addressing the rare word problem in neural machine translation

MT Luong, I Sutskever, QV Le, O Vinyals… - arXiv preprint arXiv …, 2014 - arxiv.org
Neural Machine Translation (NMT) is a new approach to machine translation that has shown
promising results that are comparable to traditional approaches. A significant weakness in …

A shared task on multimodal machine translation and crosslingual image description

L Specia, S Frank, K Sima'an… - First Conference on …, 2016 - research.ed.ac.uk
This paper introduces and summarises the findings of a new shared task at the intersection
of Natural Language Processing and Computer Vision: the generation of image descriptions …

Multilingual models for compositional distributed semantics

KM Hermann, P Blunsom - arXiv preprint arXiv:1404.4641, 2014 - arxiv.org
We present a novel technique for learning semantic representations, which extends the
distributional hypothesis to multilingual data and joint-space embeddings. Our models …