Opennmt: Open-source toolkit for neural machine translation
We describe an open-source toolkit for neural machine translation (NMT). The toolkit
prioritizes efficiency, modularity, and extensibility with the goal of supporting NMT research …
prioritizes efficiency, modularity, and extensibility with the goal of supporting NMT research …
[PDF][PDF] Recurrent continuous translation models
N Kalchbrenner, P Blunsom - … of the 2013 conference on empirical …, 2013 - aclanthology.org
We introduce a class of probabilistic continuous translation models called Recurrent
Continuous Translation Models that are purely based on continuous representations for …
Continuous Translation Models that are purely based on continuous representations for …
[PDF][PDF] Findings of the 2014 workshop on statistical machine translation
This paper presents the results of the WMT14 shared tasks, which included a standard news
translation task, a separate medical translation task, a task for run-time estimation of …
translation task, a separate medical translation task, a task for run-time estimation of …
[PDF][PDF] Improving vector space word representations using multilingual correlation
The distributional hypothesis of Harris (1954), according to which the meaning of words is
evidenced by the contexts they occur in, has motivated several effective techniques for …
evidenced by the contexts they occur in, has motivated several effective techniques for …
[PDF][PDF] Gaussian LDA for topic models with word embeddings
Continuous space word embeddings learned from large, unstructured corpora have been
shown to be effective at capturing semantic regularities in language. In this paper we …
shown to be effective at capturing semantic regularities in language. In this paper we …
Grid long short-term memory
This paper introduces Grid Long Short-Term Memory, a network of LSTM cells arranged in a
multidimensional grid that can be applied to vectors, sequences or higher dimensional data …
multidimensional grid that can be applied to vectors, sequences or higher dimensional data …
[PDF][PDF] KenLM: Faster and smaller language model queries
K Heafield - Proceedings of the sixth workshop on statistical …, 2011 - aclanthology.org
We present KenLM, a library that implements two data structures for efficient language
model queries, reducing both time and memory costs. The PROBING data structure uses …
model queries, reducing both time and memory costs. The PROBING data structure uses …
Addressing the rare word problem in neural machine translation
Neural Machine Translation (NMT) is a new approach to machine translation that has shown
promising results that are comparable to traditional approaches. A significant weakness in …
promising results that are comparable to traditional approaches. A significant weakness in …
A shared task on multimodal machine translation and crosslingual image description
L Specia, S Frank, K Sima'An… - First Conference on …, 2016 - research.ed.ac.uk
This paper introduces and summarises the findings of a new shared task at the intersection
of Natural Language Processing and Computer Vision: the generation of image descriptions …
of Natural Language Processing and Computer Vision: the generation of image descriptions …
Multilingual models for compositional distributed semantics
KM Hermann, P Blunsom - arXiv preprint arXiv:1404.4641, 2014 - arxiv.org
We present a novel technique for learning semantic representations, which extends the
distributional hypothesis to multilingual data and joint-space embeddings. Our models …
distributional hypothesis to multilingual data and joint-space embeddings. Our models …