[PDF][PDF] Target-side word segmentation strategies for neural machine translation
For efficiency considerations, state-of-theart neural machine translation (NMT) requires the
vocabulary to be restricted to a limited-size set of several thousand symbols. This is highly …
vocabulary to be restricted to a limited-size set of several thousand symbols. This is highly …
[PDF][PDF] Modeling inflection and word-formation in SMT
The current state-of-the-art in statistical machine translation (SMT) suffers from issues of
sparsity and inadequate modeling power when translating into morphologically rich …
sparsity and inadequate modeling power when translating into morphologically rich …
Adaptation of machine translation for multilingual information retrieval in the medical domain
Objective We investigate machine translation (MT) of user search queries in the context of
cross-lingual information retrieval (IR) in the medical domain. The main focus is on …
cross-lingual information retrieval (IR) in the medical domain. The main focus is on …
Analysis, preparation, and optimization of statistical sign language machine translation
Sign languages represent an interesting niche for statistical machine translation that is
typically hampered by the scarceness of suitable data, and most papers in this area apply …
typically hampered by the scarceness of suitable data, and most papers in this area apply …
Splitting compounds by semantic analogy
Compounding is a highly productive word-formation process in some languages that is often
problematic for natural language processing applications. In this paper, we investigate …
problematic for natural language processing applications. In this paper, we investigate …
[PDF][PDF] How to avoid burning ducks: Combining linguistic analysis and corpus statistics for German compound processing
F Fritzinger, A Fraser - Proceedings of the Joint Fifth Workshop on …, 2010 - aclanthology.org
Compound splitting is an important problem in many NLP applications which must be solved
in order to address issues of data sparsity. Previous work has shown that linguistic …
in order to address issues of data sparsity. Previous work has shown that linguistic …
The language of programming: On the vocabulary of names
N Amit, DG Feitelson - 2022 29th Asia-Pacific Software …, 2022 - ieeexplore.ieee.org
Most of the text in a computer program is composed of the names of variables and functions.
These names are selected by one developer, and need to be understood by others. This is …
These names are selected by one developer, and need to be understood by others. This is …
[PDF][PDF] Comparing different architectures of hybrid machine translation systems
G Thurmair - Proceedings of Machine Translation Summit XII …, 2009 - aclanthology.org
The contribution discusses variants of architectures of hybrid MT systems. The three main
types of architectures are: coupling of systems (serial or parallel), architecture adaptations …
types of architectures are: coupling of systems (serial or parallel), architecture adaptations …
Morphological processing of compounds for statistical machine translation
F Cap - 2014 - elib.uni-stuttgart.de
Machine Translation denotes the translation of a text written in one language into another
language performed by a computer program. In times of internet and globalisation, there has …
language performed by a computer program. In times of internet and globalisation, there has …
Machine translation in Indian languages: challenges and resolution
Abstract English to Indian language machine translation poses the challenge of structural
and morphological divergence. This paper describes English to Indian language statistical …
and morphological divergence. This paper describes English to Indian language statistical …