Statistical machine translation
A Lopez - ACM Computing Surveys (CSUR), 2008 - dl.acm.org
Statistical machine translation (SMT) treats the translation of natural language as a machine
learning problem. By examining many samples of human-produced translation, SMT …
learning problem. By examining many samples of human-produced translation, SMT …
Multilingual sentiment analysis: from formal to informal and scarce resource languages
The ability to analyse online user-generated content related to sentiments (eg, thoughts and
opinions) on products or policies has become a de-facto skillset for many companies and …
opinions) on products or policies has become a de-facto skillset for many companies and …
SimAlign: High quality word alignments without parallel training data using static and contextualized embeddings
Word alignments are useful for tasks like statistical and neural machine translation (NMT)
and cross-lingual annotation projection. Statistical word aligners perform well, as do …
and cross-lingual annotation projection. Statistical word aligners perform well, as do …
Attention is not only a weight: Analyzing transformers with vector norms
Attention is a key component of Transformers, which have recently achieved considerable
success in natural language processing. Hence, attention is being extensively studied to …
success in natural language processing. Hence, attention is being extensively studied to …
Multiword expression processing: A survey
Multiword expressions (MWEs) are a class of linguistic forms spanning conventional word
boundaries that are both idiosyncratic and pervasive across different languages. The …
boundaries that are both idiosyncratic and pervasive across different languages. The …
Jointly learning to align and translate with transformer models
The state of the art in machine translation (MT) is governed by neural approaches, which
typically provide superior translation accuracy over statistical approaches. However, on the …
typically provide superior translation accuracy over statistical approaches. However, on the …
Statistical phrase-based translation
We propose a new phrase-based translation model and decoding algorithm that enables us
to evaluate and compare several, previously proposed phrase-based translation models …
to evaluate and compare several, previously proposed phrase-based translation models …
Automatically constructing a corpus of sentential paraphrases
B Dolan, C Brockett - Third international workshop on paraphrasing …, 2005 - microsoft.com
An obstacle to research in automatic paraphrase identification and generation is the lack of
large-scale, publiclyavailable labeled corpora of sentential paraphrases. This paper …
large-scale, publiclyavailable labeled corpora of sentential paraphrases. This paper …
[PDF][PDF] Paraphrase-driven learning for open question answering
We study question answering as a machine learning problem, and induce a function that
maps open-domain questions to queries over a database of web extractions. Given a large …
maps open-domain questions to queries over a database of web extractions. Given a large …
Unsupervised construction of large paraphrase corpora: Exploiting massively parallel news sources
We investigate unsupervised techniques for acquiring monolingual sentence-level
paraphrases from a corpus of temporally and topically clustered news articles collected from …
paraphrases from a corpus of temporally and topically clustered news articles collected from …