Statistical machine translation

A Lopez - ACM Computing Surveys (CSUR), 2008 - dl.acm.org
Statistical machine translation (SMT) treats the translation of natural language as a machine
learning problem. By examining many samples of human-produced translation, SMT …

Multilingual sentiment analysis: from formal to informal and scarce resource languages

SL Lo, E Cambria, R Chiong, D Cornforth - Artificial Intelligence Review, 2017 - Springer
The ability to analyse online user-generated content related to sentiments (eg, thoughts and
opinions) on products or policies has become a de-facto skillset for many companies and …

SimAlign: High quality word alignments without parallel training data using static and contextualized embeddings

MJ Sabet, P Dufter, F Yvon, H Schütze - arXiv preprint arXiv:2004.08728, 2020 - arxiv.org
Word alignments are useful for tasks like statistical and neural machine translation (NMT)
and cross-lingual annotation projection. Statistical word aligners perform well, as do …

Attention is not only a weight: Analyzing transformers with vector norms

G Kobayashi, T Kuribayashi, S Yokoi, K Inui - arXiv preprint arXiv …, 2020 - arxiv.org
Attention is a key component of Transformers, which have recently achieved considerable
success in natural language processing. Hence, attention is being extensively studied to …

Multiword expression processing: A survey

M Constant, G Eryiğit, J Monti, L Van Der Plas… - Computational …, 2017 - direct.mit.edu
Multiword expressions (MWEs) are a class of linguistic forms spanning conventional word
boundaries that are both idiosyncratic and pervasive across different languages. The …

Jointly learning to align and translate with transformer models

S Garg, S Peitz, U Nallasamy, M Paulik - arXiv preprint arXiv:1909.02074, 2019 - arxiv.org
The state of the art in machine translation (MT) is governed by neural approaches, which
typically provide superior translation accuracy over statistical approaches. However, on the …

Statistical phrase-based translation

P Koehn, FJ Och, D Marcu - … of the North American Chapter of …, 2003 - research.ed.ac.uk
We propose a new phrase-based translation model and decoding algorithm that enables us
to evaluate and compare several, previously proposed phrase-based translation models …

Automatically constructing a corpus of sentential paraphrases

B Dolan, C Brockett - Third international workshop on paraphrasing …, 2005 - microsoft.com
An obstacle to research in automatic paraphrase identification and generation is the lack of
large-scale, publiclyavailable labeled corpora of sentential paraphrases. This paper …

[PDF][PDF] Paraphrase-driven learning for open question answering

A Fader, L Zettlemoyer, O Etzioni - … of the 51st Annual Meeting of …, 2013 - aclanthology.org
We study question answering as a machine learning problem, and induce a function that
maps open-domain questions to queries over a database of web extractions. Given a large …

Unsupervised construction of large paraphrase corpora: Exploiting massively parallel news sources

W Dolan, C Quirk, C Brockett, B Dolan - 2004 - microsoft.com
We investigate unsupervised techniques for acquiring monolingual sentence-level
paraphrases from a corpus of temporally and topically clustered news articles collected from …