Statistical machine translation

A Lopez - ACM Computing Surveys (CSUR), 2008 - dl.acm.org
Statistical machine translation (SMT) treats the translation of natural language as a machine
learning problem. By examining many samples of human-produced translation, SMT …

Low resource dependency parsing: Cross-lingual parameter sharing in a neural network parser

L Duong, T Cohn, S Bird, P Cook - … of the 53rd annual meeting of …, 2015 - aclanthology.org
Training a high-accuracy dependency parser requires a large treebank. However, these are
costly and time-consuming to build. We propose a learning method that needs less data …

JW300: A wide-coverage parallel corpus for low-resource languages

Ž Agić, I Vulić - 2019 - repository.cam.ac.uk
Viable cross-lingual transfer critically depends on the availability of parallel texts. Shortage
of such resources imposes a development and evaluation bottleneck in multilingual …

Improving vector space word representations using multilingual correlation

M Faruqui, C Dyer - Proceedings of the 14th Conference of the …, 2014 - aclanthology.org
The distributional hypothesis of Harris (1954), according to which the meaning of words is
evidenced by the contexts they occur in, has motivated several effective techniques for …

Massively multilingual transfer for NER

A Rahimi, Y Li, T Cohn - arXiv preprint arXiv:1902.00193, 2019 - arxiv.org
In cross-lingual transfer, NLP models over one or more source languages are applied to a
low-resource target language. While most prior work has used a single source model or a …

Universal dependency annotation for multilingual parsing

R McDonald, J Nivre… - Proceedings of the …, 2013 - aclanthology.org
We present a new collection of treebanks with homogeneous syntactic dependency
annotation for six languages: German, English, Swedish, Spanish, French and Korean. To …

Neural cross-lingual named entity recognition with minimal resources

J Xie, Z Yang, G Neubig, NA Smith… - arXiv preprint arXiv …, 2018 - arxiv.org
For languages with no annotated resources, unsupervised transfer of natural language
processing models such as named-entity recognition (NER) from resource-rich languages …

Many languages, one parser

W Ammar, G Mulcaire, M Ballesteros, C Dyer… - Transactions of the …, 2016 - direct.mit.edu
We train one multilingual model for dependency parsing and use it to parse sentences in
several languages. The parsing model uses (i) multilingual word clusters and …

A survey of syntactic-semantic parsing based on constituent and dependency structures

MS Zhang - Science China Technological Sciences, 2020 - Springer
Syntactic and semantic parsing has been investigated for decades, which is one primary
topic in the natural language processing community. This article aims for a brief survey on …

Posterior regularization for structured latent variable models

K Ganchev, J Graça, J Gillenwater, B Taskar - The Journal of Machine …, 2010 - jmlr.org
We present posterior regularization, a probabilistic framework for structured, weakly
supervised learning. Our framework efficiently incorporates indirect supervision via …