Statistical machine translation
A Lopez - ACM Computing Surveys (CSUR), 2008 - dl.acm.org
Statistical machine translation (SMT) treats the translation of natural language as a machine
learning problem. By examining many samples of human-produced translation, SMT …
learning problem. By examining many samples of human-produced translation, SMT …
[PDF][PDF] Low resource dependency parsing: Cross-lingual parameter sharing in a neural network parser
Training a high-accuracy dependency parser requires a large treebank. However, these are
costly and time-consuming to build. We propose a learning method that needs less data …
costly and time-consuming to build. We propose a learning method that needs less data …
[PDF][PDF] JW300: A wide-coverage parallel corpus for low-resource languages
Viable cross-lingual transfer critically depends on the availability of parallel texts. Shortage
of such resources imposes a development and evaluation bottleneck in multilingual …
of such resources imposes a development and evaluation bottleneck in multilingual …
[PDF][PDF] Improving vector space word representations using multilingual correlation
The distributional hypothesis of Harris (1954), according to which the meaning of words is
evidenced by the contexts they occur in, has motivated several effective techniques for …
evidenced by the contexts they occur in, has motivated several effective techniques for …
Massively multilingual transfer for NER
In cross-lingual transfer, NLP models over one or more source languages are applied to a
low-resource target language. While most prior work has used a single source model or a …
low-resource target language. While most prior work has used a single source model or a …
[PDF][PDF] Universal dependency annotation for multilingual parsing
R McDonald, J Nivre… - Proceedings of the …, 2013 - aclanthology.org
We present a new collection of treebanks with homogeneous syntactic dependency
annotation for six languages: German, English, Swedish, Spanish, French and Korean. To …
annotation for six languages: German, English, Swedish, Spanish, French and Korean. To …
Neural cross-lingual named entity recognition with minimal resources
For languages with no annotated resources, unsupervised transfer of natural language
processing models such as named-entity recognition (NER) from resource-rich languages …
processing models such as named-entity recognition (NER) from resource-rich languages …
Many languages, one parser
We train one multilingual model for dependency parsing and use it to parse sentences in
several languages. The parsing model uses (i) multilingual word clusters and …
several languages. The parsing model uses (i) multilingual word clusters and …
A survey of syntactic-semantic parsing based on constituent and dependency structures
MS Zhang - Science China Technological Sciences, 2020 - Springer
Syntactic and semantic parsing has been investigated for decades, which is one primary
topic in the natural language processing community. This article aims for a brief survey on …
topic in the natural language processing community. This article aims for a brief survey on …
[PDF][PDF] Posterior regularization for structured latent variable models
We present posterior regularization, a probabilistic framework for structured, weakly
supervised learning. Our framework efficiently incorporates indirect supervision via …
supervised learning. Our framework efficiently incorporates indirect supervision via …