A primer on neural network models for natural language processing

Y Goldberg - Journal of Artificial Intelligence Research, 2016 - jair.org
Over the past few years, neural networks have re-emerged as powerful machine-learning
models, yielding state-of-the-art results in fields such as image recognition and speech …

A survey of deep learning techniques for neural machine translation

S Yang, Y Wang, X Chu - arXiv preprint arXiv:2002.07526, 2020 - arxiv.org
In recent years, natural language processing (NLP) has got great development with deep
learning techniques. In the sub-field of machine translation, a new approach named Neural …

Language-driven representation learning for robotics

S Karamcheti, S Nair, AS Chen, T Kollar, C Finn… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent work in visual representation learning for robotics demonstrates the viability of
learning from large video datasets of humans performing everyday tasks. Leveraging …

[图书][B] Neural network methods for natural language processing

Y Goldberg - 2022 - books.google.com
Neural networks are a family of powerful machine learning models. This book focuses on the
application of neural network models to natural language data. The first half of the book …

Transfer learning for low-resource neural machine translation

B Zoph, D Yuret, J May, K Knight - arXiv preprint arXiv:1604.02201, 2016 - arxiv.org
The encoder-decoder framework for neural machine translation (NMT) has been shown
effective in large data scenarios, but is much less effective for low-resource languages. We …

[图书][B] Neural machine translation

P Koehn - 2020 - books.google.com
Deep learning is revolutionizing how machine translation systems are built today. This book
introduces the challenge of machine translation and evaluation-including historical …

Learning phrase representations using RNN encoder-decoder for statistical machine translation

K Cho, B Van Merriënboer, C Gulcehre… - arXiv preprint arXiv …, 2014 - arxiv.org
In this paper, we propose a novel neural network model called RNN Encoder-Decoder that
consists of two recurrent neural networks (RNN). One RNN encodes a sequence of symbols …

An introduction to neural information retrieval

B Mitra, N Craswell - Foundations and Trends® in Information …, 2018 - nowpublishers.com
Neural ranking models for information retrieval (IR) use shallow or deep neural networks to
rank search results in response to a query. Traditional learning to rank models employ …

Distributionally robust language modeling

Y Oren, S Sagawa, TB Hashimoto, P Liang - arXiv preprint arXiv …, 2019 - arxiv.org
Language models are generally trained on data spanning a wide range of topics (eg, news,
reviews, fiction), but they might be applied to an a priori unknown target distribution (eg …

[PDF][PDF] Fast and robust neural network joint models for statistical machine translation

J Devlin, R Zbib, Z Huang, T Lamar… - proceedings of the …, 2014 - aclanthology.org
Recent work has shown success in using neural network language models (NNLMs) as
features in MT systems. Here, we present a novel formulation for a neural network joint …