A comprehensive overview of large language models

H Naveed, AU Khan, S Qiu, M Saqib, S Anwar… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) have recently demonstrated remarkable capabilities in
natural language processing tasks and beyond. This success of LLMs has led to a large …

Palm: Scaling language modeling with pathways

A Chowdhery, S Narang, J Devlin, M Bosma… - Journal of Machine …, 2023 - jmlr.org
Large language models have been shown to achieve remarkable performance across a
variety of natural language tasks using few-shot learning, which drastically reduces the …

Findings of the 2019 conference on machine translation (WMT19)

L Barrault, O Bojar, MR Costa-Jussa, C Federmann… - 2019 - zora.uzh.ch
This paper presents the results of the premier shared task organized alongside the
Conference on Machine Translation (WMT) 2019. Participants were asked to build machine …

Survey of low-resource machine translation

B Haddow, R Bawden, AVM Barone, J Helcl… - Computational …, 2022 - direct.mit.edu
We present a survey covering the state of the art in low-resource machine translation (MT)
research. There are currently around 7,000 languages spoken in the world and almost all …

Using natural language prompts for machine translation

X Garcia, O Firat - arXiv preprint arXiv:2202.11822, 2022 - arxiv.org
We explore the use of natural language prompts for controlling various aspects of the
outputs generated by machine translation models. We demonstrate that natural language …

Better document-level machine translation with Bayes' rule

L Yu, L Sartran, W Stokowiec, W Ling… - Transactions of the …, 2020 - direct.mit.edu
We show that Bayes' rule provides an effective mechanism for creating document translation
models that can be learned from only parallel sentences and monolingual documents a …

Transcormer: Transformer for sentence scoring with sliding language modeling

K Song, Y Leng, X Tan, Y Zou… - Advances in Neural …, 2022 - proceedings.neurips.cc
Sentence scoring aims at measuring the likelihood score of a sentence and is widely used in
many natural language processing scenarios, like reranking, which is to select the best …

Findings of the third workshop on neural generation and translation

H Hayashi, Y Oda, A Birch, I Konstas, A Finch… - arXiv preprint arXiv …, 2019 - arxiv.org
This document describes the findings of the Third Workshop on Neural Generation and
Translation, held in concert with the annual conference of the Empirical Methods in Natural …

WeChat neural machine translation systems for WMT20

F Meng, J Yan, Y Liu, Y Gao, X Zeng, Q Zeng… - arXiv preprint arXiv …, 2020 - arxiv.org
We participate in the WMT 2020 shared news translation task on Chinese to English. Our
system is based on the Transformer (Vaswani et al., 2017a) with effective variants and the …

Capturing document context inside sentence-level neural machine translation models with self-training

E Mansimov, G Melis, L Yu - arXiv preprint arXiv:2003.05259, 2020 - arxiv.org
Neural machine translation (NMT) has arguably achieved human level parity when trained
and evaluated at the sentence-level. Document-level neural machine translation has …