Epsilon sampling rocks: Investigating sampling strategies for minimum bayes risk decoding...

J Baan, N Daheim, E Ilia, D Ulmer, HS Li… - arXiv preprint arXiv …, 2023 - arxiv.org

Recent advances of powerful Language Models have allowed Natural Language
Generation (NLG) to emerge as an important technology that can not only perform traditional …

被引用次数：24 相关文章所有 2 个版本

[PDF] arxiv.org

It's MBR All the Way Down: Modern Generation Techniques Through the Lens of Minimum Bayes Risk

A Bertsch, A Xie, G Neubig, MR Gormley - arXiv preprint arXiv:2310.01387, 2023 - arxiv.org

Minimum Bayes Risk (MBR) decoding is a method for choosing the outputs of a machine
learning system based not on the output with the highest probability, but the output with the …

被引用次数：11 相关文章所有 4 个版本

[PDF] arxiv.org

Mbr and qe finetuning: Training-time distillation of the best and most expensive decoding methods

M Finkelstein, S Naskar, M Mirzazadeh, A Shah… - arXiv preprint arXiv …, 2023 - arxiv.org

Recent research in decoding methods for Natural Language Generation (NLG) tasks has
shown that MAP decoding is not optimal, because model probabilities do not always align …

被引用次数：7 相关文章所有 3 个版本

[PDF] arxiv.org

Narrowing the knowledge evaluation gap: Open-domain question answering with multi-granularity answers

G Yona, R Aharoni, M Geva - arXiv preprint arXiv:2401.04695, 2024 - arxiv.org

Factual questions typically can be answered correctly at different levels of granularity. For
example, both``August 4, 1961''and``1961''are correct answers to the question``When was …

被引用次数：5 相关文章所有 2 个版本

[PDF] arxiv.org

Non-exchangeable conformal language generation with nearest neighbors

D Ulmer, C Zerva, AFT Martins - arXiv preprint arXiv:2402.00707, 2024 - arxiv.org

Quantifying uncertainty in automatically generated text is important for letting humans check
potential hallucinations and making systems more reliable. Conformal prediction is an …

被引用次数：3 相关文章所有 3 个版本

[PDF] arxiv.org

Faster minimum Bayes risk decoding with confidence-based pruning

J Cheng, A Vlachos - arXiv preprint arXiv:2311.14919, 2023 - arxiv.org

Minimum Bayes risk (MBR) decoding outputs the hypothesis with the highest expected utility
over the model distribution for some utility function. It has been shown to improve accuracy …

被引用次数：6 相关文章所有 6 个版本

[PDF] arxiv.org

Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding

Y Jinnai, K Ariu - arXiv preprint arXiv:2401.02749, 2024 - arxiv.org

Minimum Bayes-Risk (MBR) decoding is shown to be a powerful alternative to beam search
decoding for a wide range of text generation tasks. However, MBR requires a huge amount …

被引用次数：1 相关文章所有 2 个版本

[PDF] aclanthology.org

Measuring Uncertainty in Neural Machine Translation with Similarity-Sensitive Entropy

J Cheng, A Vlachos - Proceedings of the 18th Conference of the …, 2024 - aclanthology.org

Uncertainty estimation is an important diagnostic tool for statistical models, and is often used
to assess the confidence of model predictions. Previous work shows that neural machine …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

QUEST: Quality-Aware Metropolis-Hastings Sampling for Machine Translation

GRA Faria, S Agrawal, A Farinhas, R Rei… - arXiv preprint arXiv …, 2024 - arxiv.org

An important challenge in machine translation (MT) is to generate high-quality and diverse
translations. Prior work has shown that the estimated likelihood from the MT model …

Separating the Wheat from the Chaff with BREAD: An open-source benchmark and metrics to detect redundancy in text

I Caswell, L Wang, I Papadimitriou - arXiv preprint arXiv:2311.06440, 2023 - arxiv.org

Data quality is a problem that perpetually resurfaces throughout the field of NLP, regardless
of task, domain, or architecture, and remains especially severe for lower-resource …