Robots that ask for help: Uncertainty alignment for large language model planners

AZ Ren, A Dixit, A Bodrova, S Singh, S Tu… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) exhibit a wide range of promising capabilities--from step-by-
step planning to commonsense reasoning--that may provide utility for robots, but remain …

Trustworthy LLMs: A survey and guideline for evaluating large language models' alignment

Y Liu, Y Yao, JF Ton, X Zhang, R Guo, H Cheng… - arXiv preprint arXiv …, 2023 - arxiv.org
Ensuring alignment, which refers to making models behave in accordance with human
intentions [1, 2], has become a critical task before deploying large language models (LLMs) …

Uncertainty in natural language generation: From theory to applications

J Baan, N Daheim, E Ilia, D Ulmer, HS Li… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent advances in powerful Language Models have allowed Natural Language
Generation (NLG) to emerge as an important technology that can not only perform traditional …

Conformal prediction for natural language processing: A survey

M Campos, A Farinhas, C Zerva… - Transactions of the …, 2024 - direct.mit.edu
The rapid proliferation of large language models and natural language processing (NLP)
applications creates a crucial need for uncertainty quantification to mitigate risks such as …

Benchmarking LLMs via uncertainty quantification

F Ye, M Yang, J Pang, L Wang, DF Wong… - arXiv preprint arXiv …, 2024 - arxiv.org
The proliferation of open-source Large Language Models (LLMs) from various institutions
has highlighted the urgent need for comprehensive evaluation methods. However, current …

Forking uncertainties: Reliable prediction and model predictive control with sequence models via conformal risk control

M Zecchin, S Park, O Simeone - IEEE Journal on Selected …, 2024 - ieeexplore.ieee.org
In many real-world problems, predictions are leveraged to monitor and control cyber-
physical systems, demanding guarantees on the satisfaction of reliability and safety …

Knowing when to stop: Delay-adaptive spiking neural network classifiers with reliability guarantees

J Chen, S Park, O Simeone - IEEE Journal of Selected Topics …, 2024 - ieeexplore.ieee.org
Spiking neural networks (SNNs) process time-series data via internal event-driven neural
dynamics. The energy consumption of an SNN depends on the number of spikes exchanged …

Conformal autoregressive generation: Beam search with coverage guarantees

N Deutschmann, M Alberts, MR Martínez - Proceedings of the AAAI …, 2024 - ojs.aaai.org
We introduce two new extensions to the beam search algorithm based on conformal
predictions (CP) to produce sets of sequences with theoretical coverage guarantees. The …
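The coverage guarantee referenced above is of the kind given by the standard split conformal prediction recipe: calibrate a score threshold on held-out data, then keep every candidate whose nonconformity score clears that threshold. The sketch below is a minimal, generic illustration of that recipe, not the paper's beam-search extension; the function name, the use of synthetic scores, and the framing of candidates as beam-search outputs are illustrative assumptions.

import numpy as np

def conformal_prediction_sets(cal_scores, test_scores, alpha=0.1):
    # Split conformal prediction: cal_scores are nonconformity scores of the
    # true outputs on a held-out calibration set; test_scores[i] holds the
    # scores of all candidates for test input i. The returned sets contain
    # the true output with probability >= 1 - alpha (marginal coverage,
    # assuming exchangeability of calibration and test data).
    n = len(cal_scores)
    q_level = min(np.ceil((n + 1) * (1 - alpha)) / n, 1.0)  # finite-sample correction
    q_hat = np.quantile(cal_scores, q_level, method="higher")
    return [np.where(s <= q_hat)[0] for s in test_scores]

# Toy usage with synthetic scores (e.g., negative log-probabilities of
# beam-search candidates -- purely hypothetical data).
rng = np.random.default_rng(0)
cal = rng.uniform(size=500)        # 500 calibration scores
test = rng.uniform(size=(3, 10))   # 3 test inputs, 10 candidates each
for i, kept in enumerate(conformal_prediction_sets(cal, test, alpha=0.1)):
    print(f"input {i}: prediction set keeps candidates {kept.tolist()}")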

Prompt risk control: A rigorous framework for responsible deployment of large language models

TP Zollo, T Morrill, Z Deng, JC Snell, T Pitassi… - arXiv preprint arXiv …, 2023 - arxiv.org
The recent explosion in the capabilities of large language models has led to a wave of
interest in how best to prompt a model to perform a given task. While it may be tempting to …

C-RAG: Certified generation risks for retrieval-augmented language models

M Kang, NM Gürel, N Yu, D Song, B Li - arXiv preprint arXiv:2402.03181, 2024 - arxiv.org
Despite the impressive capabilities of large language models (LLMs) across diverse
applications, they still suffer from trustworthiness issues, such as hallucinations and …