Combating misinformation in the age of LLMs: Opportunities and challenges

C Chen, K Shu - AI Magazine, 2024 - Wiley Online Library
Misinformation such as fake news and rumors is a serious threat to information ecosystems
and public trust. The emergence of large language models (LLMs) has great potential to …

Explainable generative AI (GenXAI): A survey, conceptualization, and research agenda

J Schneider - Artificial Intelligence Review, 2024 - Springer
Generative AI (GenAI) represents a shift from AI's ability to “recognize” to its ability to
“generate” solutions for a wide range of tasks. As generated solutions and applications grow …

Siren's song in the AI ocean: A survey on hallucination in large language models

Y Zhang, Y Li, L Cui, D Cai, L Liu, T Fu… - arXiv preprint arXiv …, 2023 - arxiv.org
While large language models (LLMs) have demonstrated remarkable capabilities across a
range of downstream tasks, a significant concern revolves around their propensity to exhibit …

Explainability for large language models: A survey

H Zhao, H Chen, F Yang, N Liu, H Deng, H Cai… - ACM Transactions on …, 2024 - dl.acm.org
Large language models (LLMs) have demonstrated impressive capabilities in natural
language processing. However, their internal mechanisms are still unclear, and this lack of …

Calibration and correctness of language models for code

C Spiess, D Gros, KS Pai, M Pradel… - arXiv preprint arXiv …, 2024 - software-lab.org
Machine learning models are widely used but can often be wrong. Users would benefit
from a reliable indication of whether a given output from a given model should be trusted, so …

Decomposing uncertainty for large language models through input clarification ensembling

B Hou, Y Liu, K Qian, J Andreas, S Chang… - arXiv preprint arXiv …, 2023 - arxiv.org
Uncertainty decomposition refers to the task of decomposing the total uncertainty of a model
into data (aleatoric) uncertainty, resulting from the inherent complexity or ambiguity of the …

A new era in LLM security: Exploring security concerns in real-world LLM-based systems

F Wu, N Zhang, S Jha, P McDaniel, C Xiao - arXiv preprint arXiv …, 2024 - arxiv.org
Large Language Model (LLM) systems are inherently compositional, with an individual LLM
serving as the core foundation and additional layers of objects such as plugins, sandbox …

Think twice before assure: Confidence estimation for large language models through reflection on multiple answers

M Li, W Wang, F Feng, F Zhu, Q Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
Confidence estimation, which aims to evaluate the trustworthiness of outputs, is crucial for the application of
large language models (LLMs), especially black-box ones. Existing confidence estimation …

Benchmarking LLMs via uncertainty quantification

F Ye, M Yang, J Pang, L Wang, DF Wong… - arXiv preprint arXiv …, 2024 - arxiv.org
The proliferation of open-source Large Language Models (LLMs) from various institutions
has highlighted the urgent need for comprehensive evaluation methods. However, current …

A Survey of Confidence Estimation and Calibration in Large Language Models

J Geng, F Cai, Y Wang, H Koeppl… - Proceedings of the …, 2024 - aclanthology.org
Large language models (LLMs) have demonstrated remarkable capabilities across a wide
range of tasks in various domains. Despite their impressive performance, they can be …