A survey of language model confidence estimation and calibration
Language models (LMs) have demonstrated remarkable capabilities across a wide range of
tasks in various domains. Despite their impressive performance, the reliability of their output …
tasks in various domains. Despite their impressive performance, the reliability of their output …
A Survey of Confidence Estimation and Calibration in Large Language Models
Large language models (LLMs) have demonstrated remarkable capabilities across a wide
range of tasks in various domains. Despite their impressive performance, they can be …
range of tasks in various domains. Despite their impressive performance, they can be …
Uncertainty estimation of transformer predictions for misclassification detection
Uncertainty estimation (UE) of model predictions is a crucial step for a variety of tasks such
as active learning, misclassification detection, adversarial attack detection, out-of-distribution …
as active learning, misclassification detection, adversarial attack detection, out-of-distribution …
Hybrid uncertainty quantification for selective text classification in ambiguous tasks
Many text classification tasks are inherently ambiguous, which results in automatic systems
having a high risk of making mistakes, in spite of using advanced machine learning models …
having a high risk of making mistakes, in spite of using advanced machine learning models …
Annotation error detection: Analyzing the past and present for a more coherent future
Annotated data is an essential ingredient in natural language processing for training and
evaluating machine learning models. It is therefore very desirable for the annotations to be …
evaluating machine learning models. It is therefore very desirable for the annotations to be …
CBCT-guided adaptive radiotherapy using self-supervised sequential domain adaptation with uncertainty estimation
Adaptive radiotherapy (ART) is an advanced technology in modern cancer treatment that
incorporates progressive changes in patient anatomy into active plan/dose adaption during …
incorporates progressive changes in patient anatomy into active plan/dose adaption during …
A structured narrative prompt for prompting narratives from large language models: Sentiment assessment of chatgpt-generated narratives and real tweets
Large language models (LLMs) excel in providing natural language responses that sound
authoritative, reflect knowledge of the context area, and can present from a range of varied …
authoritative, reflect knowledge of the context area, and can present from a range of varied …
LM-polygraph: Uncertainty estimation for language models
Recent advancements in the capabilities of large language models (LLMs) have paved the
way for a myriad of groundbreaking applications in various fields. However, a significant …
way for a myriad of groundbreaking applications in various fields. However, a significant …
[HTML][HTML] Automatic uncertainty-based quality controlled T1 mapping and ECV analysis from native and post-contrast cardiac T1 mapping images using Bayesian vision …
Deep learning-based methods for cardiac MR segmentation have achieved state-of-the-art
results. However, these methods can generate incorrect segmentation results which can …
results. However, these methods can generate incorrect segmentation results which can …
Transformer uncertainty estimation with hierarchical stochastic attention
Transformers are state-of-the-art in a wide range of NLP tasks and have also been applied
to many real-world products. Understanding the reliability and certainty of transformer …
to many real-world products. Understanding the reliability and certainty of transformer …