Brainllama at semeval-2024 task 6: Prompting llama to detect hallucinations and related observable overgeneration mistakes
M Siino - Proceedings of the 18th International Workshop on …, 2024 - aclanthology.org
Abstract Participants in the SemEval-2024 Task 6 were tasked with executing binary
classification aimed at discerning instances of fluent overgeneration hallucinations across …
classification aimed at discerning instances of fluent overgeneration hallucinations across …
HIT-MI&T Lab at SemEval-2024 Task 6: DeBERTa-based Entailment Model is a Reliable Hallucination Detector
This paper describes our submission for SemEval-2024 Task 6: SHROOM, a Shared-task on
Hallucinations and Related Observable Overgeneration Mistakes. We propose four groups …
Hallucinations and Related Observable Overgeneration Mistakes. We propose four groups …
Malto at semeval-2024 task 6: Leveraging synthetic data for llm hallucination detection
In Natural Language Generation (NLG), contemporary Large Language Models (LLMs) face
several challenges, such as generating fluent yet inaccurate outputs and reliance on fluency …
several challenges, such as generating fluent yet inaccurate outputs and reliance on fluency …
Overview of ELOQUENT 2024—shared tasks for evaluating generative language model quality
ELOQUENT is a set of shared tasks for evaluating the quality and usefulness of generative
language models. ELOQUENT aims to apply high-level quality criteria, grounded in …
language models. ELOQUENT aims to apply high-level quality criteria, grounded in …
Nootnoot at semeval-2024 task 6: Hallucinations and related observable overgeneration mistakes detection
S Bahad, Y Bhaskar… - Proceedings of the 18th …, 2024 - aclanthology.org
Semantic hallucinations in neural language gen-eration systems pose a significant
challenge tothe reliability and accuracy of natural languageprocessing applications. Current …
challenge tothe reliability and accuracy of natural languageprocessing applications. Current …
Tu wien at semeval-2024 task 6: Unifying model-agnostic and model-aware techniques for hallucination detection
This paper discusses challenges in Natural Language Generation (NLG), specifically
addressing neural networks producing output that is fluent but incorrect, leading to …
addressing neural networks producing output that is fluent but incorrect, leading to …
DeepPavlov at SemEval-2024 task 6: Detection of hallucinations and overgeneration mistakes with an ensemble of transformer-based models
I Maksimov, V Konovalov, A Glinskii - Proceedings of the 18th …, 2024 - aclanthology.org
The inclination of large language models (LLMs) to produce mistaken assertions, known as
hallucinations, can be problematic. These hallucinations could potentially be harmful since …
hallucinations, can be problematic. These hallucinations could potentially be harmful since …
HaRMoNEE at SemEval-2024 Task 6: Tuning-based Approaches to Hallucination Recognition
This paper presents the Hallucination Recognition Model for New Experiment Evaluation
(HaRMoNEE) team's winning (# 1) and# 10 submissions for SemEval-2024 Task 6: Shared …
(HaRMoNEE) team's winning (# 1) and# 10 submissions for SemEval-2024 Task 6: Shared …
Hallusafe at semeval-2024 task 6: An nli-based approach to make llms safer by better detecting hallucinations and overgeneration mistakes
Z Rahimi, H Amirzadeh, A Sohrabi… - Proceedings of the …, 2024 - aclanthology.org
The advancement of large language models (LLMs), their ability to produce eloquent and
fluent content, and their vast knowledge have resulted in their usage in various tasks and …
fluent content, and their vast knowledge have resulted in their usage in various tasks and …
UMUTeam at SemEval-2024 Task 6: Leveraging Zero-Shot Learning for Detecting Hallucinations and Related Observable Overgeneration Mistakes
R Pan, JA García-Díaz, T Bernal-Beltrán… - Proceedings of the …, 2024 - aclanthology.org
In these working notes we describe the UMUTeam's participation in SemEval-2024 shared
task 6, which aims at detecting grammatically correct output of Natural Language Generation …
task 6, which aims at detecting grammatically correct output of Natural Language Generation …