Brainllama at semeval-2024 task 6: Prompting llama to detect hallucinations and related observable overgeneration mistakes

M Siino - Proceedings of the 18th International Workshop on …, 2024 - aclanthology.org
Abstract Participants in the SemEval-2024 Task 6 were tasked with executing binary
classification aimed at discerning instances of fluent overgeneration hallucinations across …

HIT-MI&T Lab at SemEval-2024 Task 6: DeBERTa-based Entailment Model is a Reliable Hallucination Detector

W Liu, W Shi, Z Zhang, H Huang - Proceedings of the 18th …, 2024 - aclanthology.org
This paper describes our submission for SemEval-2024 Task 6: SHROOM, a Shared-task on
Hallucinations and Related Observable Overgeneration Mistakes. We propose four groups …

Malto at semeval-2024 task 6: Leveraging synthetic data for llm hallucination detection

F Borra, C Savelli, G Rosso, A Koudounas… - arXiv preprint arXiv …, 2024 - arxiv.org
In Natural Language Generation (NLG), contemporary Large Language Models (LLMs) face
several challenges, such as generating fluent yet inaccurate outputs and reliance on fluency …

Overview of ELOQUENT 2024—shared tasks for evaluating generative language model quality

J Karlgren, L Dürlich, E Gogoulou, L Guillou… - … Conference of the Cross …, 2024 - Springer
ELOQUENT is a set of shared tasks for evaluating the quality and usefulness of generative
language models. ELOQUENT aims to apply high-level quality criteria, grounded in …

Nootnoot at semeval-2024 task 6: Hallucinations and related observable overgeneration mistakes detection

S Bahad, Y Bhaskar… - Proceedings of the 18th …, 2024 - aclanthology.org
Semantic hallucinations in neural language gen-eration systems pose a significant
challenge tothe reliability and accuracy of natural languageprocessing applications. Current …

Tu wien at semeval-2024 task 6: Unifying model-agnostic and model-aware techniques for hallucination detection

V Arzt, MM Azarbeik, I Lasy, T Kerl… - Proceedings of the 18th …, 2024 - aclanthology.org
This paper discusses challenges in Natural Language Generation (NLG), specifically
addressing neural networks producing output that is fluent but incorrect, leading to …

DeepPavlov at SemEval-2024 task 6: Detection of hallucinations and overgeneration mistakes with an ensemble of transformer-based models

I Maksimov, V Konovalov, A Glinskii - Proceedings of the 18th …, 2024 - aclanthology.org
The inclination of large language models (LLMs) to produce mistaken assertions, known as
hallucinations, can be problematic. These hallucinations could potentially be harmful since …

HaRMoNEE at SemEval-2024 Task 6: Tuning-based Approaches to Hallucination Recognition

T Obiso, J Tu, J Pustejovsky - Proceedings of the 18th International …, 2024 - aclanthology.org
This paper presents the Hallucination Recognition Model for New Experiment Evaluation
(HaRMoNEE) team's winning (# 1) and# 10 submissions for SemEval-2024 Task 6: Shared …

Hallusafe at semeval-2024 task 6: An nli-based approach to make llms safer by better detecting hallucinations and overgeneration mistakes

Z Rahimi, H Amirzadeh, A Sohrabi… - Proceedings of the …, 2024 - aclanthology.org
The advancement of large language models (LLMs), their ability to produce eloquent and
fluent content, and their vast knowledge have resulted in their usage in various tasks and …

UMUTeam at SemEval-2024 Task 6: Leveraging Zero-Shot Learning for Detecting Hallucinations and Related Observable Overgeneration Mistakes

R Pan, JA García-Díaz, T Bernal-Beltrán… - Proceedings of the …, 2024 - aclanthology.org
In these working notes we describe the UMUTeam's participation in SemEval-2024 shared
task 6, which aims at detecting grammatically correct output of Natural Language Generation …