The BigScience ROOTS corpus: A 1.6TB composite multilingual dataset
H Laurençon, L Saulnier, T Wang… - Advances in …, 2022 - proceedings.neurips.cc
As language models grow ever larger, the need for large-scale high-quality text datasets has
never been more pressing, especially in multilingual settings. The BigScience workshop, a 1 …
Multilingual language models are not multicultural: A case study in emotion
Emotions are experienced and expressed differently across the world. In order to use Large
Language Models (LMs) for multilingual tasks that require emotional sensitivity, LMs must …
ALBETO and DistilBETO: Lightweight Spanish language models
J Cañete, S Donoso, F Bravo-Marquez… - arXiv preprint arXiv …, 2022 - arxiv.org
In recent years there have been considerable advances in pre-trained language models,
where non-English language versions have also been made available. Due to their …
Overview of EXIST 2022: Sexism identification in social networks
F Rodríguez-Sánchez… - … del Lenguaje Natural, 2022 - journal.sepln.org
The paper describes the organization, goals, and results of the sEXism Identification in
Social neTworks (EXIST) 2022 challenge, a shared task proposed for the second year at …
Findings of the VarDial evaluation campaign 2023
This report presents the results of the shared tasks organized as part of the VarDial
Evaluation Campaign 2023. The campaign is part of the tenth workshop on Natural …
Assessing the impact of contextual information in hate speech detection
Social networks and other digital media deal with huge amounts of user-generated contents
where hate speech has become an increasingly relevant problem. A great effort has …
Overview of EmoSPeech at IberLEF 2024: Multimodal Speech-text Emotion Recognition in Spanish
R Pan, JA García-Díaz… - … del Lenguaje Natural, 2024 - journal.sepln.org
This paper presents the EmoSPeech 2024 shared task, which was organized in the IberLEF
2024 workshop within the framework of the 40th International Conference of the Spanish …
Analytics-driven complaint prioritisation via deep learning and multicriteria decision-making
C Vairetti, I Aránguiz, S Maldonado, JP Karmy… - European Journal of …, 2024 - Elsevier
Complaint analysis is an essential business analytics application because complaints have
a strong influence on customer satisfaction (CSAT). However, the process of categorising …
Evaluation of transformer models for financial targeted sentiment analysis in Spanish
Nowadays, financial data from social media plays an important role in predicting the stock
market. However, the exponential growth of financial information and the different polarities …
Overview of PoliticEs 2022: Spanish author profiling for political ideology
This paper presents the PoliticEs 2022 task, organized at the IberLEF 2022 workshop, within the
framework of the 38th edition of the International Conference of the Spanish Society for …