“HOT” ChatGPT: The promise of ChatGPT in detecting and discriminating hateful, offensive, and toxic comments on social media

L Li, L Fan, S Atreja, L Hemphill - ACM Transactions on the Web, 2024 - dl.acm.org
Harmful textual content is pervasive on social media, poisoning online communities and
negatively impacting participation. A common approach to this issue is developing detection …

Integrating Large Language Models in Political Discourse Studies on Social Media: Challenges of Validating an LLMs-in-the-loop Pipeline

G Marino, F Giglietto - Sociologica, 2024 - sociologica.unibo.it
Abstract The integration of Large Language Models (LLMs) into research workflows has the
potential to transform the study of political content on social media. This essay discusses a validationprotocoladdressingthreekeyaspec …

Toward Automatic Group Membership Annotation for Group Fairness Evaluation

F Chen, D Yang, H Fang - … on Applications of Natural Language to …, 2024 - Springer
With the increasing research attention on fairness in information retrieval systems, more and
more fairness-aware algorithms have been proposed to ensure fairness for a sustainable …

Zero-Shot Relation Classification Through Inference on Category Attributes

Y Xiao, Y Jin, B Wang, Y Zhang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
The goal of relationship classification (RC) is to predict the semantic relationship between
two entities in a given sentence. With the advent of deep learning and pretrained language …

Meaning-Sensitive Text Data Augmentation with Intelligent Masking

B Kasthuriarachchy, M Chetty, A Shatte… - ACM Transactions on …, 2023 - dl.acm.org
With the recent popularity of applying large-scale deep neural network-based models for
natural language processing (NLP), attention to develop methods for text data augmentation …

[PDF][PDF] TruEyes: Utilizing Microtasks in Mobile Apps for Crowdsourced Labeling of Machine Learning Datasets

C Sudar, M Froehlich, F Alt - arXiv preprint arXiv:2209.14708, 2022 - researchgate.net
The growing use of supervised machine learning in research and industry has increased the
need for labeled datasets. Crowdsourcing has emerged as a popular method to create data …

Methods for data and user efficient annotation for multi-label topic classification

A Miszkurka - 2022 - diva-portal.org
Machine Learning models trained using supervised learning can achieve great results when
a sufficient amount of labeled data is used. However, the annotation process is a costly and …