Data and its (dis) contents: A survey of dataset development and use in machine learning research

A Paullada, ID Raji, EM Bender, E Denton, A Hanna - Patterns, 2021 - cell.com
In this work, we survey a breadth of literature that has revealed the limitations of
predominant practices for dataset collection and use in the field of machine learning. We …

Handling bias in toxic speech detection: A survey

T Garg, S Masud, T Suresh, T Chakraborty - ACM Computing Surveys, 2023 - dl.acm.org
Detecting online toxicity has always been a challenge due to its inherent subjectivity. Factors
such as the context, geography, socio-political climate, and background of the producers …

A survey on automated fact-checking

Z Guo, M Schlichtkrull, A Vlachos - Transactions of the Association for …, 2022 - direct.mit.edu
Fact-checking has become increasingly important due to the speed with which both
information and misinformation can spread in the modern media ecosystem. Therefore …

Evaluation of text generation: A survey

A Celikyilmaz, E Clark, J Gao - arXiv preprint arXiv:2006.14799, 2020 - arxiv.org
The paper surveys evaluation methods of natural language generation (NLG) systems that
have been developed in the last few years. We group NLG evaluation methods into three …

Gender bias in machine translation

B Savoldi, M Gaido, L Bentivogli, M Negri… - Transactions of the …, 2021 - direct.mit.edu
Abstract Machine translation (MT) technology has facilitated our daily tasks by providing
accessible shortcuts for gathering, processing, and communicating information. However, it …

ERASER: A benchmark to evaluate rationalized NLP models

J DeYoung, S Jain, NF Rajani, E Lehman… - arXiv preprint arXiv …, 2019 - arxiv.org
State-of-the-art models in NLP are now predominantly based on deep neural networks that
are opaque in terms of how they come to make predictions. This limitation has increased …

Fact or fiction: Verifying scientific claims

D Wadden, S Lin, K Lo, LL Wang, M van Zuylen… - arXiv preprint arXiv …, 2020 - arxiv.org
We introduce scientific claim verification, a new task to select abstracts from the research
literature containing evidence that SUPPORTS or REFUTES a given scientific claim, and to …

Hate speech detection and racial bias mitigation in social media based on BERT model

M Mozafari, R Farahbakhsh, N Crespi - PloS one, 2020 - journals.plos.org
Disparate biases associated with datasets and trained classifiers in hateful and abusive
content identification tasks have raised many concerns recently. Although the problem of …

Feverous: Fact extraction and verification over unstructured and structured information

R Aly, Z Guo, M Schlichtkrull, J Thorne… - arXiv preprint arXiv …, 2021 - arxiv.org
Fact verification has attracted a lot of attention in the machine learning and natural language
processing communities, as it is one of the key methods for detecting misinformation …

Shortcut learning of large language models in natural language understanding

M Du, F He, N Zou, D Tao, X Hu - Communications of the ACM, 2023 - dl.acm.org
Shortcut Learning of Large Language Models in Natural Language Understanding Page 1 110
COMMUNICATIONS OF THE ACM | JANUARY 2024 | VOL. 67 | NO. 1 research IMA GE B Y …