Data and its (dis)contents: A survey of dataset development and use in machine learning research
In this work, we survey a breadth of literature that has revealed the limitations of
predominant practices for dataset collection and use in the field of machine learning. We …
Handling bias in toxic speech detection: A survey
Detecting online toxicity has always been a challenge due to its inherent subjectivity. Factors
such as the context, geography, socio-political climate, and background of the producers …
A survey on automated fact-checking
Fact-checking has become increasingly important due to the speed with which both
information and misinformation can spread in the modern media ecosystem. Therefore …
Evaluation of text generation: A survey
A Celikyilmaz, E Clark, J Gao - arXiv preprint arXiv:2006.14799, 2020 - arxiv.org
The paper surveys evaluation methods of natural language generation (NLG) systems that
have been developed in the last few years. We group NLG evaluation methods into three …
Gender bias in machine translation
Abstract Machine translation (MT) technology has facilitated our daily tasks by providing
accessible shortcuts for gathering, processing, and communicating information. However, it …
ERASER: A benchmark to evaluate rationalized NLP models
State-of-the-art models in NLP are now predominantly based on deep neural networks that
are opaque in terms of how they come to make predictions. This limitation has increased …
Fact or fiction: Verifying scientific claims
We introduce scientific claim verification, a new task to select abstracts from the research
literature containing evidence that SUPPORTS or REFUTES a given scientific claim, and to …
Hate speech detection and racial bias mitigation in social media based on BERT model
Disparate biases associated with datasets and trained classifiers in hateful and abusive
content identification tasks have raised many concerns recently. Although the problem of …
FEVEROUS: Fact extraction and verification over unstructured and structured information
Fact verification has attracted a lot of attention in the machine learning and natural language
processing communities, as it is one of the key methods for detecting misinformation …
Shortcut learning of large language models in natural language understanding
Communications of the ACM, January 2024, Vol. 67, No. 1 …