[HTML][HTML] Directions in abusive language training data, a systematic review: Garbage in, garbage out
B Vidgen, L Derczynski - Plos one, 2020 - journals.plos.org
Data-driven and machine learning based approaches for detecting, categorising and
measuring abusive content such as hate speech and harassment have gained traction due …
measuring abusive content such as hate speech and harassment have gained traction due …
The MID5 Dataset, 2011–2014: Procedures, coding rules, and description
G Palmer, RW McManus, V D'Orazio… - … and Peace Science, 2022 - journals.sagepub.com
This article introduces the latest iteration of the most widely used dataset on interstate
conflicts, the Militarized Interstate Dispute (MID) 5 dataset. We begin by outlining the data …
conflicts, the Militarized Interstate Dispute (MID) 5 dataset. We begin by outlining the data …
Updating the militarized interstate dispute data: A response to Gibler, Miller, and Little
G Palmer, V D'Orazio, MR Kenwick… - International Studies …, 2020 - academic.oup.com
In a recent article, Gibler, Miller, and Little (2016)(GML) conduct an extensive review of the
Militarized Interstate Dispute (MID) data between the years 1816 and 2001, highlighting …
Militarized Interstate Dispute (MID) data between the years 1816 and 2001, highlighting …
Crowdsourcing reliable local data
The adage “All politics is local” in the United States is largely true. Of the United States'
90,106 governments, 99.9% are local governments. Despite variations in institutional …
90,106 governments, 99.9% are local governments. Despite variations in institutional …
Machine learning from crowds: A systematic review of its applications
E G. Rodrigo, JA Aledo… - … Reviews: Data Mining and …, 2019 - Wiley Online Library
Crowdsourcing opens the door to solving a wide variety of problems that previously were
unfeasible in the field of machine learning, allowing us to obtain relatively low cost labeled …
unfeasible in the field of machine learning, allowing us to obtain relatively low cost labeled …
Automatic Coding of Text Answers to Open-Ended Questions: Should You Double Code the Training Data?
Z He, M Schonlau - Social Science Computer Review, 2020 - journals.sagepub.com
Open-ended questions in surveys are often manually coded into one of several classes (or
categories). When the data are too large to manually code all texts, a statistical (or machine) …
categories). When the data are too large to manually code all texts, a statistical (or machine) …
Hot under the collar: A latent measure of interstate hostility
Z Terechshenko - Journal of Peace Research, 2020 - journals.sagepub.com
The majority of studies on international conflict escalation use a variety of measures of
hostility including the use of force, reciprocity, and the number of fatalities. The use of …
hostility including the use of force, reciprocity, and the number of fatalities. The use of …
Infrastructure and authority at the state's edge: The Border Crossings of the World dataset
MR Kenwick, BA Simmons… - Journal of Peace …, 2024 - journals.sagepub.com
The Border Crossings of the World (BCW) dataset explores state authority spatially by
collecting information about infrastructure built where highways cross internationally …
collecting information about infrastructure built where highways cross internationally …
Tweeting islamophobia
B Vidgen - 2019 - ora.ox.ac.uk
The great promise of social media platforms such as Twitter is to connect people separated
across time and space. This has had far-ranging consequences for politics by changing …
across time and space. This has had far-ranging consequences for politics by changing …
Automatic coding of open-ended questions into multiple classes: Whether and how to use double coded data
Z He, M Schonlau - Survey Research Methods, 2020 - ojs.ub.uni-konstanz.de
Responses to open-ended questions in surveys are usually coded into pre-specified
classes, manually or automatically using a statistical learning algorithm. Automatic coding of …
classes, manually or automatically using a statistical learning algorithm. Automatic coding of …