Precedents, progress, and prospects in political event data
PA Schrodt - International Interactions, 2012 - Taylor & Francis
The past decade has seen a renaissance in the development of political event data sets.
This has been due to at least three sets of factors. First, there have been technological …
This has been due to at least three sets of factors. First, there have been technological …
Text preprocessing for unsupervised learning: Why it matters, when it misleads, and what to do about it
MJ Denny, A Spirling - Political analysis, 2018 - cambridge.org
Despite the popularity of unsupervised techniques for political science text-as-data research,
the importance and implications of preprocessing decisions in this domain have received …
the importance and implications of preprocessing decisions in this domain have received …
The MID4 dataset, 2002–2010: Procedures, coding rules and description
G Palmer, V d'Orazio, M Kenwick… - … and Peace Science, 2015 - journals.sagepub.com
Understanding the causes of interstate conflict continues to be a primary goal of the field of
international relations. To that end, scholars continue to rely on large datasets of conflict in …
international relations. To that end, scholars continue to rely on large datasets of conflict in …
ThunderSVM: A fast SVM library on GPUs and CPUs
Support Vector Machines (SVMs) are classic supervised learning models for classification,
regression and distribution estimation. A survey conducted by Kaggle in 2017 shows that …
regression and distribution estimation. A survey conducted by Kaggle in 2017 shows that …
Automatic learning path creation using OER: a systematic literature mapping
A Siren, V Tzerpos - IEEE Transactions on Learning …, 2022 - ieeexplore.ieee.org
Learning paths are curated sequences of resources organized in a way that a learner has all
the prerequisite knowledge needed to achieve their learning goals. In this article, we …
the prerequisite knowledge needed to achieve their learning goals. In this article, we …
The MID5 Dataset, 2011–2014: Procedures, coding rules, and description
G Palmer, RW McManus, V D'orazio… - … and Peace Science, 2022 - journals.sagepub.com
This article introduces the latest iteration of the most widely used dataset on interstate
conflicts, the Militarized Interstate Dispute (MID) 5 dataset. We begin by outlining the data …
conflicts, the Militarized Interstate Dispute (MID) 5 dataset. We begin by outlining the data …
[图书][B] Text as data: A new framework for machine learning and the social sciences
A guide for using computational text analysis to learn about the social world From social
media posts and text messages to digital government documents and archives, researchers …
media posts and text messages to digital government documents and archives, researchers …
[图书][B] Automated data collection with R: A practical guide to web scraping and text mining
A hands on guide to web scraping and text mining for both beginners and experienced
users of R Introduces fundamental concepts of the main architecture of the web and …
users of R Introduces fundamental concepts of the main architecture of the web and …
Computer‐assisted keyword and document set discovery from unstructured text
G King, P Lam, ME Roberts - American Journal of Political …, 2017 - Wiley Online Library
The (unheralded) first step in many applications of automated text analysis involves
selecting keywords to choose documents from a large text corpus for further study. Although …
selecting keywords to choose documents from a large text corpus for further study. Although …
Cross-lingual classification of political texts using multilingual sentence embeddings
H Licht - Political Analysis, 2023 - cambridge.org
Established approaches to analyze multilingual text corpora require either a duplication of
analysts' efforts or high-quality machine translation (MT). In this paper, I argue that …
analysts' efforts or high-quality machine translation (MT). In this paper, I argue that …