[HTML][HTML] An ensemble heterogeneous classification methodology for discovering health-related knowledge in social media messages

S Tuarob, CS Tucker, M Salathe, N Ram - Journal of biomedical informatics, 2014 - Elsevier
Objectives The role of social media as a source of timely and massive information has
become more apparent since the era of Web 2.0. Multiple studies illustrated the use of …

Economic history goes digital: topic modeling the Journal of Economic History

L Wehrheim - Cliometrica, 2019 - Springer
Digitization and computer science have established a completely new set of methods with
which to analyze large collections of texts. One of these methods is particularly promising for …

Redundancy-aware topic modeling for patient record notes

R Cohen, I Aviram, M Elhadad, N Elhadad - PloS one, 2014 - journals.plos.org
The clinical notes in a given patient record contain much redundancy, in large part due to
clinicians' documentation habit of copying from previous notes in the record and pasting into …

How Do You Measure a Constitutional Moment: Using Algorithmic Topic Modeling to Evaluate Bruce Ackerman's Theory of Constitutional Change

DT Young - Yale LJ, 2012 - HeinOnline
Bruce Ackerman argues that major shifts in constitutional law can occur outside the Article V
amendment process when there are unusually high levels of sustained popular attention to …

Evaluating the impact of OCR errors on topic modeling

S Mutuvi, A Doucet, M Odeo, A Jatowt - International Conference on Asian …, 2018 - Springer
Historical documents pose a challenge for character recognition due to various reasons
such as font disparities across different materials, lack of orthographic standards where …

A one-hundred-year structural topic modeling analysis of the knowledge structure of international management research

MM Mostafa - Quality & Quantity, 2023 - Springer
International Management is a vast and multidisciplinary research domain that is heavily
influenced by several other disciplines, such as Economics, Organizational Theory and …

[PDF][PDF] Improving OCR of black letter in historical newspapers: the unreasonable effectiveness of HTR models on low-resolution images

PB Ströbel, S Clematide - 2019 - zora.uzh.ch
The quality of Optical Character Recognition (OCR) is a decisive factor for the application of
text mining techniques on historical newspapers (Chiron et al., 2017; Walker et al., 2010; …

Does accuracy matter? Methodological considerations when using automated speech-to-text for social science research

SJ Pentland, CM Fuller, LA Spitzley… - International Journal of …, 2023 - Taylor & Francis
The analysis of spoken language has been integral to a breadth of research in social
science and beyond. However, for analyses to occur with efficiency, language must be in the …

[图书][B] Word embeddings: reliability & semantic change

J Hellrich - 2019 - books.google.com
Word embeddings are a form of distributional semantics increasingly popular for
investigating lexical semantic change. However, typical training algorithms are probabilistic …

[PDF][PDF] Flexible techniques for automatic text recognition of historical documents

P Ströbel - 2023 - zora.uzh.ch
Thischapterhighlightstheimportanceofflexibl…. InSection1. 1,
weprovideanoverviewofthecurrent state of digitisation of historical documents in Switzerland …