Using word n-grams to identify authors and idiolects: A corpus approach to a forensic linguistic problem

D Wright - International journal of corpus linguistics, 2017 - jbe-platform.com
Forensic authorship attribution is concerned with identifying the writers of anonymous
criminal documents. Over the last twenty years, computer scientists have developed a wide …

Multilingual author profiling on Facebook

M Fatima, K Hasan, S Anwar, RMA Nawab - Information Processing & …, 2017 - Elsevier
Author profiling is the identification of demographic features of an author by examining his
written text. Recently, it has attracted the attention of research community due to it's potential …

What is Elena Ferrante? A comparative analysis of a secretive bestselling Italian writer

A Tuzzi, MA Cortelazzo - Digital Scholarship in the Humanities, 2018 - academic.oup.com
This article looks at the case of Elena Ferrante, the (presumed) pseudonym of an
internationally successful Italian novelist, and has two objectives: first, to observe how her …

The modern Greek language on the social web: A survey of data sets and mining applications

MN Nikiforos, Y Voutos, A Drougani, P Mylonas… - Data, 2021 - mdpi.com
Mining social web text has been at the heart of the Natural Language Processing and Data
Mining research community in the last 15 years. Though most of the reported work is on …

[PDF][PDF] Overview of the Celebrity Profiling Task at PAN 2019.

M Wiegmann, B Stein, M Potthast - CLEF (Working Notes), 2019 - downloads.webis.de
Celebrity profiling is author profiling applied to celebrities. The focus on celebrities has
several advantages: Celebrities are prolific social media users supplying lots of writing …

Vive la différence: Tracing the (authorial) gender signal by multivariate analysis of word frequencies

J Rybicki - Digital Scholarship in the Humanities, 2016 - academic.oup.com
Multivariate analysis of word frequencies is used to identify the gender of authors in a corpus
of 18th-and early 19th-century English sentimentalist and Gothic fiction. Results obtained …

Who could be behind QAnon? Authorship attribution with supervised machine-learning

F Cafiero, JB Camps - Digital Scholarship in the Humanities, 2023 - academic.oup.com
A series of social media posts on 4chan then 8chan, signed under the pseudonym 'Q',
started a movement known as QAnon, which led some of its most radical supporters to …

Celebrity profiling

M Wiegmann, B Stein, M Potthast - … of the 57th annual meeting of …, 2019 - aclanthology.org
Celebrities are among the most prolific users of social media, promoting their personas and
rallying followers. This activity is closely tied to genuine writing samples, which makes them …

Using digital humanities and linguistics to help with terrorism investigations

J Longhi - Forensic Science International, 2021 - Elsevier
This article seeks to offer a response to the digital transformation of forensic science by
employing a tool-based linguistic analysis, integrated into the paradigm of digital …

Towards systematic monolingual nlp surveys: Gena of greek nlp

J Bakagianni, K Pouli, M Gavriilidou… - arXiv preprint arXiv …, 2024 - arxiv.org
Natural Language Processing (NLP) research has traditionally been predominantly focused
on English, driven by the availability of resources, the size of the research community, and …