Automatic genre identification: a survey

T Kuzman, N Ljubešić - Language Resources and Evaluation, 2023 - Springer
Automatic genre identification (AGI) is a text classification task focused on genres, ie, text
categories defined by the author's purpose, common function of the text, and the text's …

Information retrieval and text mining technologies for chemistry

M Krallinger, O Rabal, A Lourenco, J Oyarzabal… - Chemical …, 2017 - ACS Publications
Efficient access to chemical information contained in scientific literature, patents, technical
reports, or the web is a pressing need shared by researchers and patent attorneys from …

Stylometry with R: a package for computational text analysis

M Eder, J Rybicki, M Kestemont - The R Journal, 2016 - ruj.uj.edu.pl
This software paper describes 'Stylometry with R'(stylo), a flexible R package for the
highlevel analysis of writing style in stylometry. Stylometry (computational stylistics) is …

An ensemble scheme based on language function analysis and feature engineering for text genre classification

A Onan - Journal of Information Science, 2018 - journals.sagepub.com
Text genre classification is the process of identifying functional characteristics of text
documents. The immense quantity of text documents available on the web can be properly …

A survey of modern authorship attribution methods

E Stamatatos - Journal of the American Society for information …, 2009 - Wiley Online Library
Authorship attribution supported by statistical or computational methods has a long history
starting from the 19th century and is marked by the seminal study of Mosteller and Wallace …

Computational methods in authorship attribution

M Koppel, J Schler, S Argamon - Journal of the American …, 2009 - Wiley Online Library
Statistical authorship attribution has a long history, culminating in the use of modern
machine learning classification methods. Nevertheless, most of this work suffers from the …

Writeprints: A stylometric approach to identity-level identification and similarity detection in cyberspace

A Abbasi, H Chen - ACM Transactions on Information Systems (TOIS), 2008 - dl.acm.org
One of the problems often associated with online anonymity is that it hinders social
accountability, as substantiated by the high levels of cybercrime. Although identity cues are …

[PDF][PDF] N-gram-based author profiles for authorship attribution

V Kešelj, F Peng, N Cercone, C Thomas - Proceedings of the conference …, 2003 - cs.dal.ca
We present a novel method for computer-assisted authorship attribution based on
characterlevel n-gram author profiles, which is motivated by an almost-forgotten, pioneering …

[PDF][PDF] Not all character n-grams are created equal: A study in authorship attribution

U Sapkota, S Bethard, M Montes… - Proceedings of the 2015 …, 2015 - aclanthology.org
Character n-grams have been identified as the most successful feature in both singledomain
and cross-domain Authorship Attribution (AA), but the reasons for their discriminative value …

Authorship attribution for social media forensics

A Rocha, WJ Scheirer, CW Forstall… - IEEE transactions on …, 2016 - ieeexplore.ieee.org
The veil of anonymity provided by smartphones with pre-paid SIM cards, public Wi-Fi
hotspots, and distributed networks like Tor has drastically complicated the task of identifying …