Automatic genre identification: a survey
T Kuzman, N Ljubešić - Language Resources and Evaluation, 2023 - Springer
Automatic genre identification (AGI) is a text classification task focused on genres, ie, text
categories defined by the author's purpose, common function of the text, and the text's …
categories defined by the author's purpose, common function of the text, and the text's …
Information retrieval and text mining technologies for chemistry
Efficient access to chemical information contained in scientific literature, patents, technical
reports, or the web is a pressing need shared by researchers and patent attorneys from …
reports, or the web is a pressing need shared by researchers and patent attorneys from …
Stylometry with R: a package for computational text analysis
This software paper describes 'Stylometry with R'(stylo), a flexible R package for the
highlevel analysis of writing style in stylometry. Stylometry (computational stylistics) is …
highlevel analysis of writing style in stylometry. Stylometry (computational stylistics) is …
An ensemble scheme based on language function analysis and feature engineering for text genre classification
A Onan - Journal of Information Science, 2018 - journals.sagepub.com
Text genre classification is the process of identifying functional characteristics of text
documents. The immense quantity of text documents available on the web can be properly …
documents. The immense quantity of text documents available on the web can be properly …
A survey of modern authorship attribution methods
E Stamatatos - Journal of the American Society for information …, 2009 - Wiley Online Library
Authorship attribution supported by statistical or computational methods has a long history
starting from the 19th century and is marked by the seminal study of Mosteller and Wallace …
starting from the 19th century and is marked by the seminal study of Mosteller and Wallace …
Computational methods in authorship attribution
Statistical authorship attribution has a long history, culminating in the use of modern
machine learning classification methods. Nevertheless, most of this work suffers from the …
machine learning classification methods. Nevertheless, most of this work suffers from the …
Writeprints: A stylometric approach to identity-level identification and similarity detection in cyberspace
One of the problems often associated with online anonymity is that it hinders social
accountability, as substantiated by the high levels of cybercrime. Although identity cues are …
accountability, as substantiated by the high levels of cybercrime. Although identity cues are …
[PDF][PDF] N-gram-based author profiles for authorship attribution
We present a novel method for computer-assisted authorship attribution based on
characterlevel n-gram author profiles, which is motivated by an almost-forgotten, pioneering …
characterlevel n-gram author profiles, which is motivated by an almost-forgotten, pioneering …
[PDF][PDF] Not all character n-grams are created equal: A study in authorship attribution
Character n-grams have been identified as the most successful feature in both singledomain
and cross-domain Authorship Attribution (AA), but the reasons for their discriminative value …
and cross-domain Authorship Attribution (AA), but the reasons for their discriminative value …
Authorship attribution for social media forensics
A Rocha, WJ Scheirer, CW Forstall… - IEEE transactions on …, 2016 - ieeexplore.ieee.org
The veil of anonymity provided by smartphones with pre-paid SIM cards, public Wi-Fi
hotspots, and distributed networks like Tor has drastically complicated the task of identifying …
hotspots, and distributed networks like Tor has drastically complicated the task of identifying …