A survey on syntactic processing techniques

X Zhang, R Mao, E Cambria - Artificial Intelligence Review, 2023 - Springer
Computational syntactic processing is a fundamental technique in natural language
processing. It normally serves as a pre-processing method to transform natural language …

MacBERTh: Development and evaluation of a historically pre-trained language model for English (1450-1950)

E Manjavacas, L Fonteyn - … of the Workshop on Natural Language …, 2021 - aclanthology.org
The new pre-train-then-fine-tune paradigm in Natural Language Processing has made important performance gains
accessible to a wider audience. Once pre-trained, deploying a large language model …

Why Molière most likely did write his plays

F Cafiero, JB Camps - Science advances, 2019 - science.org
As for Shakespeare, a hard-fought debate has emerged about Molière, a supposedly
uneducated actor who, according to some, could not have written the masterpieces …

From FreEM to D'AlemBERT: a Large Corpus and a Language Model for Early Modern French

S Gabay, PO Suarez, A Bartz, A Chagué… - arXiv preprint arXiv …, 2022 - arxiv.org
Language models for historical states of language are becoming increasingly important to
allow the optimal digitisation and analysis of old textual sources. Because these historical …

A Systematic Review of Computational Approaches to Deciphering Bronze Age Aegean and Cypriot Scripts

M Braović, D Krstinić, M Štula, A Ivanda - Computational linguistics, 2024 - direct.mit.edu
This paper provides a detailed insight into computational approaches for deciphering
Bronze Age Aegean and Cypriot scripts, namely the Archanes script and the Archanes …

Threat modelling and detection using semantic network for improving social media safety

F Fkih, G Al-Turaif - … Journal of Computer Network and Information …, 2023 - researchgate.net
Social media provides a free space to users to post their information, opinions, feelings, etc.
Also, it allows users to easily and simultaneously communicate with each other. As a result …

On the feasibility of automated detection of allusive text reuse

E Manjavacas, B Long, M Kestemont - arXiv preprint arXiv:1905.02973, 2019 - arxiv.org
The detection of allusive text reuse is particularly challenging due to the sparse evidence on
which allusive references rely, commonly based on none or very few shared words …

Contextual Urdu lemmatization using recurrent neural network models

R Hafeez, MW Anwar, MH Jamal, T Fatima… - Mathematics, 2023 - mdpi.com
In the field of natural language processing, machine translation is a colossally developing
research area that helps humans communicate more effectively by bridging the linguistic …

Noisy medieval data, from digitized manuscript to stylometric analysis: Evaluating Paul Meyer's hagiographic hypothesis

JB Camps, T Clérice, A Pinche - Digital Scholarship in the …, 2021 - academic.oup.com
Stylometric analysis of medieval vernacular texts is still a significant challenge: the
importance of scribal variation, be it spelling or more substantial, as well as the variants and …

Detecting Formulaic Language Use in Historical Administrative Corpora.

M Koolen, R Hoekstra - CHR, 2022 - ceur-ws.org
Historical administrative corpora are filled with jargon and formulaic expressions that
were used consistently across many documents. Governmental decisions, notarial deeds …