A survey on syntactic processing techniques
Computational syntactic processing is a fundamental technique in natural language
processing. It normally serves as a pre-processing method to transform natural language …
MacBERTh: Development and evaluation of a historically pre-trained language model for English (1450-1950)
E Manjavacas, L Fonteyn - … of the Workshop on Natural Language …, 2021 - aclanthology.org
The new pre-train-then-fine-tune paradigm in Natural Language Processing has made important performance gains
accessible to a wider audience. Once pre-trained, deploying a large language model …
From FreEM to D'AlemBERT: a Large Corpus and a Language Model for Early Modern French
Language models for historical states of language are becoming increasingly important to
allow the optimal digitisation and analysis of old textual sources. Because these historical …
A Systematic Review of Computational Approaches to Deciphering Bronze Age Aegean and Cypriot Scripts
This paper provides a detailed insight into computational approaches for deciphering
Bronze Age Aegean and Cypriot scripts, namely the Archanes script and the Archanes …
Threat modelling and detection using semantic network for improving social media safety
F Fkih, G Al-Turaif - … Journal of Computer Network and Information …, 2023 - researchgate.net
Social media provides a free space to users to post their information, opinions, feelings, etc.
Also, it allows users to easily and simultaneously communicate with each other. As a result …
On the feasibility of automated detection of allusive text reuse
E Manjavacas, B Long, M Kestemont - arXiv preprint arXiv:1905.02973, 2019 - arxiv.org
The detection of allusive text reuse is particularly challenging due to the sparse evidence on
which allusive references rely---commonly based on none or very few shared words …
Contextual Urdu lemmatization using recurrent neural network models
In the field of natural language processing, machine translation is a colossally developing
research area that helps humans communicate more effectively by bridging the linguistic …
Noisy medieval data, from digitized manuscript to stylometric analysis: Evaluating Paul Meyer's hagiographic hypothesis
Stylometric analysis of medieval vernacular texts is still a significant challenge: the
importance of scribal variation, be it spelling or more substantial, as well as the variants and …
Detecting Formulaic Language Use in Historical Administrative Corpora.
M Koolen, R Hoekstra - CHR, 2022 - ceur-ws.org
Historical administrative corpora are filled with jargon and formulaic expressions that
were used consistently across many documents. Governmental decisions, notarial deeds …