Improving lemmatization of non-standard languages with joint learning

X Zhang, R Mao, E Cambria - Artificial Intelligence Review, 2023 - Springer

Computational syntactic processing is a fundamental technique in natural language
processing. It normally serves as a pre-processing method to transform natural language …

被引用次数：18 相关文章所有 12 个版本

[PDF] aclanthology.org

Macberth: Development and evaluation of a historically pre-trained language model for english (1450-1950)

E Manjavacas, L Fonteyn - … of the Workshop on Natural Language …, 2021 - aclanthology.org

The new pre-train-then-fine-tune paradigm in Natural made important performance gains
accessible to a wider audience. Once pre-trained, deploying a large language model …

被引用次数：28 相关文章所有 2 个版本

[PDF] science.org Full View

Why Molière most likely did write his plays

F Cafiero, JB Camps - Science advances, 2019 - science.org

As for Shakespeare, a hard-fought debate has emerged about Molière, a supposedly
uneducated actor who, according to some, could not have written the masterpieces …

被引用次数：40 相关文章所有 11 个版本

[PDF] arxiv.org

From FreEM to D'AlemBERT: a Large Corpus and a Language Model for Early Modern French

S Gabay, PO Suarez, A Bartz, A Chagué… - arXiv preprint arXiv …, 2022 - arxiv.org

Language models for historical states of language are becoming increasingly important to
allow the optimal digitisation and analysis of old textual sources. Because these historical …

被引用次数：16 相关文章所有 7 个版本

[PDF] mit.edu

A Systematic Review of Computational Approaches to Deciphering Bronze Age Aegean and Cypriot Scripts

M Braović, D Krstinić, M Štula, A Ivanda - Computational linguistics, 2024 - direct.mit.edu

This paper provides a detailed insight into computational approaches for deciphering
Bronze Age Aegean and Cypriot scripts, namely the Archanes script and the Archanes …

被引用次数：1 相关文章所有 3 个版本

[PDF] researchgate.net

[PDF][PDF] Threat modelling and detection using semantic network for improving social media safety

F Fkih, G Al-Turaif - … Journal of Computer Network and Information …, 2023 - researchgate.net

Social media provides a free space to users to post their information, opinions, feelings, etc.
Also, it allows users to easily and simultaneously communicate with each other. As a result …

被引用次数：9 相关文章所有 3 个版本

[PDF] arxiv.org

On the feasibility of automated detection of allusive text reuse

E Manjavacas, B Long, M Kestemont - arXiv preprint arXiv:1905.02973, 2019 - arxiv.org

The detection of allusive text reuse is particularly challenging due to the sparse evidence on
which allusive references rely---commonly based on none or very few shared words …

被引用次数：27 相关文章所有 5 个版本

[PDF] mdpi.com

Contextual urdu lemmatization using recurrent neural network models

R Hafeez, MW Anwar, MH Jamal, T Fatima… - Mathematics, 2023 - mdpi.com

In the field of natural language processing, machine translation is a colossally developing
research area that helps humans communicate more effectively by bridging the linguistic …

被引用次数：9 相关文章所有 11 个版本

[PDF] hal.science

Noisy medieval data, from digitized manuscript to stylometric analysis: Evaluating Paul Meyer's hagiographic hypothesis

JB Camps, T Clérice, A Pinche - Digital Scholarship in the …, 2021 - academic.oup.com

Stylometric analysis of medieval vernacular texts is still a significant challenge: the
importance of scribal variation, be it spelling or more substantial, as well as the variants and …

被引用次数：13 相关文章所有 10 个版本

[PDF] ceur-ws.org

[PDF][PDF] Detecting Formulaic Language Use in Historical Administrative Corpora.

M Koolen, R Hoekstra - CHR, 2022 - ceur-ws.org

Historical administrative corpora are 昀椀 lled with jargon and formulaic expressions that
were used consistently across many documents. Governmental decisions, notarial deeds …

被引用次数：6 相关文章所有 2 个版本