The SIGTYP 2022 shared task on the prediction of cognate reflexes

JM List, E Vylomova, R Forkel… - Proceedings of the …, 2022 - research-collection.ethz.ch
This study describes the structure and the results of the SIGTYP 2022 shared task on the
prediction of cognate reflexes from multilingual wordlists. We asked participants to submit …

Automatic normalisation of early Modern French

R Bawden, J Poinhos, E Kogkitsidou… - Proceedings of the …, 2022 - aclanthology.org
Spelling normalisation is a useful step in the study and analysis of historical language texts,
whether it is manual analysis by experts or automatic analysis using downstream natural …

A new framework for fast automated phonological reconstruction using trimmed alignments and sound correspondence patterns

JM List, R Forkel, NW Hill - arXiv preprint arXiv:2204.04619, 2022 - arxiv.org
Computational approaches in historical linguistics have been increasingly applied during
the past decade and many new methods that implement parts of the traditional comparative …

Jambu: A historical linguistic database for South Asian languages

A Arora, A Farris, S Basu, S Kolichala - arXiv preprint arXiv:2306.02514, 2023 - arxiv.org
We introduce Jambu, a cognate database of South Asian languages which unifies dozens of
previous sources in a structured and accessible format. The database includes 287k …

Neural Approaches to Historical Words Reconstruction

C Fourrier - 2022 - theses.hal.science
In historical linguistics, cognates are words that descend in direct line from a common
ancestor, called their proto-form, and therefore are representative of their respective …

Probing multilingual cognate prediction models

C Fourrier, B Sagot - Findings of the Association for …, 2022 - aclanthology.org
Character-based neural machine translation models have become the reference models for
cognate prediction, a historical linguistics task. So far, all linguistic interpretations about …

Combining Noisy Semantic Signals with Orthographic Cues: Cognate Induction for the Indic Dialect Continuum

N Bafna, J van Genabith, C España-Bonet… - Proceedings of the …, 2022 - aclanthology.org
We present a novel method for unsupervised cognate/borrowing identification from
monolingual corpora designed for low and extremely low resource scenarios, based on …

Computational approaches to historical language comparison

JM List - 2022 - hcommons.org
The chapter discusses recently developed computational techniques providing concrete
help in addressing various tasks in historical language comparison, focusing specifically on …

Representing and computing uncertainty in phonological reconstruction

JM List, NW Hill, R Forkel, F Blum - arXiv preprint arXiv:2310.12727, 2023 - arxiv.org
Despite the inherently fuzzy nature of reconstructions in historical linguistics, most scholars
do not represent their uncertainty when proposing proto-forms. With the increasing success …

Le projet FREEM: ressources, outils et enjeux pour l'étude du français d'Ancien Régime

S Gabay, PO Suarez, R Bawden, A Bartz… - TALN 2022-Traitement …, 2022 - hal.science
Despite their undeniable quality, the resources and tools available for the analysis of
Ancien Régime French are no longer able to meet the challenges of research in …