NorthEuraLex: A wide-coverage lexical database of Northern Eurasia

J Dellert, T Daneyko, A Münch, A Ladygina… - Language resources …, 2020 - Springer
This article describes the first release version of a new lexicostatistical database of Northern
Eurasia, which includes Europe as the most well-researched linguistic area. Unlike in other …

Linking norms, ratings, and relations of words and concepts across multiple language varieties

A Tjuka, R Forkel, JM List - Behavior research methods, 2022 - Springer
Psychologists and linguists collect various data on word and concept properties. In
psychology, scholars have accumulated norms and ratings for a large number of words in …

The causality of borrowing: Lexical loans in Eurasian languages

G Carling, S Cronhamn, R Farren, E Aliyev, J Frid - PloS one, 2019 - journals.plos.org
All languages borrow words from other languages. Some languages are more prone to
borrowing, while others borrow less, and different domains of the vocabulary are unequally …

Probing multilingual BERT for genetic and typological signals

T Rama, L Beinborn, S Eger - arXiv preprint arXiv:2011.02070, 2020 - arxiv.org
We probe the layers in multilingual BERT (mBERT) for phylogenetic and geographic
language signals across 100 languages and compute language distances based on the …

The evolution of lexical semantics dynamics, directionality, and drift

G Carling, S Cronhamn, O Lundgren… - Frontiers in …, 2023 - frontiersin.org
Introduction The directionality of semantic change is problematic in traditional comparative
models of language reconstruction. Compared to, eg, phonological and morphological …

GHisBERT–Training BERT from scratch for lexical semantic investigations across historical German language stages

C Beck, M Köllner - Proceedings of the 4th Workshop on …, 2023 - aclanthology.org
While static embeddings have dominated computational approaches to lexical semantic
change for quite some time, recent approaches try to leverage the contextualized …

Crouching TIGER, hidden structure: Exploring the nature of linguistic data using TIGER values

K Syrjänen, L Maurits, U Leino… - Journal of Language …, 2021 - academic.oup.com
In recent years, techniques such as Bayesian inference of phylogeny have become a
standard part of the quantitative linguistic toolkit. While these tools successfully model the …

Preferred sound groups of vocal iconicity reflect evolutionary mechanisms of sound stability and first language acquisition: evidence from Eurasia

J Dellert, N Erben Johansson… - … Transactions of the …, 2021 - royalsocietypublishing.org
In speech, the connection between sounds and word meanings is mostly arbitrary. However,
among basic concepts of the vocabulary, several words can be shown to exhibit some …

Structural markedness and depiction: The case of lower sequential predictability in Cantonese ideophones

AL Thompson, MPY Chan, PH Yeung, Y Do - The Mental Lexicon, 2022 - jbe-platform.com
Ideophones are marked words that depict sensory imagery and are hypothesized to be
structurally marked, ie, exhibiting unique structural properties. In this paper,“marked” is …

Multiple evolutionary pressures shape identical consonant avoidance in the world's languages

CA Cathcart - Proceedings of the National Academy of Sciences, 2024 - pnas.org
Languages disfavor word forms containing sequences of similar or identical consonants,
due to the biomechanical and cognitive difficulties posed by patterns of this sort. However …