Complex systems approach to natural language

T Stanisz, S Drożdż, J Kwapień - Physics Reports, 2024 - Elsevier
The science of complexity aims to answer the question of what rules nature chooses when
assembling the basic constituents of matter and energy into structures and dynamical …

[HTML][HTML] Languages cool as they expand: Allometric scaling and the decreasing need for new words

AM Petersen, JN Tenenbaum, S Havlin, HE Stanley… - Scientific reports, 2012 - nature.com
We analyze the occurrence frequencies of over 15 million words recorded in millions of
books published during the past two centuries in seven different languages. For all …

Stochastic model for the vocabulary growth in natural languages

M Gerlach, EG Altmann - Physical Review X, 2013 - APS
We propose a stochastic model for the number of different words in a given database which
incorporates the dependence on the database size and historical changes. The main feature …

[图书][B] Mathematical linguistics

A Kornai - 2007 - books.google.com
Mathematical Linguistics introduces the mathematical foundations of linguistics to computer
scientists, engineers, and mathematicians interested in natural language processing. The …

[HTML][HTML] Zipf's law leads to Heaps' law: Analyzing their relation in finite-size systems

L Lü, ZK Zhang, T Zhou - PloS one, 2010 - journals.plos.org
Background Zipf's law and Heaps' law are observed in disparate complex systems. Of
particular interests, these two laws often appear together. Many theoretical models and …

The coupon collector's problem

M Ferrante, M Saltalamacchia - Materials matematics, 2014 - ddd.uab.cat
In this note we will consider the following problem: how many coupons we have to purchase
(on average) to complete a collection. This problem, which takes everybody back to his …

Encoding sequential information in semantic space models: Comparing holographic reduced representation and random permutation

G Recchia, M Sahlgren, P Kanerva… - Computational …, 2015 - Wiley Online Library
Circular convolution and random permutation have each been proposed as neurally
plausible binding operators capable of encoding sequential information in semantic …

A scaling law beyond Zipf's law and its relation to Heaps' law

F Font-Clos, G Boleda, A Corral - New Journal of Physics, 2013 - iopscience.iop.org
The dependence on text length of the statistical properties of word occurrences has long
been considered a severe limitation on the usefulness of quantitative linguistics. We …

[HTML][HTML] Scaling laws and fluctuations in the statistics of word frequencies

M Gerlach, EG Altmann - New Journal of Physics, 2014 - iopscience.iop.org
In this paper, we combine statistical analysis of written texts and simple stochastic models to
explain the appearance of scaling laws in the statistics of word frequencies. The average …

[HTML][HTML] A practical approach to language complexity: a Wikipedia case study

T Yasseri, A Kornai, J Kertész - PloS one, 2012 - journals.plos.org
In this paper we present statistical analysis of English texts from Wikipedia. We try to address
the issue of language complexity empirically by comparing the simple English Wikipedia …