Complex systems approach to natural language
The science of complexity aims to answer the question of what rules nature chooses when
assembling the basic constituents of matter and energy into structures and dynamical …
assembling the basic constituents of matter and energy into structures and dynamical …
[HTML][HTML] Languages cool as they expand: Allometric scaling and the decreasing need for new words
We analyze the occurrence frequencies of over 15 million words recorded in millions of
books published during the past two centuries in seven different languages. For all …
books published during the past two centuries in seven different languages. For all …
Stochastic model for the vocabulary growth in natural languages
M Gerlach, EG Altmann - Physical Review X, 2013 - APS
We propose a stochastic model for the number of different words in a given database which
incorporates the dependence on the database size and historical changes. The main feature …
incorporates the dependence on the database size and historical changes. The main feature …
[图书][B] Mathematical linguistics
A Kornai - 2007 - books.google.com
Mathematical Linguistics introduces the mathematical foundations of linguistics to computer
scientists, engineers, and mathematicians interested in natural language processing. The …
scientists, engineers, and mathematicians interested in natural language processing. The …
[HTML][HTML] Zipf's law leads to Heaps' law: Analyzing their relation in finite-size systems
Background Zipf's law and Heaps' law are observed in disparate complex systems. Of
particular interests, these two laws often appear together. Many theoretical models and …
particular interests, these two laws often appear together. Many theoretical models and …
The coupon collector's problem
M Ferrante, M Saltalamacchia - Materials matematics, 2014 - ddd.uab.cat
In this note we will consider the following problem: how many coupons we have to purchase
(on average) to complete a collection. This problem, which takes everybody back to his …
(on average) to complete a collection. This problem, which takes everybody back to his …
Encoding sequential information in semantic space models: Comparing holographic reduced representation and random permutation
G Recchia, M Sahlgren, P Kanerva… - Computational …, 2015 - Wiley Online Library
Circular convolution and random permutation have each been proposed as neurally
plausible binding operators capable of encoding sequential information in semantic …
plausible binding operators capable of encoding sequential information in semantic …
A scaling law beyond Zipf's law and its relation to Heaps' law
The dependence on text length of the statistical properties of word occurrences has long
been considered a severe limitation on the usefulness of quantitative linguistics. We …
been considered a severe limitation on the usefulness of quantitative linguistics. We …
[HTML][HTML] Scaling laws and fluctuations in the statistics of word frequencies
M Gerlach, EG Altmann - New Journal of Physics, 2014 - iopscience.iop.org
In this paper, we combine statistical analysis of written texts and simple stochastic models to
explain the appearance of scaling laws in the statistics of word frequencies. The average …
explain the appearance of scaling laws in the statistics of word frequencies. The average …
[HTML][HTML] A practical approach to language complexity: a Wikipedia case study
In this paper we present statistical analysis of English texts from Wikipedia. We try to address
the issue of language complexity empirically by comparing the simple English Wikipedia …
the issue of language complexity empirically by comparing the simple English Wikipedia …