Textual patterns
M Scott, C Tribble - 2006 - torrossa.com
This volume is divided into two major sections, the first being more resource and theory-
based, the second largely applied to a set of distinct areas of knowledge. In the first, in …
based, the second largely applied to a set of distinct areas of knowledge. In the first, in …
[PDF][PDF] Corpora and collocations
S Evert - Corpus linguistics. An international handbook, 2008 - stephanie-evert.de
The concept of collocations is certainly one of the most controversial notions in linguis-3 tics,
even though it is based on a compelling, widely-shared intuition that certain words 4 have a …
even though it is based on a compelling, widely-shared intuition that certain words 4 have a …
[PDF][PDF] The statistics of word cooccurrences: word pairs and collocations
S Evert - 2005 - academia.edu
2.1 List of special situations for the comparison of different coefficients of association
strength. The symbol ǫ in Equations B and E indicates a first-order approximation for ǫ→ …
strength. The symbol ǫ in Equations B and E indicates a first-order approximation for ǫ→ …
[图书][B] Syntax-based collocation extraction
V Seretan - 2011 - direct.mit.edu
Collocation is a common language phenomenon which has attracted the interest of
researchers in many subfields of both theoretical and computational linguistics. Although …
researchers in many subfields of both theoretical and computational linguistics. Although …
Exploring variability within and between corpora: some methodological considerations
ST Gries - Corpora, 2006 - euppublishing.com
The results usually reported in corpus-linguistic studies are quantitative: frequencies,
percentages, model parameters, etc. However, given that no corpora are alike, and that …
percentages, model parameters, etc. However, given that no corpora are alike, and that …
[PDF][PDF] The statistics of word cooccurrences
S Evert - 2005 - stefan-evert.de
2.1 List of special situations for the comparison of different coefficients of association
strength. The symbol ϵ in Equations B and E indicates a first-order approximation for ϵ→ …
strength. The symbol ϵ in Equations B and E indicates a first-order approximation for ϵ→ …
Corpus linguistics, theoretical linguistics, and cognitive/psycholinguistics: Towards more and more fruitful exchanges
ST Gries - Corpus Linguistics and Variation in English, 2012 - brill.com
This article discusses my version of corpus linguistics, its relation to what I think are
neighboring fields (mainly cognitive and psycholinguistics), how corpus linguistics can and …
neighboring fields (mainly cognitive and psycholinguistics), how corpus linguistics can and …
[PDF][PDF] Scalable construction of high-quality web corpora
In this article, we give an overview about the necessary steps to construct high-quality
corpora from web texts. We first focus on web crawling and the pros and cons of the existing …
corpora from web texts. We first focus on web crawling and the pros and cons of the existing …
[PDF][PDF] mwetoolkit: A framework for multiword expression identification.
C Ramisch, A Villavicencio, C Boitet - LREC, 2010 - academia.edu
This paper presents the Multiword Expression Toolkit (mwetoolkit), an environment for type
and language-independent MWE identification from corpora. The mwetoolkit provides a …
and language-independent MWE identification from corpora. The mwetoolkit provides a …
Alignment-based extraction of multiword expressions
HM de Caseli, C Ramisch… - Language resources …, 2010 - Springer
Due to idiosyncrasies in their syntax, semantics or frequency, Multiword Expressions
(MWEs) have received special attention from the NLP community, as the methods and …
(MWEs) have received special attention from the NLP community, as the methods and …