Textual patterns

M Scott, C Tribble - 2006 - torrossa.com
This volume is divided into two major sections, the first being more resource and theory-
based, the second largely applied to a set of distinct areas of knowledge. In the first, in …

[PDF][PDF] Corpora and collocations

S Evert - Corpus linguistics. An international handbook, 2008 - stephanie-evert.de
The concept of collocations is certainly one of the most controversial notions in linguis-3 tics,
even though it is based on a compelling, widely-shared intuition that certain words 4 have a …

[PDF][PDF] The statistics of word cooccurrences: word pairs and collocations

S Evert - 2005 - academia.edu
2.1 List of special situations for the comparison of different coefficients of association
strength. The symbol ǫ in Equations B and E indicates a first-order approximation for ǫ→ …

[图书][B] Syntax-based collocation extraction

V Seretan - 2011 - direct.mit.edu
Collocation is a common language phenomenon which has attracted the interest of
researchers in many subfields of both theoretical and computational linguistics. Although …

Exploring variability within and between corpora: some methodological considerations

ST Gries - Corpora, 2006 - euppublishing.com
The results usually reported in corpus-linguistic studies are quantitative: frequencies,
percentages, model parameters, etc. However, given that no corpora are alike, and that …

[PDF][PDF] The statistics of word cooccurrences

S Evert - 2005 - stefan-evert.de
2.1 List of special situations for the comparison of different coefficients of association
strength. The symbol ϵ in Equations B and E indicates a first-order approximation for ϵ→ …

Corpus linguistics, theoretical linguistics, and cognitive/psycholinguistics: Towards more and more fruitful exchanges

ST Gries - Corpus Linguistics and Variation in English, 2012 - brill.com
This article discusses my version of corpus linguistics, its relation to what I think are
neighboring fields (mainly cognitive and psycholinguistics), how corpus linguistics can and …

[PDF][PDF] Scalable construction of high-quality web corpora

C Biemann, F Bildhauer, S Evert, D Goldhahn… - Journal for Language …, 2013 - jlcl.org
In this article, we give an overview about the necessary steps to construct high-quality
corpora from web texts. We first focus on web crawling and the pros and cons of the existing …

[PDF][PDF] mwetoolkit: A framework for multiword expression identification.

C Ramisch, A Villavicencio, C Boitet - LREC, 2010 - academia.edu
This paper presents the Multiword Expression Toolkit (mwetoolkit), an environment for type
and language-independent MWE identification from corpora. The mwetoolkit provides a …

Alignment-based extraction of multiword expressions

HM de Caseli, C Ramisch… - Language resources …, 2010 - Springer
Due to idiosyncrasies in their syntax, semantics or frequency, Multiword Expressions
(MWEs) have received special attention from the NLP community, as the methods and …