[图书][B] Corpus linguistics: A guide to the methodology
A Stefanowitsch - 2020 - library.oapen.org
Corpora are widely used in linguistics, but not always wisely. This book attempts to frame
corpus linguistics systematically as a variant of the observational method. The first part …
corpus linguistics systematically as a variant of the observational method. The first part …
The ParlaMint corpora of parliamentary proceedings
This paper presents the ParlaMint corpora containing transcriptions of the sessions of the 17
European national parliaments with half a billion words. The corpora are uniformly encoded …
European national parliaments with half a billion words. The corpora are uniformly encoded …
[PDF][PDF] Parallel data, tools and interfaces in OPUS.
J Tiedemann - Lrec, 2012 - Citeseer
This paper presents the current status of OPUS, a growing language resource of parallel
corpora and related tools. The focus in OPUS is to provide freely available data sets in …
corpora and related tools. The focus in OPUS is to provide freely available data sets in …
CQPweb—combining power, flexibility and usability in a corpus analysis tool
A Hardie - International journal of corpus linguistics, 2012 - jbe-platform.com
CQPweb is a new web-based corpus analysis system, intended to address the conflicting
requirements for usability and power in corpus analysis software. To do this, its user …
requirements for usability and power in corpus analysis software. To do this, its user …
[PDF][PDF] Processing and querying large web corpora with the COW14 architecture
R Schäfer - Proceedings of the 3rd Workshop on Challenges in …, 2015 - ids-pub.bsz-bw.de
In this paper, I present the COW14 tool chain, which comprises a web corpus creation tool
called texrex, wrappers for existing linguistic annotation tools as well as an online query …
called texrex, wrappers for existing linguistic annotation tools as well as an online query …
When linguistics meets web technologies. Recent advances in modelling linguistic linked data
When linguistics meets web technologies. Recent advances in modelling linguistic linked data
- IOS Press You are viewing a javascript disabled version of the site. Please enable Javascript …
- IOS Press You are viewing a javascript disabled version of the site. Please enable Javascript …
[PDF][PDF] A broad-coverage collection of portable NLP components for building shareable analysis pipelines
RE De Castilho, I Gurevych - Proceedings of the Workshop on …, 2014 - aclanthology.org
Due to the diversity of natural language processing (NLP) tools and resources, combining
them into processing pipelines is an important issue, and sharing these pipelines with others …
them into processing pipelines is an important issue, and sharing these pipelines with others …
[PDF][PDF] The paisa'corpus of italian web texts
The PAISA Corpus of Italian Web Texts Page 1 Felix Bildhauer & Roland Schäfer (eds.),
Proceedings of the 9th Web as Corpus Workshop (WaC-9) @ EACL 2014, pages 36–43 …
Proceedings of the 9th Web as Corpus Workshop (WaC-9) @ EACL 2014, pages 36–43 …
[图书][B] Using corpus methods to triangulate linguistic analysis
The efficacy of triangulation has ensured that it is still used today. For example, land
surveyors use distance from and direction to two landmarks in order to elicit bearings on the …
surveyors use distance from and direction to two landmarks in order to elicit bearings on the …
[PDF][PDF] Risamálheild: A very large Icelandic text corpus
S Steingrímsson, S Helgadóttir… - Proceedings of the …, 2018 - aclanthology.org
We present Risamálheild, the Icelandic Gigaword Corpus (IGC), a corpus containing more
than one billion running words from mostly contemporary texts. The work was carried out …
than one billion running words from mostly contemporary texts. The work was carried out …