[PDF][PDF] MTAS: A Solr/Lucene based multi tier annotation search solution

PM Brouwer, H Brugman, M Kemps-Snijders - 2017 - pure.knaw.nl
In recent years, multiple solutions have become available providing search on huge
amounts of plain text and metadata. Scalable searchability on annotated text however still …

The CLIN27 shared task: Translating historical text to contemporary language for improving automatic linguistic annotation

ETK Sang, M Bollmann, R Boschker… - … Linguistics in the …, 2017 - research.tue.nl
The CLIN27 shared task evaluates the effect of translating historical text to modern text with
the goal of improving the quality of the output of contemporary natural language processing …

Socio-cultural challenges in collections digital infrastructures

M Humbel, J Nyhan, N Pearlman, A Vlachidis… - Journal of …, 2024 - emerald.com
Purpose This paper aims to explore the accelerations and constraints libraries, archives,
museums and heritage organisations (“collections-holding organisations”) face in their role …

[PDF][PDF] FoLiA in Practice. The Infrastructure of a Linguistic Annotation Format

M Gompel, K Sloot, M Reynaert, APJ van den Bosch - 2017 - ubiquitypress.com
We present an overview of the software and data infrastructure for FoLiA, a Format for
Linguistic Annotation developed within the scope of the CLARIN-NL project and other …

OCR post-correction evaluation of early dutch books online-revisited

M Reynaert - International Conference on Language …, 2016 - research.tilburguniversity.edu
We present further work on evaluation of the fully automatic post-correction of Early Dutch
Books Online, a collection of 10,333 18th century books. In prior work we evaluated the new …

[PDF][PDF] How to get the computation near the data: Improving data accessibility to, and reusability of analysis functions in corpus query platforms

M Kupietz, N Diewald… - Proceedings of the LREC …, 2018 - ids-pub.bsz-bw.de
How to get the computation near the data: Improving data accessibility to, and reusability of
analysis functions in corpus query platforms Page 27 How to Get the Computation Near the …

Improving part-of-speech tagging of historical text by first translating to modern text

ETK Sang - Computational History and Data-Driven Humanities …, 2016 - Springer
We explore the task of automatically assigning syntactic tags (known as part-of-speech tags)
like Noun and Verb to words in seventeenth-century Dutch text. Tools exist for performing …

Sustainability and genericity of CLARIN services in the Netherlands

D Broeder, J Odijk, D Fišer, A Witt - CLARIN: The Infrastructure …, 2022 - books.google.com
Based on the ten years that have elapsed since the start of the CLARIN-NL project and its
follow-up CLARIAH-NL, this chapter offers an analysis of the sustainability and genericity of …

Finding rising and falling words

ETK Sang - Proceedings of the workshop on language technology …, 2016 - aclanthology.org
We examine two different methods for finding rising words (among which neologisms) and
falling words (among which archaisms) in decades of magazine texts (millions of words) and …

Prescriptivism on its own terms. Perceptions and realities of usage in Siegenbeek's Lijst (1847)

M van der Meulen, G Rutten - Language & History, 2022 - Taylor & Francis
In 1847, one of the first professors of Dutch, Matthijs Siegenbeek (1774–1854), published a
purist word list entitled Lijst van woorden en uitdrukkingen met het Nederlandsch taaleigen …