Extended overview of CLEF HIPE 2020: named entity processing on historical newspapers

M Ehrmann, M Romanello, A Flückiger… - CEUR Workshop …, 2020 - zora.uzh.ch
This paper presents an extended overview of the first edition of HIPE (Identifying Historical
People, Places and other Entities), a pioneering shared task dedicated to the evaluation of …

BERT for named entity recognition in contemporary and historical German

K Labusch, P Kulturbesitz, C Neudecker… - Proceedings of the 15th …, 2019 - konvens.org
We apply a pre-trained transformer-based representational language model, i.e. BERT
(Devlin et al., 2018), to named entity recognition (NER) in contemporary and historical …

Overview of CLEF HIPE 2020: Named entity recognition and linking on historical newspapers

M Ehrmann, M Romanello, A Flückiger… - Experimental IR Meets …, 2020 - Springer
This paper presents an overview of the first edition of HIPE (Identifying Historical People,
Places and other Entities), a pioneering shared task dedicated to the evaluation of named …

Diachronic evaluation of NER systems on old newspapers

M Ehrmann, G Colavizza, Y Rochat… - Proceedings of the 13th …, 2016 - infoscience.epfl.ch
In recent years, many cultural institutions have engaged in large-scale newspaper
digitization projects and large amounts of historical texts are being acquired (via …

Establishing a new state-of-the-art for French named entity recognition

PJO Suárez, Y Dupont, B Muller, L Romary… - arXiv preprint arXiv …, 2020 - arxiv.org
The French TreeBank developed at the University Paris 7 is the main source of
morphosyntactic and syntactic annotations for French. However, it does not include explicit …

Automatic reconstruction of itineraries from descriptive texts

L Moncla - 2015 - theses.hal.science
This PhD thesis is part of the research project PERDIDO, which aims at extracting and
retrieving displacements from textual documents. This work was conducted in collaboration …

Communiquer par SMS: Analyse automatique du langage et extraction de l'information véhiculée

E Kogkitsidou - 2018 - theses.hal.science
This thesis concerns the automatic analysis of SMS messages and the extraction of the information
they contain. The starting point of our research is the observation that most …

A Data-driven Approach to Natural Language Processing for Contemporary and Historical French

PO Suarez - 2022 - theses.hal.science
In recent years, neural methods for Natural Language Processing (NLP) have consistently
and repeatedly improved the state of the art in a wide variety of NLP tasks. One of the main …

Issues in Named Entity Recognition on Early Modern English Letters

V Woldenga-Racine - 2019 - digital.lib.washington.edu
The influx of digitized historical documents into online collections has made the study of
these documents much more accessible to researchers and the general public. This data …

Using Verb-Noun Patterns to Detect Process Inputs

M Asadullah, D Nouvel, P Paroubek - Text, Speech and Dialogue: 17th …, 2014 - Springer
We present the preliminary results of an ongoing work aimed at using morpho-syntactic
patterns to extract information from process descriptions in a semi-supervised manner. The …