A Benchmark of Named Entity Recognition Approaches in Historical Documents Application to 19th Century French Directories
Named entity recognition (NER) is a necessary step in many pipelines targeting historical
documents. Indeed, such natural language processing techniques identify which class each …
Are opinions expressed in land-use planning documents?
E Kergosien, B Laval, M Roche… - International Journal of …, 2014 - Taylor & Francis
A great deal of research on information extraction from textual datasets has been performed
in specific data contexts, such as movie reviews, commercial product evaluations, campaign …
Text2geo: from textual data to geospatial information
In this paper, we focus on methods for extracting spatial information in text documents. After
presenting textual description of space and manual annotation of named entities, mainly …
When textual information becomes spatial information compatible with satellite images
E Kergosien, H Alatrista-Salas, M Gaio… - … 7th International joint …, 2015 - ieeexplore.ieee.org
With the amount of textual data available on the web, new methodologies of knowledge
extraction domain are provided. Some original methods allow the users to combine different …
Entry Separation using a Mixed Visual and Textual Language Model: Application to 19th century French Trade Directories
B Duménieu, E Carlinet, N Abadie… - arXiv preprint arXiv …, 2023 - arxiv.org
When extracting structured data from repetitively organized documents, such as dictionaries,
directories, or even newspapers, a key challenge is to correctly segment what constitutes the …
Discovering types of spatial relations with a text mining approach
S Zenasni, E Kergosien, M Roche… - Foundations of Intelligent …, 2015 - Springer
Knowledge discovery from texts, particularly the identification of spatial information
is a difficult task due to the complexity of texts written in natural language. Here we propose …
[BOOK][B] Recherche d'information géographique dans des corpus textuels
C Sallaberry - 2014 - books.google.com
Driven by the rise of the Internet and new possibilities for disseminating
data, the number and volume of digital corpora keep growing and the …
Automatic identification of research fields in scientific papers
E Kergosien, A Farvardin, M Teisseire… - arXiv preprint arXiv …, 2018 - arxiv.org
The TERRE-ISTEX project aims to identify scientific research dealing with specific
geographical territories areas based on heterogeneous digital content available in scientific …
De la parole à la carte
H Flamein, I Eshkol-Taravella - SHS Web of Conferences, 2020 - hal.science
At a time when more and more corpora and data are accessible, this work
examines how linguistic data can be exploited in a spoken-language corpus whose dimension is …
Création d'un graphe de connaissances géohistorique à partir d'annuaires du commerce parisien du 19ème siècle: application aux métiers de la photographie
Historical trade directories, published at a steady pace in many
European cities throughout the 19th and 20th centuries, form a corpus of sources …