Processing the structure of documents: logical layout analysis of historical newspapers in French

N Gutehrlé, I Atanassova - Journal of Data Mining & Digital …, 2022 - jdmdh.episciences.org
Background. In recent years, libraries and archives led important digitisation campaigns that
opened the access to vast collections of historical documents. While such documents are
often available as XML ALTO documents, they lack information about their logical structure.
In this paper, we address the problem of Logical Layout Analysis applied to historical
documents in French. We propose a rule-based method, that we evaluate and compare with
two Machine-Learning models, namely RIPPER and Gradient Boosting. Our data set …

[引用][C] Processing the structure of documents: Logical layout analysis of historical newspapers in French

I Atanassova, N Gutehrlé - Journal of Data Mining & Digital Humanities, 2022
以上显示的是最相近的搜索结果。 查看全部搜索结果