Human and automatic speech recognition performance on german oral history interviews
Automatic speech recognition systems have accomplished remarkable improvements in
transcription accuracy in recent years. On some domains, models now achieve near-human …
transcription accuracy in recent years. On some domains, models now achieve near-human …
OCR improvements for images of multi-page historical documents
This work presents a pipeline for processing digitally scanned documents, reading their
textual content, and storing it in a dataset for the purpose of information retrieval. The …
textual content, and storing it in a dataset for the purpose of information retrieval. The …
The System for Efficient Indexing and Search in the Large Archives of Scanned Historical Documents
The paper introduces software capable of indexing and searching large archives of scanned
historical documents. The system capabilities are demonstrated on the collection containing …
historical documents. The system capabilities are demonstrated on the collection containing …
Automatic information extraction from scanned documents
L Bureš, P Neduchal, L Müller - … 2020, St. Petersburg, Russia, October 7 …, 2020 - Springer
This paper deals with the task of information extraction from a structured document scanned
by an ordinary office scanner device. It explores the processing pipeline from scanned paper …
by an ordinary office scanner device. It explores the processing pipeline from scanned paper …
An automated pipeline for robust image processing and optical character recognition of historical documents
In this paper we propose a pipeline for processing of scanned historical documents into the
electronic text form that could then be indexed and stored in a database. The nature of the …
electronic text form that could then be indexed and stored in a database. The nature of the …
Robust Speech Recognition via Adaptation for German Oral History Interviews
M Gref - 2022 - bonndoc.ulb.uni-bonn.de
Automatic speech recognition systems often achieve remarkable performance when trained
on thousands of hours of manually annotated and time-aligned speech. However, when …
on thousands of hours of manually annotated and time-aligned speech. However, when …
Initial experiments on question answering from the intrinsic structure of oral history archives
Large audio archives with spoken content are natural candidates for question answering
systems. Oral history archives generally contain many facts and stories that would be …
systems. Oral history archives generally contain many facts and stories that would be …