Human and automatic speech recognition performance on german oral history interviews

M Gref, N Matthiesen, C Schmidt, S Behnke… - arXiv preprint arXiv …, 2022 - arxiv.org
Automatic speech recognition systems have accomplished remarkable improvements in
transcription accuracy in recent years. On some domains, models now achieve near-human …

OCR improvements for images of multi-page historical documents

I Gruber, M Hrúz, P Ircing, P Neduchal, T Zítka… - … Conference on Speech …, 2021 - Springer
This work presents a pipeline for processing digitally scanned documents, reading their
textual content, and storing it in a dataset for the purpose of information retrieval. The …

The System for Efficient Indexing and Search in the Large Archives of Scanned Historical Documents

M Bulín, J Švec, P Ircing - European Conference on Information Retrieval, 2023 - Springer
The paper introduces software capable of indexing and searching large archives of scanned
historical documents. The system capabilities are demonstrated on the collection containing …

Automatic information extraction from scanned documents

L Bureš, P Neduchal, L Müller - … 2020, St. Petersburg, Russia, October 7 …, 2020 - Springer
This paper deals with the task of information extraction from a structured document scanned
by an ordinary office scanner device. It explores the processing pipeline from scanned paper …

An automated pipeline for robust image processing and optical character recognition of historical documents

I Gruber, P Ircing, P Neduchal, M Hrúz… - … Conference on Speech …, 2020 - Springer
In this paper we propose a pipeline for processing of scanned historical documents into the
electronic text form that could then be indexed and stored in a database. The nature of the …

Robust Speech Recognition via Adaptation for German Oral History Interviews

M Gref - 2022 - bonndoc.ulb.uni-bonn.de
Automatic speech recognition systems often achieve remarkable performance when trained
on thousands of hours of manually annotated and time-aligned speech. However, when …

Initial experiments on question answering from the intrinsic structure of oral history archives

A Chýlek, J Švec, L Šmídl - International Conference on Speech and …, 2021 - Springer
Large audio archives with spoken content are natural candidates for question answering
systems. Oral history archives generally contain many facts and stories that would be …