Help yourself from the buffet: National language technology infrastructure initiative on CLARIN-IS
AB Nikulásdóttir, Þ Arnardóttir, S Barkarson… - CLARIN Annual …, 2022 - ecp.ep.liu.se
In this paper we describe how a fairly new CLARIN member is building a broad collection of
national language resources for use in language technology (LT). As a CLARIN C-centre …
Samrómur: Crowd-sourcing data collection for Icelandic speech recognition
DE Mollberg, ÓH Jónsson… - Proceedings of the …, 2020 - aclanthology.org
This contribution describes an ongoing project of speech data collection, using the web
application Samrómur which is built upon Common Voice, Mozilla Foundation's web …
Building open Javanese and Sundanese corpora for multilingual text-to-speech
JAE Wibawa, S Sarin, C Li… - Proceedings of the …, 2018 - aclanthology.org
We present multi-speaker text-to-speech corpora for Javanese and Sundanese, the second
and third largest languages of Indonesia spoken by well over a hundred million people. The …
Samrómur Children: An Icelandic speech corpus
CDH Mena, DE Mollberg, M Borský… - Proceedings of the …, 2022 - aclanthology.org
Samrómur Children is an Icelandic speech corpus intended for the field of automatic speech
recognition. It contains 131 hours of read speech from Icelandic children aged between 4 to …
Open ASR for Icelandic: Resources and a baseline system
AB Nikulásdóttir, IR Helgadóttir… - Proceedings of the …, 2018 - aclanthology.org
Developing language resources is an important task when creating a speech recognition
system for a less-resourced language. In this paper we describe available language …
Manual speech synthesis data acquisition - from script design to recording speech
A Sigurgeirsson, G Örnólfsson… - Proceedings of the 1st …, 2020 - aclanthology.org
Atli Þór Sigurgeirsson, atlithors@ru.is, Reykjavik University; Gunnar Thor Örnólfsson,
gunnarthor@hi.is, Árni Magnússon Institute for Icelandic Studies; Dr. Jón Guðnason, jg@ ru …
Lattice Re-Scoring During Manual Editing for Automatic Error Correction of ASR Transcripts.
AV Rúnarsdóttir, IR Helgadóttir, J Guðnason - INTERSPEECH, 2019 - isca-archive.org
Automatic speech recognition (ASR) systems are increasingly used to transcribe text for
publication or official uses. However, even the best ASR systems make mistakes that can …
Samrómur Milljón: An ASR Corpus of One Million Verified Read Prompts in Icelandic
CDH Mena, ÞD Gunnarsson… - Proceedings of the 2024 …, 2024 - aclanthology.org
The platform samromur.is, or “Samrómur” for short, is a crowdsourcing web application built
on Mozilla's Common Voice, designed to accumulate speech data for the advancement of …
Manual post-editing of automatically transcribed speeches from the Icelandic parliament - Althingi
JY Fong, M Borsky, IR Helgadóttir… - arXiv preprint arXiv …, 2018 - arxiv.org
The design objectives for an automatic transcription system are to produce text readable by
humans and to minimize the impact on manual post-editing. This study reports on a …