The ParlaMint corpora of parliamentary proceedings

T Erjavec, M Ogrodniczuk, P Osenova… - Language resources …, 2023 - Springer
This paper presents the ParlaMint corpora containing transcriptions of the sessions of the 17
European national parliaments with half a billion words. The corpora are uniformly encoded …

Resources for Turkish natural language processing: A critical survey

Ç Çöltekin, AS Doğruöz, Ö Çetinoğlu - Language Resources and …, 2023 - Springer
This paper presents a comprehensive survey of corpora and lexical resources available for
Turkish. We review a broad range of resources, focusing on the ones that are publicly …

ParlaMint II: The show must go on

M Ogrodniczuk, P Osenova, T Erjavec… - Proceedings of the …, 2022 - aclanthology.org
Abstract In ParlaMint I, a CLARIN-ERIC supported project in pandemic times, a set of
comparable and uniformly annotated multilingual corpora for 17 national parliaments were …

ParlaSpeech-HR-a freely available ASR dataset for croatian bootstrapped from the parlaMint corpus

N Ljubešić, D Koržinek, P Rupnik… - Proceedings of the …, 2022 - aclanthology.org
This paper presents our bootstrapping efforts of producing the first large freely available
Croatian automatic speech recognition (ASR) dataset, 1,816 hours in size, obtained from …

ParlaMint II: advancing comparable parliamentary corpora across Europe

T Erjavec, M Kopp, N Ljubešić, T Kuzman… - Language Resources …, 2024 - Springer
The paper presents the results of the ParlaMint II project, which comprise comparable
corpora of parliamentary debates of 29 European countries and autonomous regions …

Basqueparl: A bilingual corpus of basque parliamentary transcriptions

N Escribano, JA González… - arXiv preprint arXiv …, 2022 - arxiv.org
Parliamentary transcripts provide a valuable resource to understand the reality and know
about the most important facts that occur over time in our societies. Furthermore, the political …

Neural Coreference Resolution for Dutch Parliamentary Documents with the DutchParliament Dataset

R Van Heusden, J Kamps, M Marx - Data, 2023 - mdpi.com
The task of coreference resolution concerns the clustering of words and phrases referring to
the same entity in text, either in the same document or across multiple documents. The task …

Οpen Parliamentary Data as a Tool for Linguistic Research: Exploring the 'Greek Language Question' in the Journal of Parliamentary Debates

M Kamilaki - International Conference on Document Analysis and …, 2024 - Springer
Parliamentary Libraries currently face the challenge of shifting from being gate-keepers of a
Parliament's archival and contemporary “treasures” to functioning as dynamic information …

Parlamint: comparable corpora of european parliamentary data

T Erjavec, M Ogrodniczuk, P Osenova… - … of CLARIN annual …, 2021 - epubl.ktu.edu
Abstract [eng] This paper outlines the ParlaMint project from the perspective of its goals,
tasks, participants, results and applications potential. The project produced language …

[PDF][PDF] Empirical Approaches to Variation. The Case of Timok Variety of Torlak

T Vukovic - 2024 - zora.uzh.ch
The present dissertation delves into the intricate world of language variation, focusing
specifically on the Timok dialect, a sub-standard variety spoken in Southeast Serbia and part …