Data-driven news generation for automated journalism

L Leppänen, M Munezero… - Proceedings of the …, 2017 - aclanthology.org
Despite increasing amounts of data and ever improving natural language generation
techniques, work on automated journalism is still relatively scarce. In this paper, we explore …

Morphology matters: A multilingual language modeling analysis

HH Park, KJ Zhang, C Haley, K Steimel… - Transactions of the …, 2021 - direct.mit.edu
Prior studies in multilingual language modeling (e.g., Cotterell et al.; Mielke et al.) disagree
on whether or not inflectional morphology makes languages harder to model. We attempt to …

A broad-coverage corpus for Finnish named entity recognition

J Luoma, M Oinonen, M Pyykönen… - Proceedings of the …, 2020 - aclanthology.org
We present a new manually annotated corpus for broad-coverage named entity recognition
for Finnish. Building on the original Universal Dependencies Finnish corpus of 754 …

The WMT'18 Morpheval test suites for English-Czech, English-German, English-Finnish and Turkish-English

F Burlot, Y Scherrer, V Ravishankar, O Bojar… - 3rd Conference on …, 2018 - hal.science
Progress in the quality of machine translation output calls for new automatic evaluation
procedures and metrics. In this paper, we extend the Morpheval protocol introduced by …

The effect of morphology in named entity recognition with sequence tagging

O Güngör, T Güngör, S Üsküdarli - Natural Language Engineering, 2019 - cambridge.org
This work proposes a sequential tagger for named entity recognition in morphologically rich
languages. Several schemes for representing the morphological analysis of a word in the …

Rule-based machine translation from English to Finnish

A Hurskainen, J Tiedemann - Proceedings of the Second …, 2017 - aclanthology.org
The paper describes a rule-based machine translation system adapted to English to Finnish
translation. Although the translation system participates in the shared task of news …

LAS: an integrated language analysis tool for multiple languages.

E Mäkelä - J. Open Source Softw., 2016 - pdfs.semanticscholar.org
LAS is a command-line tool for lemmatizing, morphological analysis, inflected form
generation, hyphenation and language identification of multiple languages. These …

Affect as a proxy for literary mood

E Öhman, R Rossi - Journal of Data Mining & Digital …, 2023 - jdmdh.episciences.org
We propose to use affect as a proxy for mood in literary texts. In this study, we explore the
differences in computationally detecting tone versus detecting mood. Methodologically we …

Abu-MaTran at WMT 2016 translation task: Deep learning, morphological segmentation and tuning on character sequences

VM Sánchez-Cartagena, A Toral - Proceedings of the First …, 2016 - aclanthology.org
This paper presents the systems submitted by the Abu-MaTran project to the English-to-Finnish
language pair at the WMT 2016 news translation task. We applied morphological …

An OCR pipeline for transforming parliamentary debates into linked data: Case ParliamentSampo–Parliament of Finland on the semantic web

S Drobac, L Sinikallio, E Hyvönen - Digital Humanities in the …, 2023 - research.aalto.fi
This paper presents the OCR pipeline created for ParliamentSampo–Parliament of Finland
on the Semantic Web, a Linked Open Data (LOD) service, data infrastructure, and semantic …