Designing a uniform meaning representation for natural language processing

JEL Van Gysel, M Vigus, J Chun, K Lai, S Moeller… - KI-Künstliche …, 2021 - Springer
In this paper we present Uniform Meaning Representation (UMR), a meaning representation
designed to annotate the semantic content of a text. UMR is primarily based on Abstract …

Nefnir: A high accuracy lemmatizer for Icelandic

SL Ingólfsdóttir, H Loftsson, JF Daðason… - arXiv preprint arXiv …, 2019 - arxiv.org
Lemmatization, finding the basic morphological form of a word in a corpus, is an important
step in many natural language processing tasks when working with morphologically rich …

[PDF][PDF] Diacritics restoration using neural networks

J Náplava, M Straka, P Straňák… - Proceedings of the …, 2018 - aclanthology.org
In this paper, we describe a novel combination of a character-level recurrent neural-network
based model and a language model applied to diacritics restoration. In many cases in the …

[PDF][PDF] Morphological resources of derivational word-formation relations

L Kyjánek - Praha, Czech Republic: ÚFAL MFF UK. ISSN, 2018 - lukyjanek.github.io
The report focuses on existing morphological resources containing derivational
wordformation relations. For each resource, the report describes history, licence, format …

Action nouns vs. nouns as bases for denominal verbs in Czech: A case study on directionality in derivation

M Ševčíková - Word Structure, 2021 - euppublishing.com
Suffixless action nouns are mostly analysed as deverbal derivatives (eg, výběr 'choice'<
vybírat 'to choose. ipfv'), but dictionaries ascribe the reverse direction to some noun–verb …

Creating a large-scale diachronic corpus resource: Automated parsing in the Greek papyri (and beyond)

A Keersmaekers, T Van Hal - Natural Language Engineering, 2023 - cambridge.org
This paper explores how to syntactically parse Ancient Greek texts automatically and maps
ways of fruitfully employing the results of such an automated analysis. Special attention is …

Discourse relations and connectives in higher text structure

L Poláková, J Mírovský, Š Zikánová… - Dialogue & …, 2021 - journals.uic.edu
The present article investigates possibilities and limits of local (shallow) analysis of
discourse coherence with respect to the phenomena of global coherence and higher …

Modifications of the Czech morphological dictionary for consistent corpus annotation

J Hlaváčová, M Mikulová, B Štěpánková… - Journal of Linguistics …, 2019 - sciendo.com
We describe systematic changes that have been made to the Czech morphological
dictionary related to annotating new data within the project of Prague Dependency Treebank …

Automatic question generation using semantic role labeling for morphologically rich languages

B Žitko, H Ljubić - Tehnički vjesnik, 2021 - hrcak.srce.hr
Sažetak In this paper, a novel approach to automatic question generation (AQG) using
semantic role labeling (SRL) for morphologically rich languages is presented. A model for …

Opera Graeca Adnotata: Building a 34M+ Token Multilayer Corpus for Ancient Greek

GGA Celano - arXiv preprint arXiv:2404.00739, 2024 - arxiv.org
In this article, the beta version 0.1. 0 of Opera Graeca Adnotata (OGA), the largest open-
access multilayer corpus for Ancient Greek (AG) is presented. OGA consists of 1,687 literary …