[PDF][PDF] NoSta-D Named Entity Annotation for German: Guidelines and Dataset.

D Benikova, C Biemann, M Reznicek - LREC, 2014 - lrec-conf.org
We describe the annotation of a new dataset for German Named Entity Recognition (NER).
The need for this dataset is motivated by licensing issues and consistency issues of existing …

A survey on syntactic processing techniques

X Zhang, R Mao, E Cambria - Artificial Intelligence Review, 2023 - Springer
Computational syntactic processing is a fundamental technique in natural language
processing. It normally serves as a pre-processing method to transform natural language …

Digitising Swiss German: how to process and study a polycentric spoken language

Y Scherrer, T Samardžić, E Glaser - Language Resources and Evaluation, 2019 - Springer
Swiss dialects of German are, unlike many dialects of other standardised languages, widely
used in everyday communication. Despite this fact, automatic processing of Swiss German is …

German dialect identification in interview transcriptions

S Malmasi, M Zampieri - Proceedings of the Fourth Workshop on …, 2017 - aclanthology.org
This paper presents three systems submitted to the German Dialect Identification (GDI) task
at the VarDial Evaluation Campaign 2017. The task consists of training models to identify the …

Flexible multi-layer spoken dialogue corpora

S Sauer, A Lüdeling - International Journal of Corpus Linguistics, 2016 - jbe-platform.com
This paper describes the construction of deeply annotated spoken dialogue corpora. To
ensure a maximum of flexibility—in the degree of normalization, the types and formats of …

Towards coreference for literary text: Analyzing domain-specific phenomena

I Rösiger, S Schulz, N Reiter - Proceedings of the Second Joint …, 2018 - aclanthology.org
Coreference resolution is the task of grouping together references to the same discourse
entity. Resolving coreference in literary texts could benefit a number of Digital Humanities …

[PDF][PDF] Experiments with easy-first nonprojective constituent parsing

Y Versley - Proceedings of the First Joint Workshop on Statistical …, 2014 - aclanthology.org
Less-configurational languages such as German often show not just morphological variation
but also free word order and nonprojectivity. German is not exceptional in this regard, as …

Collaborative web-based tools for multi-layer text annotation

C Biemann, K Bontcheva, R Eckart de Castilho… - Handbook of Linguistic …, 2017 - Springer
Effectively managing the collaboration of many annotators is a crucial ingredient for the
success of larger annotation projects. For collaboration, web-based tools offer a low-entry …

Sebastian, Basti, Wastl?! Recognizing Named Entities in Bavarian Dialectal Data

S Peng, Z Sun, H Shan, M Kolm, V Blaschke… - arXiv preprint arXiv …, 2024 - arxiv.org
Named Entity Recognition (NER) is a fundamental task to extract key information from texts,
but annotated resources are scarce for dialects. This paper introduces the first dialectal NER …

[PDF][PDF] Seq2seq or perceptrons for robust lemmatization. an empirical examination

T Pütz, D De Kok, S Pütz, E Hinrichs - Proceedings of the 17th …, 2018 - twuebi.github.io
We propose a morphologically-informed neural Sequence to Sequence (Seq2Seq)
architecture for lemmatization. We evaluate the architecture on German and compare it to a …