NoSta-D Named Entity Annotation for German: Guidelines and Dataset.
We describe the annotation of a new dataset for German Named Entity Recognition (NER).
The need for this dataset is motivated by licensing issues and consistency issues of existing …
A survey on syntactic processing techniques
Computational syntactic processing is a fundamental technique in natural language
processing. It normally serves as a pre-processing method to transform natural language …
Digitising Swiss German: how to process and study a polycentric spoken language
Swiss dialects of German are, unlike many dialects of other standardised languages, widely
used in everyday communication. Despite this fact, automatic processing of Swiss German is …
German dialect identification in interview transcriptions
S Malmasi, M Zampieri - Proceedings of the Fourth Workshop on …, 2017 - aclanthology.org
This paper presents three systems submitted to the German Dialect Identification (GDI) task
at the VarDial Evaluation Campaign 2017. The task consists of training models to identify the …
Flexible multi-layer spoken dialogue corpora
S Sauer, A Lüdeling - International Journal of Corpus Linguistics, 2016 - jbe-platform.com
This paper describes the construction of deeply annotated spoken dialogue corpora. To
ensure a maximum of flexibility—in the degree of normalization, the types and formats of …
Towards coreference for literary text: Analyzing domain-specific phenomena
I Rösiger, S Schulz, N Reiter - Proceedings of the Second Joint …, 2018 - aclanthology.org
Coreference resolution is the task of grouping together references to the same discourse
entity. Resolving coreference in literary texts could benefit a number of Digital Humanities …
Experiments with easy-first nonprojective constituent parsing
Y Versley - Proceedings of the First Joint Workshop on Statistical …, 2014 - aclanthology.org
Less-configurational languages such as German often show not just morphological variation
but also free word order and nonprojectivity. German is not exceptional in this regard, as …
Collaborative web-based tools for multi-layer text annotation
Effectively managing the collaboration of many annotators is a crucial ingredient for the
success of larger annotation projects. For collaboration, web-based tools offer a low-entry …
Sebastian, Basti, Wastl?! Recognizing Named Entities in Bavarian Dialectal Data
Named Entity Recognition (NER) is a fundamental task to extract key information from texts,
but annotated resources are scarce for dialects. This paper introduces the first dialectal NER …
Seq2seq or perceptrons for robust lemmatization. An empirical examination
T Pütz, D De Kok, S Pütz, E Hinrichs - Proceedings of the 17th …, 2018 - twuebi.github.io
We propose a morphologically-informed neural Sequence to Sequence (Seq2Seq)
architecture for lemmatization. We evaluate the architecture on German and compare it to a …