A corpus linguistic perspective on contemporary German pop lyrics with the multi-layer annotated “Songkorpus”

R Schneider - Proceedings of the Twelfth Language Resources …, 2020 - aclanthology.org
Song lyrics can be considered as a text genre that has features of both written and spoken
discourse, and potentially provides extensive linguistic and cultural information to scientists …

Internetbasierte Kommunikation und Korpuslinguistik: Repräsentation basaler Interaktionsformate in TEI

M Beißwenger - Digitale Infrastrukturen für die germanistische …, 2018 - library.oapen.org
The paper describes a basic schema for representing corpora of
internet-based communication on the basis of the Guidelines for Electronic Text …

Integrating corpora of computer-mediated communication in CLARIN-D: Results from the curation project ChatCorpus2CLARIN

H Lüngen, M Beißwenger, E Ehrhardt… - Proceedings of the …, 2016 - ids-pub.bsz-bw.de
We introduce our pipeline to integrate CMC and SM corpora into the CLARIN-D corpus
infrastructure. The pipeline was developed by transforming an existing CMC corpus, the …

Closing a gap in the language resources landscape: Groundwork and best practices from projects on computer-mediated communication in four European countries.

M Beißwenger, T Chanier, I Chiari, T Erjavec… - CLARIN Annual …, 2017 - hal.science
The paper presents best practices and results from projects in four countries dedicated to the
creation of corpora of computer-mediated communication and social media interactions …

Internet corpora: A challenge for linguistic processing

A Horbach, S Thater, D Steffen, PM Fischer, A Witt… - Datenbank …, 2015 - Springer
Natural language processing tools are mostly developed for and optimized on newspaper
texts, and often show a substantial performance drop when applied to other types of texts …

Adding value to CMC corpora: CLARINification and part-of-speech annotation of the Dortmund chat corpus

M Beißwenger, E Ehrhardt, A Horbach… - … /Zesch, Torsten (Hg.) …, 2015 - cmc-corpora.org
ChatCorpus2CLARIN is a curation project of the discipline-specific working group “German
Philology” (F-AG 1) within the joint infrastructure project CLARIN-D. In this project, an existing …

Rechtliche Bedingungen für die Bereitstellung eines Chat-Korpus in CLARIN-D. Ergebnisse eines Rechtsgutachtens

M Beißwenger, H Lüngen, J Schallaböck… - 2017 - degruyter.com
In their legal opinion, the two experts assessed the corpus from the perspective of
data protection law, personality rights law, copyright law and ancillary copyright law …

The making of the Litkey Corpus, a richly annotated longitudinal corpus of German texts written by primary school children

R Laarmann-Quante, S Dipper… - Proceedings of the 13th …, 2019 - aclanthology.org
To date, corpus and computational linguistic work on written language acquisition has
mostly dealt with second language learners who have usually already mastered …

“Konservenglück in Tiefkühl-Town”–Das Songkorpus als empirische Ressource interdisziplinärer Erforschung deutschsprachiger Poptexte

R Schneider - Preliminary proceedings of the 15th Conference …, 2019 - ids-pub.bsz-bw.de
The paper describes a multi-layer annotated corpus of German-language song lyrics as a
data basis for interdisciplinary research scenarios. The resource enables empirically …

Improving POS tagging of German learner language in a reading comprehension scenario

L Keiper, A Horbach, S Thater - Proceedings of the Tenth …, 2016 - aclanthology.org
We present a novel method to automatically improve the accuracy of part-of-speech taggers
on learner language. The key idea underlying our approach is to exploit the structure of a …