From news to comment: Resources and benchmarks for parsing the language of web 2.0

A comprehensive survey on word representation models: From classical to state-of-the-art word representation language models

U Naseem, I Razzak, SK Khan, M Prasad - Transactions on Asian and …, 2021 - dl.acm.org

Word representation has always been an important research area in the history of natural
language processing (NLP). Understanding such complex text data is imperative, given that …

Cited by 220 · All 10 versions

[PDF] What to do about bad language on the internet

J Eisenstein - Proceedings of the 2013 conference of the North …, 2013 - aclanthology.org

The rise of social media has brought computational linguistics in ever-closer contact with
bad language: text that defies our expectations about vocabulary, spelling, and syntax. This …

Cited by 520 · All 6 versions

Argumentation mining in user-generated web discourse

I Habernal, I Gurevych - Computational linguistics, 2017 - direct.mit.edu

The goal of argumentation mining, an evolving research field in computational linguistics, is
to design methods capable of analyzing people's argumentation. In this article, we go …

Cited by 336 · All 12 versions

[BOOK] Natural language processing for social media

A Farzindar, D Inkpen, G Hirst - 2015 - Springer

In recent years, online social networking has revolutionized interpersonal communication.
The newer research on language analysis in social media has been increasingly focusing …

Cited by 272 · All 9 versions

[PDF] How noisy social media text, how diffrnt social media sources?

T Baldwin, P Cook, M Lui, A MacKinlay… - Proceedings of the sixth …, 2013 - aclanthology.org

While various claims have been made about text in social media text being noisy, there has
never been a systematic study to investigate just how linguistically noisy or otherwise it is …

Cited by 342 · All 6 versions

A dependency parser for tweets

L Kong, N Schneider, S Swayamdipta… - Proceedings of the …, 2014 - hub.hku.hk

© 2014 Association for Computational Linguistics. We describe a new dependency parser
for English tweets, TWEEBOPARSER. The parser builds on several contributions: new …

Cited by 296 · All 23 versions

What to do about non-standard (or non-canonical) language in NLP

B Plank - arXiv preprint arXiv:1608.07836, 2016 - arxiv.org

Real world data differs radically from the benchmark corpora we use in natural language
processing (NLP). As soon as we apply our technologies to the real world, performance …

Cited by 109 · All 9 versions

[HTML] Building the essential resources for Finnish: the Turku Dependency Treebank

K Haverinen, J Nyblom, T Viljanen, V Laippala… - Language Resources …, 2014 - Springer

In this paper, we present the final version of a publicly available treebank of Finnish, the
Turku Dependency Treebank. The treebank contains 204,399 tokens (15,126 sentences) …

Cited by 140 · All 11 versions

Systematic patterning in phonologically‐motivated orthographic variation

J Eisenstein - Journal of Sociolinguistics, 2015 - Wiley Online Library

Social media features a wide range of non‐standard spellings, many of which appear
inspired by phonological variation. However, the nature of the connection between variation …

Cited by 116 · All 3 versions

[PDF] Learning part-of-speech taggers with inter-annotator agreement loss

B Plank, D Hovy, A Søgaard - Proceedings of EACL, 2014 - iris.unibocconi.it

In natural language processing (NLP) annotation projects, we use inter-annotator
agreement measures and annotation guidelines to ensure consistent annotations. However …

Cited by 123 · All 12 versions