A survey on automatic detection of hate speech in text

P Fortuna, S Nunes - ACM Computing Surveys (CSUR), 2018 - dl.acm.org
The scientific study of hate speech, from a computer science point of view, is recent. This
survey organizes and describes the current state of the field, providing a structured overview …

Aspect-based sentiment analysis of movie reviews on discussion boards

TT Thet, JC Na, CSG Khoo - Journal of information science, 2010 - journals.sagepub.com
In this article, a method for automatic sentiment analysis of movie reviews is proposed,
implemented and evaluated. In contrast to most studies that focus on determining only …

Where to hide a stolen elephant: Leaps in creative writing with multimodal machine intelligence

N Singh, G Bernal, D Savchenko… - ACM Transactions on …, 2023 - dl.acm.org
While developing a story, novices and published writers alike have had to look outside
themselves for inspiration. Language models have recently been able to generate text …

IndoLEM and IndoBERT: A benchmark dataset and pre-trained language model for Indonesian NLP

F Koto, A Rahimi, JH Lau, T Baldwin - arXiv preprint arXiv:2011.00677, 2020 - arxiv.org
Although the Indonesian language is spoken by almost 200 million people and the 10th
most spoken language in the world, it is under-represented in NLP research. Previous work …

Chestx-ray8: Hospital-scale chest x-ray database and benchmarks on weakly-supervised classification and localization of common thorax diseases

X Wang, Y Peng, L Lu, Z Lu… - Proceedings of the …, 2017 - openaccess.thecvf.com
The chest X-ray is one of the most commonly accessible radiological examinations for
screening and diagnosis of many lung diseases. A tremendous number of X-ray imaging …

Faithful to the original: Fact aware neural abstractive summarization

Z Cao, F Wei, W Li, S Li - Proceedings of the AAAI Conference on …, 2018 - ojs.aaai.org
Unlike extractive summarization, abstractive summarization has to fuse different parts of the
source text, which inclines to create fake facts. Our preliminary study reveals nearly 30% of …

Simple and accurate dependency parsing using bidirectional LSTM feature representations

E Kiperwasser, Y Goldberg - Transactions of the Association for …, 2016 - direct.mit.edu
We present a simple and effective scheme for dependency parsing which is based on
bidirectional-LSTMs (BiLSTMs). Each sentence token is associated with a BiLSTM vector …

Demographic dialectal variation in social media: A case study of African-American English

SL Blodgett, L Green, B O'Connor - arXiv preprint arXiv:1608.08868, 2016 - arxiv.org
Though dialectal language is increasingly abundant on social media, few resources exist for
developing NLP tools to handle such language. We conduct a case study of dialectal …

Answering natural language questions by subgraph matching over knowledge graphs

S Hu, L Zou, JX Yu, H Wang… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
RDF question/answering (Q/A) allows users to ask questions in natural languages over a
knowledge base represented by RDF. To answer a natural language question, the existing …

The GUM corpus: Creating multilayer resources in the classroom

A Zeldes - Language Resources and Evaluation, 2017 - Springer
This paper presents the methodology, design principles and detailed evaluation of a new
freely available multilayer corpus, collected and edited via classroom annotation using …