Large-scale hate speech detection with cross-domain transfer

C Toraman, EH Yilmaz, F Şahinuç… - ACM Transactions on …, 2023 - dl.acm.org

Tokenization is an important text preprocessing step to prepare input tokens for deep
language models. WordPiece and BPE are de facto methods employed by important …

被引用次数：73 相关文章所有 4 个版本

Named entity recognition in Turkish: A comparative study with detailed error analysis

O Ozcelik, C Toraman - Information Processing & Management, 2022 - Elsevier

Named entity recognition aims to detect pre-determined entity types in unstructured text.
There is a limited number of studies on this task for low-resource languages such as Turkish …

被引用次数：19 相关文章所有 3 个版本

[PDF] springer.com

An approach to automatic classification of hate speech in sports domain on social media

S Vujičić Stanković, M Mladenović - Journal of Big Data, 2023 - Springer

Hate Speech encompasses different forms of trolling, bullying, harassment, and threats
directed against specific individuals or groups. This phenomena is mainly expressed on …

被引用次数：12 相关文章所有 6 个版本

[PDF] aclanthology.org

What did you learn to hate? a topic-oriented analysis of generalization in hate speech detection

T Bourgeade, P Chiril, F Benamara… - Proceedings of the 17th …, 2023 - aclanthology.org

Hate speech has unfortunately become a significant phenomenon on social media
platforms, and it can cover various topics (misogyny, sexism, racism, xenophobia, etc.) and …

被引用次数：15 相关文章所有 8 个版本

[PDF] aaai.org

Metahate: A dataset for unifying efforts on hate speech detection

P Piot, P Martín-Rodilla, J Parapar - Proceedings of the International …, 2024 - ojs.aaai.org

Hate speech represents a pervasive and detrimental form of online discourse, often
manifested through an array of slurs, from hateful tweets to defamatory posts. As such …

被引用次数：5 相关文章所有 4 个版本

[PDF] arxiv.org

Arc-nlp at multimodal hate speech event detection 2023: Multimodal methods boosted by ensemble learning, syntactical and entity features

U Sahin, IE Kucukkaya, O Ozcelik… - arXiv preprint arXiv …, 2023 - arxiv.org

Text-embedded images can serve as a means of spreading hate speech, propaganda, and
extremist beliefs. Throughout the Russia-Ukraine war, both opposing factions heavily relied …

被引用次数：7 相关文章所有 6 个版本

[PDF] arxiv.org

Revisiting hate speech benchmarks: From data curation to system deployment

A Kulkarni, S Masud, V Goyal… - Proceedings of the 29th …, 2023 - dl.acm.org

Social media is awash with hateful content, much of which is often veiled with linguistic and
topical diversity. The benchmark datasets used for hate speech detection do not account for …

被引用次数：8 相关文章所有 4 个版本

[PDF] springer.com

Resources for Turkish natural language processing: A critical survey

Ç Çöltekin, AS Doğruöz, Ö Çetinoğlu - Language Resources and …, 2023 - Springer

This paper presents a comprehensive survey of corpora and lexical resources available for
Turkish. We review a broad range of resources, focusing on the ones that are publicly …

被引用次数：17 相关文章所有 14 个版本

[PDF] iospress.nl

Nehate: Large-scale annotated data shedding light on hate speech in nepali local election discourse

S Thapa, K Rauniyar, S Shiwakoti, S Poudel… - ECAI 2023, 2023 - ebooks.iospress.nl

The use of social media during election campaigns has become increasingly popular.
However, the unbridled nature of online discourse can lead to the propagation of hate …

被引用次数：12 相关文章所有 5 个版本

[PDF] arxiv.org

LLMs and Finetuning: Benchmarking cross-domain performance for hate speech detection

A Nasir, A Sharma, K Jaidka - arXiv preprint arXiv:2310.18964, 2023 - arxiv.org

This paper compares different pre-trained and fine-tuned large language models (LLMs) for
hate speech detection. Our research underscores challenges in LLMs' cross-domain validity …

被引用次数：7 相关文章所有 2 个版本