A survey on sentiment analysis methods, applications, and challenges

M Wankhade, ACS Rao, C Kulkarni - Artificial Intelligence Review, 2022 - Springer
The rapid growth of Internet-based applications, such as social media platforms and blogs,
has resulted in comments and reviews concerning day-to-day activities. Sentiment analysis …

From population to production: 50 years of scientific literature on how to feed the world

L Tamburino, G Bravo, Y Clough, KA Nicholas - Global Food Security, 2020 - Elsevier
How to feed the world is a vigorously debated question, but the extent to which possible
solutions receive attention in the scientific literature has not been studied. Using textual …

How good is your tokenizer? On the monolingual performance of multilingual language models

P Rust, J Pfeiffer, I Vulić, S Ruder… - arXiv preprint arXiv …, 2020 - arxiv.org
In this work, we provide a systematic and comprehensive empirical comparison of pretrained
multilingual language models versus their monolingual counterparts with regard to their …

Tokenizing, POS tagging, lemmatizing and parsing UD 2.0 with UDPipe

M Straka, J Straková - Proceedings of the CoNLL 2017 shared …, 2017 - aclanthology.org
Many natural language processing tasks, including the most advanced ones, routinely start
by several basic processing steps – tokenization and segmentation, most likely also POS …

The Tatoeba Translation Challenge--Realistic Data Sets for Low Resource and Multilingual MT

J Tiedemann - arXiv preprint arXiv:2010.06354, 2020 - arxiv.org
This paper describes the development of a new benchmark for machine translation that
provides training and test data for thousands of language pairs covering over 500 …

UDPipe 2.0 prototype at CoNLL 2018 UD shared task

M Straka - Proceedings of the CoNLL 2018 shared task …, 2018 - aclanthology.org
UDPipe is a trainable pipeline which performs sentence segmentation, tokenization, POS
tagging, lemmatization and dependency parsing. We present a prototype for UDPipe 2.0 …

Machine learning for ancient languages: A survey

T Sommerschield, Y Assael, J Pavlopoulos… - Computational …, 2023 - direct.mit.edu
Ancient languages preserve the cultures and histories of the past. However, their study is
fraught with difficulties, and experts must tackle a range of challenging text-based tasks, from …

Towards better UD parsing: Deep contextualized word embeddings, ensemble, and treebank concatenation

W Che, Y Liu, Y Wang, B Zheng, T Liu - arXiv preprint arXiv:1807.03121, 2018 - arxiv.org
This paper describes our system (HIT-SCIR) submitted to the CoNLL 2018 shared task on
Multilingual Parsing from Raw Text to Universal Dependencies. We base our submission on …

Can language models encode perceptual structure without grounding? A case study in color

M Abdou, A Kulmizev, D Hershcovich, S Frank… - arXiv preprint arXiv …, 2021 - arxiv.org
Pretrained language models have been shown to encode relational information, such as the
relations between entities or concepts in knowledge-bases--(Paris, Capital, France) …

Stanford's graph-based neural dependency parser at the CoNLL 2017 shared task

T Dozat, P Qi, CD Manning - … of the CoNLL 2017 shared task …, 2017 - aclanthology.org
This paper describes the neural dependency parser submitted by Stanford to the CoNLL
2017 Shared Task on parsing Universal Dependencies. Our system uses relatively simple …