NusaCrowd: Open source initiative for Indonesian NLP resources

S Cahyawijaya, H Lovenia, AF Aji… - Findings of the …, 2023 - aclanthology.org
We present NusaCrowd, a collaborative initiative to collect and unify existing resources for
Indonesian languages, including opening access to previously non-public resources …

Learning word vectors for 157 languages

E Grave, P Bojanowski, P Gupta, A Joulin… - arXiv preprint arXiv …, 2018 - arxiv.org
Distributed word representations, or word vectors, have recently been applied to many tasks
in natural language processing, leading to state-of-the-art performance. A key ingredient to …

Facebook FAIR's WMT19 news translation task submission

N Ng, K Yee, A Baevski, M Ott, M Auli… - arXiv preprint arXiv …, 2019 - arxiv.org
This paper describes Facebook FAIR's submission to the WMT19 shared news translation
task. We participate in two language pairs and four language directions, English<-> German …

Opensubtitles2016: Extracting large parallel corpora from movie and tv subtitles

P Lison, J Tiedemann - 2016 - duo.uio.no
We present a new major release of the OpenSubtitles collection of parallel corpora. The
release is compiled from a large database of movie and TV subtitles and includes a total of …

Results of the WNUT2017 shared task on novel and emerging entity recognition

L Derczynski, E Nichols, M Van Erp… - Proceedings of the 3rd …, 2017 - aclanthology.org
This shared task focuses on identifying unusual, previously-unseen entities in the context of
emerging discussions. Named entities form the basis of many modern approaches to other …

Emonet: Fine-grained emotion detection with gated recurrent neural networks

M Abdul-Mageed, L Ungar - … of the 55th annual meeting of the …, 2017 - aclanthology.org
Accurate detection of emotion from natural language has applications ranging from building
emotional chatbots to better understanding individuals and their lives. However, progress on …

Demographic dialectal variation in social media: A case study of African-American English

SL Blodgett, L Green, B O'Connor - arXiv preprint arXiv:1608.08868, 2016 - arxiv.org
Though dialectal language is increasingly abundant on social media, few resources exist for
developing NLP tools to handle such language. We conduct a case study of dialectal …

Sentiment analysis of public social media as a tool for health-related topics

F Arias, MZ Nunez, A Guerra-Adames… - IEEE …, 2022 - ieeexplore.ieee.org
For decades, researchers have experimented with the possibility that machines can equal
human linguistic capabilities. Recently, advances in the field of natural language processing …

The privacy policy landscape after the GDPR

T Linden, R Khandelwal, H Harkous… - arXiv preprint arXiv …, 2018 - arxiv.org
The EU General Data Protection Regulation (GDPR) is one of the most demanding and
comprehensive privacy regulations of all time. A year after it went into effect, we study its …

The 7 Ps marketing mix of home-sharing services: Mining travelers' online reviews on Airbnb

L Kwok, Y Tang, B Yu - International Journal of Hospitality Management, 2020 - Elsevier
The 7 Ps model is a very useful tool in helping service firms solve managerial issues in
marketing. Guided by the 7 Ps marketing mix framework, a big-data, supervised machine …