[PDF][PDF] Neural transfer learning for natural language processing

S Ruder - 2019 - aran.library.nuigalway.ie
Language is often regarded as the hallmark of human intelligence. Developing systems that
can understand human language is thus one of the main obstacles on the quest towards …

Bridging the gaps: Multi task learning for domain transfer of hate speech detection

Z Waseem, J Thorne, J Bingel - Online harassment, 2018 - Springer
Accurately detecting hate speech using supervised classification is dependent on data that
is annotated by humans. Attaining high agreement amongst annotators though is difficult …

A large-scale comparison of historical text normalization systems

M Bollmann - arXiv preprint arXiv:1904.02036, 2019 - arxiv.org
There is no consensus on the state-of-the-art approach to historical text normalization. Many
techniques have been proposed, including rule-based methods, distance metrics, character …

Multi-task learning of pairwise sequence classification tasks over disparate label spaces

I Augenstein, S Ruder, A Søgaard - arXiv preprint arXiv:1802.09913, 2018 - arxiv.org
We combine multi-task learning and semi-supervised learning by inducing a joint
embedding space between disparate label spaces and learning transfer functions between …

Neural networks for text correction and completion in keyboard decoding

S Ghosh, PO Kristensson - arXiv preprint arXiv:1709.06429, 2017 - arxiv.org
Despite the ubiquity of mobile and wearable text messaging applications, the problem of
keyboard text decoding is not tackled sufficiently in the light of the enormous success of the …

[PDF][PDF] Event extraction from historical texts: A new dataset for black rebellions

VD Lai, M Van Nguyen, H Kaufman… - Findings of the …, 2021 - aclanthology.org
Understanding historical events is necessary for the study of contemporary society, culture,
and politics. In this work, we focus on the event extraction task (EE) to detect event trigger …

Towards realistic practices in low-resource natural language processing: The development set

K Kann, K Cho, SR Bowman - arXiv preprint arXiv:1909.01522, 2019 - arxiv.org
Development sets are impractical to obtain for real low-resource languages, since using all
available data for training is often more effective. However, development sets are widely …

E-commerce review sentiment score prediction considering misspelled words: A deep learning approach

S Jain, PK Roy - Electronic Commerce Research, 2024 - Springer
Acquiring a single sentiment score dependent on all the reviews will benefit the buyers and
sellers in making the decision more accurately. The raw format of user-generated content …

Training data augmentation for low-resource morphological inflection

T Bergmanis, K Kann, H Schütze… - … 2017 Shared Task …, 2017 - research.ed.ac.uk
This work describes the UoE-LMU submission for the CoNLL-SIGMORPHON 2017 Shared
Task on Universal Morphological Reinflection, Subtask 1: given a lemma and target …

An evaluation of neural machine translation models on historical spelling normalization

G Tang, F Cap, E Pettersson, J Nivre - arXiv preprint arXiv:1806.05210, 2018 - arxiv.org
In this paper, we apply different NMT models to the problem of historical spelling
normalization for five languages: English, German, Hungarian, Icelandic, and Swedish. The …