Learning attention for historical text normalization by learning to pronounce

S Ruder - 2019 - aran.library.nuigalway.ie

Language is often regarded as the hallmark of human intelligence. Developing systems that
can understand human language is thus one of the main obstacles on the quest towards …

被引用次数：409 相关文章所有 2 个版本

Bridging the gaps: Multi task learning for domain transfer of hate speech detection

Z Waseem, J Thorne, J Bingel - Online harassment, 2018 - Springer

Accurately detecting hate speech using supervised classification is dependent on data that
is annotated by humans. Attaining high agreement amongst annotators though is difficult …

被引用次数：112 相关文章所有 3 个版本

[PDF] arxiv.org

A large-scale comparison of historical text normalization systems

M Bollmann - arXiv preprint arXiv:1904.02036, 2019 - arxiv.org

There is no consensus on the state-of-the-art approach to historical text normalization. Many
techniques have been proposed, including rule-based methods, distance metrics, character …

被引用次数：94 相关文章所有 7 个版本

[PDF] arxiv.org

Multi-task learning of pairwise sequence classification tasks over disparate label spaces

I Augenstein, S Ruder, A Søgaard - arXiv preprint arXiv:1802.09913, 2018 - arxiv.org

We combine multi-task learning and semi-supervised learning by inducing a joint
embedding space between disparate label spaces and learning transfer functions between …

被引用次数：86 相关文章所有 4 个版本

[PDF] arxiv.org

Neural networks for text correction and completion in keyboard decoding

S Ghosh, PO Kristensson - arXiv preprint arXiv:1709.06429, 2017 - arxiv.org

Despite the ubiquity of mobile and wearable text messaging applications, the problem of
keyboard text decoding is not tackled sufficiently in the light of the enormous success of the …

被引用次数：85 相关文章所有 2 个版本

[PDF] aclanthology.org

[PDF][PDF] Event extraction from historical texts: A new dataset for black rebellions

VD Lai, M Van Nguyen, H Kaufman… - Findings of the …, 2021 - aclanthology.org

Understanding historical events is necessary for the study of contemporary society, culture,
and politics. In this work, we focus on the event extraction task (EE) to detect event trigger …

被引用次数：29 相关文章所有 4 个版本

[PDF] arxiv.org

Towards realistic practices in low-resource natural language processing: The development set

K Kann, K Cho, SR Bowman - arXiv preprint arXiv:1909.01522, 2019 - arxiv.org

Development sets are impractical to obtain for real low-resource languages, since using all
available data for training is often more effective. However, development sets are widely …

被引用次数：53 相关文章所有 4 个版本

E-commerce review sentiment score prediction considering misspelled words: A deep learning approach

S Jain, PK Roy - Electronic Commerce Research, 2024 - Springer

Acquiring a single sentiment score dependent on all the reviews will benefit the buyers and
sellers in making the decision more accurately. The raw format of user-generated content …

被引用次数：16 相关文章

[PDF] ed.ac.uk

Training data augmentation for low-resource morphological inflection

T Bergmanis, K Kann, H Schütze… - … 2017 Shared Task …, 2017 - research.ed.ac.uk

This work describes the UoE-LMU submission for the CoNLL-SIGMORPHON 2017 Shared
Task on Universal Morphological Reinflection, Subtask 1: given a lemma and target …

被引用次数：63 相关文章所有 5 个版本

[PDF] arxiv.org

An evaluation of neural machine translation models on historical spelling normalization

G Tang, F Cap, E Pettersson, J Nivre - arXiv preprint arXiv:1806.05210, 2018 - arxiv.org

In this paper, we apply different NMT models to the problem of historical spelling
normalization for five languages: English, German, Hungarian, Icelandic, and Swedish. The …

被引用次数：53 相关文章所有 6 个版本