Domain adaptation: challenges, methods, datasets, and applications

P Singhal, R Walambe, S Ramanna, K Kotecha - IEEE access, 2023 - ieeexplore.ieee.org
Deep Neural Networks (DNNs) trained on one dataset (source domain) do not perform well
on another set of data (target domain), which is different but has similar properties as the …

[HTML][HTML] The history of writing reflects the effects of education on discourse structure: implications for literacy, orality, psychosis and the axial age

S Pinheiro, NB Mota, M Sigman… - Trends in neuroscience …, 2020 - Elsevier
Background Graph analysis detects psychosis and literacy acquisition. Bronze Age literature
has been proposed to contain childish or psychotic features, which would only have matured …

[PDF][PDF] Unsupervised multi-domain adaptation with feature embeddings

Y Yang, J Eisenstein - Proceedings of the 2015 conference of the …, 2015 - aclanthology.org
Abstract Representation learning is the dominant technique for unsupervised domain
adaptation, but existing approaches have two major weaknesses. First, they often require …

Measuring diachronic language distance using perplexity: Application to English, Portuguese, and Spanish

JRP Campos, PG Otero, IA Loinaz - Natural Language Engineering, 2020 - cambridge.org
The objective of this work is to set a corpus-driven methodology to quantify automatically
diachronic language distance between chronological periods of several languages. We …

Part-of-speech tagging for historical English

Y Yang, J Eisenstein - arXiv preprint arXiv:1603.03144, 2016 - arxiv.org
As more historical texts are digitized, there is interest in applying natural language
processing tools to these archives. However, the performance of these tools is often …

[PDF][PDF] Fast easy unsupervised domain adaptation with marginalized structured dropout

Y Yang, J Eisenstein - Proceedings of the 52nd Annual Meeting of …, 2014 - aclanthology.org
Unsupervised domain adaptation often relies on transforming the instance representation.
However, most such approaches are designed for bag-of-words models, and ignore the …

The HeliPaD: a parsed corpus of Old Saxon

G Walkden - International journal of corpus linguistics, 2016 - jbe-platform.com
This short paper introduces the HeliPaD, a new parsed corpus of Old Saxon (Old Low
German). It is annotated according to the standards of the Penn Corpora of Historical …

A Universal Dependencies conversion pipeline for a Penn-format constituency treebank

Þ Arnardóttir, H Hafsteinsson… - Proceedings of the …, 2020 - aclanthology.org
The topic of this paper is a rule-based pipeline for converting constituency treebanks based
on the Penn Treebank format to Universal Dependencies (UD). We describe an Icelandic …

[PDF][PDF] Literature studies in Literateca: between digital humanities and corpus linguistics

D Santos - quot; In Martin Doerr; Øyvind Eide; Oddrun Grønvik; …, 2019 - comum.rcaap.pt
The availability of more and more literary texts in electronic form has opened up the
possibility to do research on large quantities of data using computer-intensive methods …

Extraposition is disappearing

JC Wallenberg - Language, 2016 - muse.jhu.edu
This study describes a change in which relative clause extraposition is in the process of
being lost in English, Icelandic, French, and Portuguese. This current change in progress …