ALCIDE: Extracting and visualising content from large document collections to support humanities studies

G Moretti, R Sprugnoli, S Menini, S Tonelli - Knowledge-Based Systems, 2016 - Elsevier
The application of research practices and methodologies from the Information and
Communication Technologies to Humanities studies is having a great impact on the way …

Design of negative sampling strategies for distantly supervised skill extraction

JJ Decorte, J Van Hautte, J Deleu, C Develder… - arXiv preprint arXiv …, 2022 - arxiv.org
Skills play a central role in the job market and many human resources (HR) processes. In
the wake of other digital experiences, today's online job market has candidates expecting to …

[HTML][HTML] Clinical information extraction for preterm birth risk prediction

L Sterckx, G Vandewiele, I Dehaene… - Journal of Biomedical …, 2020 - Elsevier
This paper contributes to the pursuit of leveraging unstructured medical notes to structured
clinical decision making. In particular, we present a pipeline for clinical information …

Unsupervised feature selection based on the Morisita estimator of intrinsic dimension

J Golay, M Kanevski - Knowledge-Based Systems, 2017 - Elsevier
This paper deals with a new filter algorithm for selecting the smallest subset of features
carrying all the information content of a dataset (ie for removing redundant features). It is an …

Time expression as update operations: Normalizing time expressions via a distantly supervised neural semantic parser

W Ding, J Chen, E Longfei, J Li, Y Qu - Knowledge-Based Systems, 2023 - Elsevier
Existing approaches to normalize time expressions highly rely on expert-engineered rules or
grammars, which are labor-intensive and difficult to scale to different scenarios. In this …

Crowdsourcing semantic label propagation in relation classification

A Dumitrache, L Aroyo, C Welty - arXiv preprint arXiv:1809.00537, 2018 - arxiv.org
Distant supervision is a popular method for performing relation extraction from text that is
known to produce noisy labels. Most progress in relation extraction and classification has …

[PDF][PDF] Truth in disagreement: Crowdsourcing labeled data for natural language processing

A Dumitrache - 2019 - research.vu.nl
In this chapter, we present the motivation of this thesis, as well as related work on
crowdsourcing ground truth for natural language processing. We introduce the CrowdTruth …

Integrative KnowGen: integrative knowledge base generation for criminology as a domain of choice

GS Chhatwal, G Deepak - International Conference on Digital …, 2022 - Springer
Abstract Knowledge bases are essential in developing expert intelligent systems to solve
domain specific problems. The low availability of detailed knowledge bases is a byproduct of …

A comparative study of the effectiveness of sentiment tools and human coding in sarcasm detection

PL Teh, PB Ooi, NN Chan, YK Chuah - Journal of Systems and …, 2018 - emerald.com
Purpose Sarcasm is often used in everyday speech and writing and is prevalent in online
contexts. The purpose of this paper is to investigate the analogy between sarcasm …

Adversarial Learning for Supervised and Semi-supervised Relation Extraction in Biomedical Literature

P Su, K Vijay-Shanker - arXiv preprint arXiv:2005.04277, 2020 - arxiv.org
Adversarial training is a technique of improving model performance by involving adversarial
examples in the training process. In this paper, we investigate adversarial training with …