Pre-trained language models in biomedical domain: A systematic survey

B Wang, Q Xie, J Pei, Z Chen, P Tiwari, Z Li… - ACM Computing …, 2023 - dl.acm.org
Pre-trained language models (PLMs) have been the de facto paradigm for most natural
language processing tasks. This also benefits the biomedical domain: researchers from …

[HTML][HTML] Few-shot learning for medical text: A review of advances, trends, and opportunities

Y Ge, Y Guo, S Das, MA Al-Garadi, A Sarker - Journal of Biomedical …, 2023 - Elsevier
Background: Few-shot learning (FSL) is a class of machine learning methods that require
small numbers of labeled instances for training. With many medical topics having limited …

Does synthetic data generation of llms help clinical text mining?

R Tang, X Han, X Jiang, X Hu - arXiv preprint arXiv:2303.04360, 2023 - arxiv.org
Recent advancements in large language models (LLMs) have led to the development of
highly potent models like OpenAI's ChatGPT. These models have exhibited exceptional …

A survey of event extraction from text

W Xiang, B Wang - IEEE Access, 2019 - ieeexplore.ieee.org
Numerous important events happen everyday and everywhere but are reported in different
media sources with different narrative styles. How to detect whether real-world events have …

COVID-19 literature knowledge graph construction and drug repurposing report generation

Q Wang, M Li, X Wang, N Parulian, G Han, J Ma… - arXiv preprint arXiv …, 2020 - arxiv.org
To combat COVID-19, both clinicians and scientists need to digest vast amounts of relevant
biomedical knowledge in scientific literature to understand the disease mechanism and …

Crossner: Evaluating cross-domain named entity recognition

Z Liu, Y Xu, T Yu, W Dai, Z Ji, S Cahyawijaya… - Proceedings of the …, 2021 - ojs.aaai.org
Cross-domain named entity recognition (NER) models are able to cope with the scarcity
issue of NER samples in target domains. However, most of the existing NER benchmarks …

MAVEN: A massive general domain event detection dataset

X Wang, Z Wang, X Han, W Jiang, R Han, Z Liu… - arXiv preprint arXiv …, 2020 - arxiv.org
Event detection (ED), which means identifying event trigger words and classifying event
types, is the first and most fundamental step for extracting event knowledge from plain text …

Distant supervision for relation extraction beyond the sentence boundary

C Quirk, H Poon - arXiv preprint arXiv:1609.04873, 2016 - arxiv.org
The growing demand for structured knowledge has led to great interest in relation extraction,
especially in cases with limited supervision. However, existing distance supervision …

[HTML][HTML] Automated systems for the de-identification of longitudinal clinical narratives: Overview of 2014 i2b2/UTHealth shared task Track 1

A Stubbs, C Kotfila, Ö Uzuner - Journal of biomedical informatics, 2015 - Elsevier
Abstract The 2014 i2b2/UTHealth Natural Language Processing (NLP) shared task featured
four tracks. The first of these was the de-identification track focused on identifying protected …

Community challenges in biomedical text mining over 10 years: success, failure and the future

CC Huang, Z Lu - Briefings in bioinformatics, 2016 - academic.oup.com
One effective way to improve the state of the art is through competitions. Following the
success of the Critical Assessment of protein Structure Prediction (CASP) in bioinformatics …