Text mining approaches for dealing with the rapidly expanding literature on COVID-19

LL Wang, K Lo - Briefings in Bioinformatics, 2021 - academic.oup.com
More than 50 000 papers have been published about COVID-19 since the beginning of
2020 and several hundred new papers continue to be published every day. This incredible …

Unsupervised and self-supervised deep learning approaches for biomedical text mining

M Nadif, F Role - Briefings in Bioinformatics, 2021 - academic.oup.com
Biomedical scientific literature is growing at a very rapid pace, which makes increasingly
difficult for human experts to spot the most relevant results hidden in the papers …

Self-alignment pretraining for biomedical entity representations

F Liu, E Shareghi, Z Meng, M Basaldella… - arXiv preprint arXiv …, 2020 - arxiv.org
Despite the widespread success of self-supervised learning via masked language models
(MLM), accurately capturing fine-grained semantic relationships in the biomedical domain …

BERN2: an advanced neural biomedical named entity recognition and normalization tool

M Sung, M Jeong, Y Choi, D Kim, J Lee, J Kang - Bioinformatics, 2022 - academic.oup.com
In biomedical natural language processing, named entity recognition (NER) and named
entity normalization (NEN) are key tasks that enable the automatic extraction of biomedical …

[HTML][HTML] Building a PubMed knowledge graph

J Xu, S Kim, M Song, M Jeong, D Kim, J Kang… - Scientific data, 2020 - nature.com
PubMed® is an essential resource for the medical domain, but useful concepts are either
difficult to extract or are ambiguous, which has significantly hindered knowledge discovery …

[HTML][HTML] Enhancing phenotype recognition in clinical notes using large language models: PhenoBCBERT and PhenoGPT

J Yang, C Liu, W Deng, D Wu, C Weng, Y Zhou… - Patterns, 2024 - cell.com
To enhance phenotype recognition in clinical notes of genetic diseases, we developed two
models—PhenoBCBERT and PhenoGPT—for expanding the vocabularies of Human …

Pre-trained language model for biomedical question answering

W Yoon, J Lee, D Kim, M Jeong, J Kang - Joint European conference on …, 2019 - Springer
The recent success of question answering systems is largely attributed to pre-trained
language models. However, as language models are mostly pre-trained on general domain …

ChimerDB 4.0: an updated and expanded database of fusion genes

YE Jang, I Jang, S Kim, S Cho, D Kim… - Nucleic acids …, 2020 - academic.oup.com
Fusion genes represent an important class of biomarkers and therapeutic targets in cancer.
ChimerDB is a comprehensive database of fusion genes encompassing analysis of deep …

[HTML][HTML] A pre-training and self-training approach for biomedical named entity recognition

S Gao, O Kotevska, A Sorokine, JB Christian - PloS one, 2021 - journals.plos.org
Named entity recognition (NER) is a key component of many scientific literature mining
tasks, such as information retrieval, information extraction, and question answering; …

Exploring new digital therapeutics technologies for psychiatric disorders using BERTopic and PatentSBERTa

E Jeon, N Yoon, SY Sohn - Technological Forecasting and Social Change, 2023 - Elsevier
Abstract Coronavirus disease 2019 (COVID-19) has accelerated the growth of the digital
therapeutics (DTx) market; therefore, development strategies for new DTx products are …