Data-driven materials research enabled by natural language processing and information extraction

EA Olivetti, JM Cole, E Kim, O Kononova… - Applied Physics …, 2020 - pubs.aip.org
Given the emergence of data science and machine learning throughout all aspects of
society, but particularly in the scientific domain, there is increased importance placed on …

Unleashing the power of knowledge extraction from scientific literature in catalysis

Y Zhang, C Wang, M Soukaseum… - Journal of Chemical …, 2022 - ACS Publications
Valuable knowledge of catalysis is often hidden in a large amount of scientific literature.
There is an urgent need to extract useful knowledge to facilitate scientific discovery. This …

A survey of named entity recognition based on deep learning

何玉洁, 杜方, 史英杰, 宋丽娟 - Journal of Computer …, 2021 - search.ebscohost.com
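Named entity recognition is an important fundamental task in many natural language processing
technologies, such as information extraction, machine translation, and question answering. In recent years, deep learning-based named entity recognition has become a major research focus. To help researchers understand the …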

A deep active learning-based and crowdsourcing-assisted solution for named entity recognition in Chinese historical corpora

C Yan, X Tang, H Yang, J Wang - Aslib Journal of Information …, 2023 - emerald.com
Purpose The majority of existing studies about named entity recognition (NER) concentrate
on the prediction enhancement of deep neural network (DNN)-based models themselves …

Applying Human-in-the-Loop to construct a dataset for determining content reliability to combat fake news

A Bonet-Jover, R Sepúlveda-Torres, E Saquete… - … Applications of Artificial …, 2023 - Elsevier
Annotated corpora are indispensable tools to train computational models in Natural
Language Processing. However, in the case of more complex semantic annotation …

Leveraging theory for enhanced machine learning

DJ Audus, A McDannald, B DeCost - ACS Macro Letters, 2022 - ACS Publications
The application of machine learning to the materials domain has traditionally struggled with
two major challenges: a lack of large, curated data sets and the need to understand the …

ChemProps: A RESTful API enabled database for composite polymer name standardization

B Hu, A Lin, LC Brinson - Journal of Cheminformatics, 2021 - Springer
The inconsistency of polymer indexing caused by the lack of uniformity in expression of
polymer names is a major challenge for widespread use of polymer related data resources …

Active learning for assisted corpus construction: A case study in knowledge discovery from biomedical text

H Cañizares-Díaz, A Piad-Morffis… - Proceedings of the …, 2021 - aclanthology.org
This paper presents an active learning approach that aims to reduce the human effort
required during the annotation of natural language corpora composed of entities and …

Enhancing IoT Security: A Deep Learning and Active Learning Approach to Intrusion Detection

HF Mahdi, BJ Khadhim - Journal of Robotics and Control (JRC), 2024 - journal.umy.ac.id
In response to the escalating demand for robust security solutions in increasingly complex
Internet of Things (IoT) networks, this study introduces an advanced Intrusion Detection …

Models and processes to extract drug-like molecules from natural language text

Z Hong, JG Pauloski, L Ward, K Chard… - Frontiers in Molecular …, 2021 - frontiersin.org
Researchers worldwide are seeking to repurpose existing drugs or discover new drugs to
counter the disease caused by severe acute respiratory syndrome coronavirus 2 (SARS …