A database of battery materials auto-generated using ChemDataExtractor

S Huang, JM Cole - Scientific Data, 2020 - nature.com
A database of battery materials is presented which comprises a total of 292,313 data
records, with 214,617 unique chemical-property data relations between 17,354 unique …

Natural language processing techniques for advancing materials discovery: a short review

JH Lee, M Lee, K Min - International Journal of Precision Engineering and …, 2023 - Springer
In the development of new industries, there is a growing demand for innovative materials.
However, locating such materials is a laborious and time-consuming endeavor. In response …

Antennas and Propagation Research From Large-Scale Unstructured Data With Machine Learning: A review and predictions.

YO Cha, AA Ihalage, Y Hao - IEEE Antennas and Propagation …, 2023 - ieeexplore.ieee.org
The past century has witnessed remarkable progress in antennas and propagation (A&P)
research, which has made dramatic changes to our society and life and has led to paradigm …

High-throughput calculation and machine learning of two-dimensional halide perovskite materials: Formation energy and band gap

W Hu, L Zhang - Materials Today Communications, 2023 - Elsevier
Both band gap and stability of halide perovskites are prerequisites for deployable
photovoltaic devices; however, many machine learning researches focus on one target …

Summary generation using natural language processing techniques and cosine similarity

S Pal, M Chang, MF Iriarte - International Conference on Intelligent …, 2021 - Springer
The COVID-19 pandemic has led to an unprecedented challenge to public health. It resulted
in global efforts to understand, record, and alleviate the disease. This research serves the …

Representing multiword chemical terms through phrase-level preprocessing and word embedding

L Huang, C Ling - ACS omega, 2019 - ACS Publications
In recent years, data-driven methods and artificial intelligence have been widely used in
chemoinformatic and material informatics domains, for which the success is critically …

ChemTEB: Chemical Text Embedding Benchmark, an Overview of Embedding Models Performance & Efficiency on a Specific Domain

AS Kasmaee, M Khodadad, MA Saloot… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent advancements in language models have started a new era of superior information
retrieval and content generation, with embedding models playing an important role in …

[PDF][PDF] Lexical and semantic analysis of sacred texts using machine learning and natural language processing

N Varghese, M Punithavalli - International Journal of Scientific & …, 2019 - researchgate.net
Text mining is the process of exploring and analyzing large amounts of text data and
extracting high-quality information based on patterns and trends in data. The patterns and …

Building Materials Classification Model Based on Text Data Enhancement and Semantic Feature Extraction

Q Yan, F Jiao, W Peng - Buildings, 2024 - mdpi.com
In order to accurately extract and match carbon emission factors from the Chinese textual
building materials list and construct a precise carbon emission factor database, it is crucial to …

[PDF][PDF] Text Mining for Energy Materials

L Zhang, M He - J. Res. Sci. Eng., 2022 - scholar.archive.org
The scientific and technological progresses of the chemical and materials science
disciplines lead to a significant amount of numerical and textual data stored in the published …