An extended overview of the CLEF 2020 ChEMU lab: information extraction of chemical reactions from patents

J He, DQ Nguyen, SA Akhondi… - Proceedings of the …, 2020 - arodes.hes-so.ch
Résumé The discovery of new chemical compounds is perceived as a key driver of the
chemistry industry and many other economic sectors. The information about the new …

Chemu 2020: Natural language processing methods are effective for information extraction from chemical patents

J He, DQ Nguyen, SA Akhondi… - Frontiers in Research …, 2021 - frontiersin.org
Chemical patents represent a valuable source of information about new chemical
compounds, which is critical to the drug discovery process. Automated information extraction …

Towards artificial intelligence at scale in the chemical industry

LH Chiang, B Braun, Z Wang, I Castillo - AIChE Journal, 2022 - Wiley Online Library
Abstract In the Industry 4.0 era, the chemical industry is embracing broad adoption of
artificial intelligence (AI) and machine learning (ML) methods. This article provides a holistic …

Overview of ChEMU 2020: named entity recognition and event extraction of chemical reactions from patents

J He, DQ Nguyen, SA Akhondi, C Druckenbrodt… - Experimental IR Meets …, 2020 - Springer
In this paper, we provide an overview of the Cheminformatics Elsevier Melbourne University
(ChEMU) evaluation lab 2020, part of the Conference and Labs of the Evaluation Forum …

ChEMU-Ref: a corpus for modeling anaphora resolution in the chemical domain

B Fang, C Druckenbrodt, SA Akhondi… - Proceedings of the …, 2021 - aclanthology.org
Chemical patents contain rich coreference and bridging links, which are the target of this
research. Specially, we introduce a novel annotation scheme, based on which we create the …

Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction

Q Zhang, VSJ Huang, B Wang, J Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Document parsing is essential for converting unstructured and semi-structured documents-
such as contracts, academic papers, and invoices-into structured, machine-readable data …

Ensemble of deep masked language models for effective named entity recognition in health and life science corpora

N Naderi, J Knafou, J Copara, P Ruch… - Frontiers in research …, 2021 - frontiersin.org
The health and life science domains are well known for their wealth of named entities found
in large free text corpora, such as scientific literature and electronic health records. To …

Focused Contrastive Loss for Classification With Pre-Trained Language Models

J He, Y Li, Z Zhai, B Fang, C Thorne… - … on Knowledge and …, 2023 - ieeexplore.ieee.org
Contrastive learning, which learns data representations by contrasting similar and dissimilar
instances, has achieved great success in various domains including natural language …

Overview of ChEMU 2022 Evaluation Campaign: Information Extraction in Chemical Patents

Y Li, B Fang, J He, H Yoshikawa, SA Akhondi… - … Conference of the Cross …, 2022 - Springer
In this paper, we provide an overview of the Cheminformatics Elsevier Melbourne University
(ChEMU) evaluation lab 2022, part of the Conference and Labs of the Evaluation Forum …