Information extraction from scientific articles: a survey

Z Nasar, SW Jaffry, MK Malik - Scientometrics, 2018 - Springer
In last few decades, with the advent of World Wide Web (WWW), world is being overloaded
with huge data. This huge data carries potential information that once extracted, can be used …

An ontological framework for information extraction from diverse scientific sources

G Zaman, H Mahdin, K Hussain, J Abawajy… - IEEE …, 2021 - ieeexplore.ieee.org
Automatic information extraction from online published scientific documents is useful in
various applications such as tagging, web indexing and search engine optimization. As a …

Machine learning vs. rules and out-of-the-box vs. retrained: An evaluation of open-source bibliographic reference and citation parsers

D Tkaczyk, A Collins, P Sheridan, J Beel - … of the 18th ACM/IEEE on joint …, 2018 - dl.acm.org
Bibliographic reference parsing refers to extracting machine-readable metadata, such as the
names of the authors, the title, or journal name, from bibliographic reference strings. Many …

A benchmark of pdf information extraction tools using a multi-task and multi-domain evaluation framework for academic documents

N Meuschke, A Jagdale, T Spinde, J Mitrović… - International Conference …, 2023 - Springer
Extracting information from academic PDF documents is crucial for numerous indexing,
retrieval, and analysis use cases. Choosing the best tool to extract specific content elements …

[PDF][PDF] Assessment of Information Extraction Techniques, Models and Systems.

A Rahman, D Musleh, M Nabil, H Alubaidan… - Mathematical …, 2022 - academia.edu
The present article aims to review and evaluate the practiced and classical techniques,
tools, models, and systems concerning automatic information extraction (IE) from published …

Weighted high-order hidden Markov models for compound emotions recognition in text

C Quan, F Ren - Information Sciences, 2016 - Elsevier
Emotion recognition in text has attracted a great deal of attention recently due to many
practical applications and challenging research problems. In this paper, we explore an …

A hidden Markov model with dependence jumps for predictive modeling of multidimensional time-series

A Petropoulos, SP Chatzis, S Xanthopoulos - Information Sciences, 2017 - Elsevier
Abstract Hidden Markov models (HMMs) are a popular approach for modeling sequential
data, typically based on the assumption of a first-or moderate-order Markov chain. However …

Building an annotated corpus for automatic metadata extraction from multilingual journal article references

W Choi, HM Yoon, MH Hyun, HJ Lee, JW Seol, KD Lee… - PloS one, 2023 - journals.plos.org
Bibliographic references containing citation information of academic literature play an
important role as a medium connecting earlier and recent studies. As references contain …

New methods for metadata extraction from scientific literature

D Tkaczyk - arXiv preprint arXiv:1710.10201, 2017 - arxiv.org
Within the past few decades we have witnessed digital revolution, which moved scholarly
communication to electronic media and also resulted in a substantial increase in its volume …

A modular metadata extraction system for born-digital articles

D Tkaczyk, L Bolikowski, A Czeczko… - 2012 10th IAPR …, 2012 - ieeexplore.ieee.org
We present a comprehensive system for extracting metadata from scholarly articles. In our
approach the entire document is inspected, including headers and footers of all the pages …