[PDF][PDF] An Efficient Mechanism for Product Data Extraction from E-Commerce Websites.

MJ Akhtar, Z Ahmad, R Amin, SH Almotiri… - … , Materials & Continua, 2020 - academia.edu
A large amount of data is present on the web which can be used for useful purposes like a
product recommendation, price comparison and demand forecasting for a particular product …

Automatic web page segmentation and noise removal for structured extraction using tag path sequences

RP Velloso, CF Dorneles - Journal of Information and Data …, 2013 - journals-sol.sbc.org.br
Web page segmentation and data cleaning are essential steps in structured web data
extraction. Identifying a web page main content region, removing what is not important …

From web scraping to web crawling

H Nigam, P Biswas - Applications of Artificial Intelligence and Machine …, 2021 - Springer
Abstract The World Wide Web is the largest database comprising information in various
forms from text to audio/video and in many other designs. However, most of the data …

A novel alignment algorithm for effective web data extraction from singleton-item pages

OY Yuliana, CH Chang - Applied Intelligence, 2018 - Springer
Automatic data extraction from template pages is an essential task for data integration and
data analysis. Most researches focus on data extraction from list pages. The problem of data …

Information extraction for deep web using repetitive subject pattern

W Thamviset, S Wongthanavasu - World Wide Web, 2014 - Springer
In this paper, we propose an information extraction (IE) system for extracting data records
from semi-structured documents on the Deep Web using a promising proposed technique …

Predicate enrichment of aligned XPaths for wrapper induction

J Nielandt, A Bronselaer, G De Tré - Expert Systems with Applications, 2016 - Elsevier
Extracting data from various semi-structured sources is a topic that has received a lot of
attention. Wrapper induction specifically has been studied extensively, where users …

Information extraction from the web by matching visual presentation patterns

R Burget - Knowledge Graphs and Language Technology: ISWC …, 2017 - Springer
The documents available in the World Wide Web contain large amounts of information
presented in tables, lists or other visually regular structures. The published information is …

Data acquisition and information extraction for scientific knowledge base building

P Andruszkiewicz, H Rybinski - 2018 IEEE 12th International …, 2018 - ieeexplore.ieee.org
Here we present the process of data acquisition and information extraction for building a
comprehensive and accurate scientific knowledge base including conferences, publications …

A model for content enrichment of institutional repositories using Linked Data

V Kumar - Journal of Web Librarianship, 2018 - Taylor & Francis
Institutional repositories have positioned themselves as an essential service for many
libraries. Content-enriched metadata in library records is reported as being helpful to library …

[PDF][PDF] A hybrid approach for extracting web information

R Abarna, S Pradeepa - Indian Journal …, 2015 - sciresol.s3.us-east-2.amazonaws …
Mining the webpage is the predominant technique to grab the data from the internet. It is the
extracting job from the web pages in either supervised or unsupervised. Unsupervised …