An Efficient Mechanism for Product Data Extraction from E-Commerce Websites.
MJ Akhtar, Z Ahmad, R Amin, SH Almotiri… - … , Materials & Continua, 2020 - academia.edu
A large amount of data is present on the web which can be used for useful purposes like a
product recommendation, price comparison and demand forecasting for a particular product …
Automatic web page segmentation and noise removal for structured extraction using tag path sequences
RP Velloso, CF Dorneles - Journal of Information and Data …, 2013 - journals-sol.sbc.org.br
Web page segmentation and data cleaning are essential steps in structured web data
extraction. Identifying a web page main content region, removing what is not important …
From web scraping to web crawling
H Nigam, P Biswas - Applications of Artificial Intelligence and Machine …, 2021 - Springer
Abstract The World Wide Web is the largest database comprising information in various
forms from text to audio/video and in many other designs. However, most of the data …
A novel alignment algorithm for effective web data extraction from singleton-item pages
OY Yuliana, CH Chang - Applied Intelligence, 2018 - Springer
Automatic data extraction from template pages is an essential task for data integration and
data analysis. Most researches focus on data extraction from list pages. The problem of data …
Information extraction for deep web using repetitive subject pattern
W Thamviset, S Wongthanavasu - World Wide Web, 2014 - Springer
In this paper, we propose an information extraction (IE) system for extracting data records
from semi-structured documents on the Deep Web using a promising proposed technique …
Predicate enrichment of aligned XPaths for wrapper induction
J Nielandt, A Bronselaer, G De Tré - Expert Systems with Applications, 2016 - Elsevier
Extracting data from various semi-structured sources is a topic that has received a lot of
attention. Wrapper induction specifically has been studied extensively, where users …
Information extraction from the web by matching visual presentation patterns
R Burget - Knowledge Graphs and Language Technology: ISWC …, 2017 - Springer
The documents available in the World Wide Web contain large amounts of information
presented in tables, lists or other visually regular structures. The published information is …
Data acquisition and information extraction for scientific knowledge base building
P Andruszkiewicz, H Rybinski - 2018 IEEE 12th International …, 2018 - ieeexplore.ieee.org
Here we present the process of data acquisition and information extraction for building a
comprehensive and accurate scientific knowledge base including conferences, publications …
A model for content enrichment of institutional repositories using Linked Data
V Kumar - Journal of Web Librarianship, 2018 - Taylor & Francis
Institutional repositories have positioned themselves as an essential service for many
libraries. Content-enriched metadata in library records is reported as being helpful to library …
A hybrid approach for extracting web information
R Abarna, S Pradeepa - Indian Journal …, 2015 - sciresol.s3.us-east-2.amazonaws …
Mining the webpage is the predominant technique to grab the data from the internet. It is the
extracting job from the web pages in either supervised or unsupervised. Unsupervised …