A survey of Web crawlers for information retrieval

M Kumar, R Bhatia, D Rattan - Wiley Interdisciplinary Reviews …, 2017 - Wiley Online Library
Performance of any search engine relies heavily on its Web crawler. Web crawlers are the
programs that get webpages from the Web by following hyperlinks. These webpages are …

[PDF][PDF] An automated word embedding with parameter tuned model for web crawling

S Neelakandan, A Arun, RR Bhukya… - … Automation & Soft …, 2022 - pdfs.semanticscholar.org
In recent years, web crawling has gained a significant attention due to the drastic
advancements in the World Wide Web. Web Search Engines have the issue of retrieving …

From web catalogs to Google: a retrospective study of web search engines sustainable development

M Duka, M Sikora, A Strzelecki - Sustainability, 2023 - mdpi.com
This study presents a review of search engines and search engine optimization and shows
how the search engine landscape relates to sustainable development. We have used a …

Augmenting user interaction experience through embedded multimodal media agents in social networks

M Matsiola, C Dimoulas, G Kalliris… - Social media and the …, 2015 - igi-global.com
The current chapter proposes media agent and multi-agent models aiming at improving
mediated communication and information exchange in social networking. Great progress …

A semantic focused web crawler based on a knowledge representation schema

J Hernandez, HM Marin-Castro, M Morales-Sandoval - Applied Sciences, 2020 - mdpi.com
The Web has become the main source of information in the digital world, expanding to
heterogeneous domains and continuously growing. By means of a search engine, users can …

An adaptive focused web crawling algorithm based on learning automata

J Akbari Torkestani - Applied Intelligence, 2012 - Springer
The recent years have witnessed the birth and explosive growth of the Web. The exponential
growth of the Web has made it into a huge source of information wherein finding a document …

SpEnD: Linked data SPARQL endpoints discovery using search engines

S Yumusak, E Dogdu, H Kodaz… - … on Information and …, 2017 - search.ieice.org
Linked data endpoints are online query gateways to semantically annotated linked data
sources. In order to query these data sources, SPARQL query language is used as a …

SOF: a semi‐supervised ontology‐learning‐based focused crawler

H Dong, FK Hussain - Concurrency and Computation: Practice …, 2013 - Wiley Online Library
The rapid increase in the volume of data available on the Internet makes it increasingly
impractical for a crawler to index the whole Web. Instead, many intelligent crawlers, known …

State of the art in semantic focused crawlers

H Dong, FK Hussain, E Chang - … Science and Its Applications–ICCSA 2009 …, 2009 - Springer
Nowadays, the research of focused crawler approaches the field of semantic web, along
with the appearance of increasing semantic web documents and the rapid development of …

[PDF][PDF] A link and content hybrid approach for Arabic web spam detection

HA Wahsheh, MN Al-Kabi, IM Alsmadi - International Journal of …, 2013 - academia.edu
Some Web sites developers act as spammers and try to mislead the search engines by
using illegal Search Engine Optimizations (SEO) tips to increase the rank of their Web …