Symbolic verification of web crawler functionality and its properties
KS Shetty, S Bhat, S Singh - 2012 International Conference on …, 2012 - ieeexplore.ieee.org
Now a days people use search engines every now and then to retrieve documents from the
Web. Web crawling is the process by which a search engine gather pages from the Web to …
Web. Web crawling is the process by which a search engine gather pages from the Web to …
Removing DUST using multiple alignment of sequences
K Rodrigues, M Cristo, ES de Moura… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
A large number of URLs collected by web crawlers correspond to pages with duplicate or
near-duplicate contents. To crawl, store, and use such duplicated data implies a waste of …
near-duplicate contents. To crawl, store, and use such duplicated data implies a waste of …
Intelligent Event Finder and Management System
P Afsar, M Faizudheen, MJ Anikkadan… - … Conference on I …, 2021 - ieeexplore.ieee.org
An event can be a social or life cycle event which includes, engagement, wedding or an
education or career based events like workshops, seminars and so on. Events are …
education or career based events like workshops, seminars and so on. Events are …
Identifying equivalent links on a page
KA Ayoub, P Ionescu, IV Onut, WD Smith - US Patent 9,792,370, 2017 - Google Patents
A computer-implemented process for identifying equivalent links on a page responsive to a
determination that the crawler has not visited all required universal resource locators …
determination that the crawler has not visited all required universal resource locators …
Web Crawler with URL signature—a performance study
LK Soon, YE Ku, SH Lee - … 4th Conference on Data Mining and …, 2012 - ieeexplore.ieee.org
URL signature was proposed to be implemented in web crawling, aiming to avoid
processing duplicated web pages for further web crawling. In this paper, we present our …
processing duplicated web pages for further web crawling. In this paper, we present our …
Reducing redundantweb crawling using url signatures
The existing architecture of WWW uses URL to identify web pages. Web crawlers rely on
URL normalization in order to identify equivalent URLs, which link to the same web pages …
URL normalization in order to identify equivalent URLs, which link to the same web pages …
Clustering based URL normalization technique for Web mining
NK Nagwani - 2010 International Conference on Advances in …, 2010 - ieeexplore.ieee.org
URL (Uniform Resource Locator) normalization is an important activity in web mining. Web
data can be retrieved in smoother way using effective URL normalization technique. URL …
data can be retrieved in smoother way using effective URL normalization technique. URL …
Comparative analysis of URL shortening algorithms: Рerformance considerations in the application domain оf web search and information retrieval technology
ІВ Кириченко, ЄО Шамрай - Біоніка інтелекту, 2022 - bionics.nure.ua
URL shortening is a process of converting a long URL to a shorter one, which redirects to
the original URL. The shortened URL is used to share links, save characters in social media …
the original URL. The shortened URL is used to share links, save characters in social media …
Algorithm for detecting dynamic webpage and its importance
AK Sultania - 2012 International Conference on Radar …, 2012 - ieeexplore.ieee.org
During web search using crawling, indexing, relevance it is found that there exist many
duplicate web-pages with different URLs, these URLs are normalized when used by crawler …
duplicate web-pages with different URLs, these URLs are normalized when used by crawler …
Identifying equivalent links on a page
KA Ayoub, P Ionescu, IV Onut, WD Smith - US Patent 10,621,255, 2020 - Google Patents
(57) ABSTRACT A computer-implemented process for identifying equivalent links on a page
responsive to a determination that the crawler has not visited all required universal resource …
responsive to a determination that the crawler has not visited all required universal resource …