[PDF][PDF] Crawling and mining the dark web: A survey on existing and new approaches

MK Alshammery, AF Aljuboori - Iraqi Journal of Science, 2022 - iasj.net
The last two decades have seen a marked increase in the illegal activities on the Dark Web.
Prompt evolvement and use of sophisticated protocols make it difficult for security agencies …

Efficient sentiment-aware web crawling methods for constructing sentiment dictionary

BW On, JY Jo, H Shin, J Gim, GS Choi, SM Jung - IEEE Access, 2021 - ieeexplore.ieee.org
In traditional web crawling, all web pages crawled are first stored to databases. As a result,
this approach can store unnecessary web pages and requires additional running time for the …

[PDF][PDF] A new approach to web crawling—dhekts crawler in comparison with various crawlers

K Thirugnanasambanthan - Indian Journal …, 2021 - sciresol.s3.us-east-2.amazonaws …
Objectives: To propose a crawler to visit websites for collecting information and create a
search engine index for reference; To compare various crawler License, language used for …

[HTML][HTML] Proyecto de web semántica de autoridades en PARES: extracción y análisis inicial

M Blázquez-Ochando… - Revista panamericana de …, 2024 - scielo.org.mx
La investigación se centra en describir los tipos de autoridades del Portal de Archivos
Españoles, aportando su cuantificación, y ratio relacional, con el fin de delinear el grafo …

Web Crawler based Caching Technique for efficient Downloading

N Bhatnagar, S Johari - 2021 Sixth International Conference on …, 2021 - ieeexplore.ieee.org
With internet availability in almost all types of devices, downloading has become a
widespread activity. Data is consumed when any content is downloaded from the web …

Web Crawler for Ranking of Websites Based on Web Traffic and Page Views

N Agrawal, S Pant - Proceedings of International Conference on Machine …, 2021 - Springer
Algorithms for crawling of the web, indexing and the order of search results are determined
by the ranking of pages. In this paper, we have designed one web crawler that is based on …