作者
Anália G Lourenço, Orlando O Belo
发表日期
2006/7/11
图书
Proceedings of the 6th international Conference on Web Engineering
页码范围
265-272
简介
This paper recommends a new approach to the detection and containment of Web crawler traverses based on clickstream data mining. Timely detection prevents crawler abusive consumption of Web server resources and eventual site contents privacy or copyrights violation. Clickstream data differentiation ensures focused usage analysis, valuable both for regular users and crawler profiling. Our platform, named ClickTips, sustains a site-specific, updatable detection model that tags Web crawler traverses based on incremental Web session inspection and a decision model that assesses eventual containment. The goal is to deliver a model flexible enough to keep up with crawling continuous evolving and that is capable of detecting crawler presence as soon as possible. We use a real-world Web site case study as a support for process description, as well as, to evaluate the accuracy of the obtained classification …
引用总数
20082009201020112012201320142015201620172018201920202021202220232024314475334312322
学术搜索中的文章
AG Lourenço, OO Belo - Proceedings of the 6th international Conference on …, 2006