Web page classification: Features and algorithms

X Qi, BD Davison - ACM computing surveys (CSUR), 2009 - dl.acm.org
Classification of Web page content is essential to many tasks in Web information retrieval
such as maintaining Web directories and focused crawling. The uncontrolled nature of Web …

Text summarisation in progress: a literature review

E Lloret, M Palomar - Artificial Intelligence Review, 2012 - Springer
This paper contains a large literature review in the research field of Text Summarisation (TS)
based on Human Language Technologies (HLT). TS helps users manage the vast amount …

[PDF][PDF] Document summarization using conditional random fields.

D Shen, JT Sun, H Li, Q Yang, Z Chen - IJCAI, 2007 - academia.edu
Many methods, including supervised and unsupervised algorithms, have been developed
for extractive document summarization. Most supervised methods consider the …

[图书][B] An introduction to search engines and web navigation

M Levene - 2011 - books.google.com
This book is a second edition, updated and expanded to explain the technologies that help
us find information on the web. Search engines and web navigation tools have become …

Fast webpage classification using URL features

MY Kan, HON Thi - Proceedings of the 14th ACM international …, 2005 - dl.acm.org
We demonstrate the usefulness of the uniform resource locator (URL) alone in performing
web page classification. This approach is faster than typical web page classification, as the …

Bridging text visualization and mining: A task-driven survey

S Liu, X Wang, C Collins, W Dou… - IEEE transactions on …, 2018 - ieeexplore.ieee.org
Visual text analytics has recently emerged as one of the most prominent topics in both
academic research and the commercial world. To provide an overview of the relevant …

Robust IoT time series classification with data compression and deep learning

J Azar, A Makhoul, R Couturier, J Demerjian - Neurocomputing, 2020 - Elsevier
Abstract Internet of Things (IoT) and wearable systems are very resource limited in terms of
power, memory, bandwidth and processor performance. Sensor time series compression …

Interest-based personalized search

Z Ma, G Pant, ORL Sheng - ACM Transactions on Information Systems …, 2007 - dl.acm.org
Web search engines typically provide search results without considering user interests or
context. We propose a personalized search approach that can easily extend a conventional …

Enhancing diversity, coverage and balance for summarization through structure learning

L Li, K Zhou, GR Xue, H Zha, Y Yu - Proceedings of the 18th international …, 2009 - dl.acm.org
Document summarization plays an increasingly important role with the exponential growth of
documents on the Web. Many supervised and unsupervised approaches have been …

Detecting visually similar web pages: Application to phishing detection

TC Chen, S Dick, J Miller - ACM Transactions on Internet Technology …, 2010 - dl.acm.org
We propose a novel approach for detecting visual similarity between two Web pages. The
proposed approach applies Gestalt theory and considers a Web page as a single indivisible …