Web page classification: Features and algorithms
X Qi, BD Davison - ACM computing surveys (CSUR), 2009 - dl.acm.org
Classification of Web page content is essential to many tasks in Web information retrieval
such as maintaining Web directories and focused crawling. The uncontrolled nature of Web …
such as maintaining Web directories and focused crawling. The uncontrolled nature of Web …
Text summarisation in progress: a literature review
This paper contains a large literature review in the research field of Text Summarisation (TS)
based on Human Language Technologies (HLT). TS helps users manage the vast amount …
based on Human Language Technologies (HLT). TS helps users manage the vast amount …
[PDF][PDF] Document summarization using conditional random fields.
Many methods, including supervised and unsupervised algorithms, have been developed
for extractive document summarization. Most supervised methods consider the …
for extractive document summarization. Most supervised methods consider the …
[图书][B] An introduction to search engines and web navigation
M Levene - 2011 - books.google.com
This book is a second edition, updated and expanded to explain the technologies that help
us find information on the web. Search engines and web navigation tools have become …
us find information on the web. Search engines and web navigation tools have become …
Fast webpage classification using URL features
We demonstrate the usefulness of the uniform resource locator (URL) alone in performing
web page classification. This approach is faster than typical web page classification, as the …
web page classification. This approach is faster than typical web page classification, as the …
Bridging text visualization and mining: A task-driven survey
Visual text analytics has recently emerged as one of the most prominent topics in both
academic research and the commercial world. To provide an overview of the relevant …
academic research and the commercial world. To provide an overview of the relevant …
Robust IoT time series classification with data compression and deep learning
Abstract Internet of Things (IoT) and wearable systems are very resource limited in terms of
power, memory, bandwidth and processor performance. Sensor time series compression …
power, memory, bandwidth and processor performance. Sensor time series compression …
Interest-based personalized search
Web search engines typically provide search results without considering user interests or
context. We propose a personalized search approach that can easily extend a conventional …
context. We propose a personalized search approach that can easily extend a conventional …
Enhancing diversity, coverage and balance for summarization through structure learning
Document summarization plays an increasingly important role with the exponential growth of
documents on the Web. Many supervised and unsupervised approaches have been …
documents on the Web. Many supervised and unsupervised approaches have been …
Detecting visually similar web pages: Application to phishing detection
We propose a novel approach for detecting visual similarity between two Web pages. The
proposed approach applies Gestalt theory and considers a Web page as a single indivisible …
proposed approach applies Gestalt theory and considers a Web page as a single indivisible …