Web data mining research: a survey

B Singh, HK Singh - 2010 IEEE international conference on …, 2010 - ieeexplore.ieee.org
Web Data Mining is an important area of Data Mining which deals with the extraction of
interesting knowledge from the World Wide Web. It can be classified into three different types …

A segment-based approach to clustering multi-topic documents

A Tagarelli, G Karypis - Knowledge and information systems, 2013 - Springer
Document clustering has been recognized as a central problem in text data management.
Such a problem becomes particularly challenging when document contents are …

Explaining a bag of words with hierarchical conceptual labels

H Jiang, Y Xiao, W Wang - World Wide Web, 2020 - Springer
In natural language processing and information retrieval tasks, the bag-of-words model is
widely used to represent the semantics of texts. However, it is difficult for machines to …

Signed approach for mining web content outliers

G Poonkuzhali, K Thiagarajan, K Sarukesi… - International Journal of …, 2009 - Citeseer
The emergence of the Internet has brewed the revolution of information storage and
retrieval. As most of the data in the web is unstructured, and contains a mix of text, video …

Semantic clustering of search engine results

SS Soliman, MF El-Sayed… - The Scientific World …, 2015 - Wiley Online Library
This paper presents a novel approach for search engine results clustering that relies on the
semantics of the retrieved documents rather than the terms in those documents. The …

Web data mining trends and techniques

UM Patil, JB Patil - Proceedings of the International Conference on …, 2012 - dl.acm.org
Web Services and Web-based applications are growing at an exponential rate. This is
generating a huge amount of Web data having its own peculiar characteristics. This in turn …

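Clustering-based browsing techniques in search engines

李红梅, 丁振国, 周水生, 周利华 - 2008 - jcip.cipsc.org.cn
Most search engines present search results to users as a list of documents. With the rapid
growth in the number of Web documents, it has become increasingly difficult for users to find
relevant information. One solution is to cluster the search results to improve their browsability …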

Enhancing the search engine results through web content ranking

SS Bama, MSI Ahmed, A Saravanan - International Journal of …, 2015 - researchgate.net
The development of the Internet has made the rebellion in the field of information storage
and retrieval. In such case, search engines play a vital role in retrieving and organizing …

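Search result clustering with initial selection optimized by ontology extraction

陈毅恒, 秦兵, 宋凡, 刘挺, 李生 - Acta Electronica Sinica (电子学报), 2008 - ejournal.org.cn
This paper addresses the problem that, as the volume of data on the Internet keeps growing,
accurate retrieval by search engines is becoming increasingly difficult. To improve the structured
clustering of the results returned by search engines and make information faster to locate, this
paper adopts a label-based clustering …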

Improve firefly heuristic optimization scheme for web based information retrieval

S Pandey, S Lahoti - 2023 IEEE International Conference on …, 2023 - ieeexplore.ieee.org
The main goal of web Information Retrieval (IR) is to find highly relevant data for Internet
users. Nowadays, huge billions of websites are populated day by day. The information …