Data mining for hypertext: A tutorial survey

S Chakrabarti - ACM SIGKDD explorations newsletter, 2000 - dl.acm.org
With over 800 million pages covering most areas of human endeavor, the World-wide Web
is a fertile ground for data mining research to make a difference to the effectiveness of …

[图书][B] Mining the Web: Discovering knowledge from hypertext data

S Chakrabarti - 2002 - books.google.com
Mining the Web: Discovering Knowledge from Hypertext Data is the first book devoted
entirely to techniques for producing knowledge from the vast body of unstructured Web data …

Ontologies improve text document clustering

A Hotho, S Staab, G Stumme - Third IEEE international …, 2003 - ieeexplore.ieee.org
Text document clustering plays an important role in providing intuitive navigation and
browsing mechanisms by organizing large sets of documents into a small number of …

[PDF][PDF] Using WordNet for Text Categorization.

Z Elberrichi, A Rahmoun, MA Bentaalah - International Arab Journal of …, 2008 - iajit.org
This paper explores a method that use WordNet concept to categorize text documents. The
bag of words representation used for text representation is unsatisfactory as it ignores …

Exploiting noun phrases and semantic relationships for text document clustering

HT Zheng, BY Kang, HG Kim - Information Sciences, 2009 - Elsevier
Text document clustering plays an important role in providing better document retrieval,
document browsing, and text mining. Traditionally, clustering techniques do not consider the …

A new unsupervised method for document clustering by using WordNet lexical and conceptual relations

D Reforgiato Recupero - Information Retrieval, 2007 - Springer
Text document clustering provides an effective and intuitive navigation mechanism to
organize a large amount of retrieval results by grouping documents in a small number of …

[PDF][PDF] Using cohesion and coherence models for text summarization

I Mani, E Bloedorn, B Gates - Intelligent text summarization symposium, 1998 - cdn.aaai.org
In this paper we investigate two classes of techniques to determine what is salient in a text,
as a means of deciding whether that information should be included in a summary. We …

Generating links to background knowledge: a case study using narrative radiology reports

J He, M de Rijke, M Sevenster… - Proceedings of the 20th …, 2011 - dl.acm.org
Automatically annotating texts with background information has recently received much
attention. We conduct a case study in automatically generating links from narrative radiology …

Context as a spurious concept

G Hirst - arXiv preprint cmp-lg/9712003, 1997 - arxiv.org
I take issue with AI formalizations of context, primarily the formalization by McCarthy and
Buvac, that regard context as an undefined primitive whose formalization can be the same in …

[PDF][PDF] Self organizing map-based document clustering using WordNet ontologies

TF Gharib, MM Fouad, A Mashat… - International Journal of …, 2012 - researchgate.net
With the rapid development of web content, retrieving relevant information is difficult task.
The efficient clustering algorithms are needed to improve the results of the retrieval …