A survey of web clustering engines

C Carpineto, S Osiński, G Romano… - ACM Computing Surveys …, 2009 - dl.acm.org
Web clustering engines organize search results by topic, thus offering a complementary
view to the flat-ranked list returned by conventional search engines. In this survey, we …

XML data clustering: An overview

A Algergawy, M Mesiti, R Nayak, G Saake - ACM Computing Surveys …, 2011 - dl.acm.org
In the last few years we have observed a proliferation of approaches for clustering XML
documents and schemas based on their structure and content. The presence of such a huge …

Clustering XML documents by patterns

M Piernik, D Brzezinski, T Morzy - Knowledge and Information Systems, 2016 - Springer
Now that the use of XML is prevalent, methods for mining semi-structured documents have
become even more important. In particular, one of the areas that could greatly benefit from in …

Return specification inference and result clustering for keyword search on xml

Z Liu, Y Chen - ACM Transactions on Database Systems (TODS), 2010 - dl.acm.org
Keyword search enables Web users to easily access XML data without the need to learn a
structured query language and to study possibly complex data schemas. Existing work has …

XEdge: clustering homogeneous and heterogeneous XML documents using edge summaries

P Antonellis, C Makris, N Tsirakis - … of the 2008 ACM symposium on …, 2008 - dl.acm.org
In this paper we propose a unified clustering algorithm for both homogeneous and
heterogeneous XML documents. Depending on the type of the XML documents, the …

The XTREEM methods for ontology learning from web documents

P Buitelaar, P Cimiano - … : bridging the gap between text and …, 2008 - books.google.com
Ontology Learning is up to now dominated by techniques which use text as input. There are
only few methods which use a different data source. The techniques which use highly …

Exploring dictionary-based semantic relatedness in labeled tree data

A Tagarelli - Information Sciences, 2013 - Elsevier
The increase in the volume and heterogeneity of semistructured data based application
scenarios has demanded for next-generation methods that are able to effectively couple …

Xml Clustering Framework Based on Document Content and Structure in a Heterogeneous Digital Library

N Samadi, SD Ravana - Malaysian Journal of Computer Science, 2023 - jice.um.edu.my
As textually published information is increasing in digital libraries, efficient retrieval methods
are required. Textual documents in a digital library are available in various structures and …

Word sense disambiguation for XML structure feature generation

A Tagarelli, M Longo, S Greco - European Semantic Web Conference, 2009 - Springer
A common limit of most existing methods that manage XML structure information is that they
do not handle the semantic meanings that might be associated to the markup tags. In this …

Enhanced associative classification of XML documents supported by semantic concepts

NT Thasleena, SC Varghese - Procedia Computer Science, 2015 - Elsevier
A novel approach based on supervised classification has been proposed to classify a given
collection of XML documents based on rule based classifier by semantically enriched …