Data-Centric Systems and Applications

MJ Carey, S Ceri, P Bernstein, U Dayal, C Faloutsos… - Italy: Springer, 2006 - Springer
The rapid growth of the Web in the past two decades has made it the largest publicly
accessible data source in the world. Web mining aims to discover useful information or …

Frameworks for entity matching: A comparison

H Köpcke, E Rahm - Data & Knowledge Engineering, 2010 - Elsevier
Entity matching is a crucial and difficult task for data integration. Entity matching frameworks
provide several methods and their combination to effectively solve different match tasks. In …

Searching and browsing linked data with swse: The semantic web search engine

A Hogan, A Harth, J Umbrich, S Kinsella… - Journal of web …, 2011 - Elsevier
In this paper, we discuss the architecture and implementation of the Semantic Web Search
Engine (SWSE). Following traditional search engine architecture, SWSE consists of …

Domain-independent data cleaning via analysis of entity-relationship graph

DV Kalashnikov, S Mehrotra - ACM Transactions on Database Systems …, 2006 - dl.acm.org
In this article, we address the problem of reference disambiguation. Specifically, we consider
a situation where entities in the database are referred to using descriptions (eg, a set of …

Linking temporal records

P Li, XL Dong, A Maurino, D Srivastava - Proceedings of the VLDB …, 2011 - dl.acm.org
Many data sets contain temporal records over a long period of time; each record is
associated with a time stamp and describes some aspects of a real-world entity at that …

Scalable and distributed methods for entity matching, consolidation and disambiguation over linked data corpora

A Hogan, A Zimmermann, J Umbrich, A Polleres… - Journal of Web …, 2012 - Elsevier
With respect to large-scale, static, Linked Data corpora, in this paper we discuss scalable
and distributed methods for entity consolidation (aka. smushing, entity resolution, object …

Exploiting relationships for domain-independent data cleaning

DV Kalashnikov, S Mehrotra, Z Chen - Proceedings of the 2005 SIAM …, 2005 - SIAM
In this paper we address the problem of reference disambiguation. Specifically, we consider
a situation where entities in the database are referred to using descriptions (eg, a set of …

Exploiting context analysis for combining multiple entity resolution systems

Z Chen, DV Kalashnikov, S Mehrotra - Proceedings of the 2009 ACM …, 2009 - dl.acm.org
Entity Resolution (ER) is an important real world problem that has attracted significant
research interest over the past few years. It deals with determining which object descriptions …

[PDF][PDF] Object-level Vertical Search.

Z Nie, JR Wen, WY Ma - CIDR, 2007 - Citeseer
Current web search engines essentially conduct document-level ranking and retrieval.
However, structured information about realworld objects embedded in static webpages and …

Web people search via connection analysis

DV Kalashnikov, Z Chen, S Mehrotra… - … on Knowledge and …, 2008 - ieeexplore.ieee.org
Nowadays, searches for webpages of a person with a given name constitute a notable
fraction of queries to web search engines. Such a query would normally return webpages …