Data-Centric Systems and Applications
The rapid growth of the Web in the past two decades has made it the largest publicly
accessible data source in the world. Web mining aims to discover useful information or …
accessible data source in the world. Web mining aims to discover useful information or …
Frameworks for entity matching: A comparison
Entity matching is a crucial and difficult task for data integration. Entity matching frameworks
provide several methods and their combination to effectively solve different match tasks. In …
provide several methods and their combination to effectively solve different match tasks. In …
Searching and browsing linked data with swse: The semantic web search engine
In this paper, we discuss the architecture and implementation of the Semantic Web Search
Engine (SWSE). Following traditional search engine architecture, SWSE consists of …
Engine (SWSE). Following traditional search engine architecture, SWSE consists of …
Domain-independent data cleaning via analysis of entity-relationship graph
DV Kalashnikov, S Mehrotra - ACM Transactions on Database Systems …, 2006 - dl.acm.org
In this article, we address the problem of reference disambiguation. Specifically, we consider
a situation where entities in the database are referred to using descriptions (eg, a set of …
a situation where entities in the database are referred to using descriptions (eg, a set of …
Linking temporal records
Many data sets contain temporal records over a long period of time; each record is
associated with a time stamp and describes some aspects of a real-world entity at that …
associated with a time stamp and describes some aspects of a real-world entity at that …
Scalable and distributed methods for entity matching, consolidation and disambiguation over linked data corpora
With respect to large-scale, static, Linked Data corpora, in this paper we discuss scalable
and distributed methods for entity consolidation (aka. smushing, entity resolution, object …
and distributed methods for entity consolidation (aka. smushing, entity resolution, object …
Exploiting relationships for domain-independent data cleaning
DV Kalashnikov, S Mehrotra, Z Chen - Proceedings of the 2005 SIAM …, 2005 - SIAM
In this paper we address the problem of reference disambiguation. Specifically, we consider
a situation where entities in the database are referred to using descriptions (eg, a set of …
a situation where entities in the database are referred to using descriptions (eg, a set of …
Exploiting context analysis for combining multiple entity resolution systems
Z Chen, DV Kalashnikov, S Mehrotra - Proceedings of the 2009 ACM …, 2009 - dl.acm.org
Entity Resolution (ER) is an important real world problem that has attracted significant
research interest over the past few years. It deals with determining which object descriptions …
research interest over the past few years. It deals with determining which object descriptions …
Web people search via connection analysis
DV Kalashnikov, Z Chen, S Mehrotra… - … on Knowledge and …, 2008 - ieeexplore.ieee.org
Nowadays, searches for webpages of a person with a given name constitute a notable
fraction of queries to web search engines. Such a query would normally return webpages …
fraction of queries to web search engines. Such a query would normally return webpages …