[PDF][PDF] Data cleaning: Problems and current approaches
E Rahm, HH Do - IEEE Data Eng. Bull., 2000 - cs.brown.edu
We classify data quality problems that are addressed by data cleaning and provide an
overview of the main solution approaches. Data cleaning is especially required when …
Semantic integration research in the database community: A brief survey
Semantic integration has been a long-standing challenge for the database community. It has
received steady attention over the past two decades, and has now become a prominent area …
Neural networks for entity matching: A survey
Entity matching is the problem of identifying which records refer to the same real-world
entity. It has been actively researched for decades, and a variety of different approaches …
[BOOK][B] Web data mining: exploring hyperlinks, contents, and usage data
B Liu - 2011 - Springer
Liu has written a comprehensive text on Web mining, which consists of two parts. The first
part covers the data mining and machine learning foundations, where all the essential …
[BOOK][B] Principles of distributed database systems
MT Özsu, P Valduriez - 1999 - Springer
The first edition of this book appeared in 1991 when the technology was new and there were
not too many products. In the Preface to the first edition, we had quoted Michael Stonebraker …
[BOOK][B] Ontology matching
J Euzenat, P Shvaiko - 2007 - Springer
An ontology typically provides a vocabulary describing a domain of interest and a
specification of the meaning of terms in that vocabulary. Depending on the precision of this …
A survey of approaches to automatic schema matching
E Rahm, PA Bernstein - the VLDB Journal, 2001 - Springer
Schema matching is a basic problem in many database application domains, such as data
integration, E-business, data warehousing, and semantic query processing. In current …
Similarity flooding: A versatile graph matching algorithm and its application to schema matching
S Melnik, H Garcia-Molina… - … international conference on …, 2002 - ieeexplore.ieee.org
Matching elements of two data schemas or two data instances plays a key role in data
warehousing, e-business, or even biochemical applications. In this paper we present a …
Data-Centric Systems and Applications
The rapid growth of the Web in the past two decades has made it the largest publicly
accessible data source in the world. Web mining aims to discover useful information or …