Data fusion

J Bleiholder, F Naumann - ACM computing surveys (CSUR), 2009 - dl.acm.org
The development of the Internet in recent years has made it possible and useful to access
many different information systems anywhere in the world to obtain information. While there …

[图书][B] Problems, methods, and challenges in comprehensive data cleansing

H Müller, JC Freytag - 2005 - pubs.dbs.uni-leipzig.de
Cleansing data from impurities is an integral part of data processing and maintenance. This
has lead to the development of a broad range of methods intending to enhance the accuracy …

[PDF][PDF] A survey of data quality tools.

J Barateiro, H Galhardas - Datenbank-Spektrum, 2005 - Citeseer
Data quality tools aim at detecting and correcting data problems that affect the accuracy and
efficiency of data analysis applications. We propose a classification of the most relevant …

Fusionplex: resolution of data inconsistencies in the integration of heterogeneous information sources

A Motro, P Anokhin - Information fusion, 2006 - Elsevier
Fusionplex is a system for integrating multiple heterogeneous and autonomous information
sources that uses data fusion to resolve factual inconsistencies among the individual …

[PDF][PDF] Data freshness and data accuracy: A state of the art

V Peralta - Instituto de Computacion, Facultad de Ingenieria …, 2006 - fing.edu.uy
In a context of Data Integration Systems (DIS) providing access to large amounts of data
extracted and integrated from autonomous data sources, users are highly concerned about …

Efficient similarity-based operations for data integration

E Schallehn, KU Sattler, G Saake - Data & Knowledge Engineering, 2004 - Elsevier
Dealing with discrepancies in data is still a big challenge in data integration systems. The
problem occurs both during eliminating duplicates from semantic overlapping sources as …

A computational biology database digest: data, data analysis, and data management

F Bry, P Kröger - Distributed and Parallel Databases, 2003 - Springer
Abstract Computational Biology or Bioinformatics has been defined as the application of
mathematical and Computer Science methods to solving problems in Molecular Biology that …

Declarative data fusion–syntax, semantics, and implementation

J Bleiholder, F Naumann - East European Conference on Advances in …, 2005 - Springer
In today's integrating information systems data fusion, ie, the merging of multiple tuples
about the same real-world object into a single tuple, is left to ETL tools and other specialized …

A data preparation framework based on a multidatabase language

KU Sattler, E Schallehn - Proceedings 2001 International …, 2001 - ieeexplore.ieee.org
Integration and analysis of data from different sources have to deal with several problems
resulting from potential heterogeneities. The activities addressing these problems are called …

Advanced grouping and aggregation for data integration

E Schallehn, KU Sattler, G Saake - Proceedings of the tenth international …, 2001 - dl.acm.org
New applications from the areas of analytical data processing and data integration require
powerful features to condense and reconcile available data. As outlined in [1], the general …