Data fusion
J Bleiholder, F Naumann - ACM computing surveys (CSUR), 2009 - dl.acm.org
The development of the Internet in recent years has made it possible and useful to access
many different information systems anywhere in the world to obtain information. While there …
[BOOK][B] Problems, methods, and challenges in comprehensive data cleansing
H Müller, JC Freytag - 2005 - pubs.dbs.uni-leipzig.de
Cleansing data from impurities is an integral part of data processing and maintenance. This
has led to the development of a broad range of methods intending to enhance the accuracy …
[PDF][PDF] A survey of data quality tools.
J Barateiro, H Galhardas - Datenbank-Spektrum, 2005 - Citeseer
Data quality tools aim at detecting and correcting data problems that affect the accuracy and
efficiency of data analysis applications. We propose a classification of the most relevant …
Fusionplex: resolution of data inconsistencies in the integration of heterogeneous information sources
Fusionplex is a system for integrating multiple heterogeneous and autonomous information
sources that uses data fusion to resolve factual inconsistencies among the individual …
[PDF][PDF] Data freshness and data accuracy: A state of the art
V Peralta - Instituto de Computacion, Facultad de Ingenieria …, 2006 - fing.edu.uy
In a context of Data Integration Systems (DIS) providing access to large amounts of data
extracted and integrated from autonomous data sources, users are highly concerned about …
Efficient similarity-based operations for data integration
Dealing with discrepancies in data is still a big challenge in data integration systems. The
problem occurs both during eliminating duplicates from semantically overlapping sources as …
A computational biology database digest: data, data analysis, and data management
Abstract Computational Biology or Bioinformatics has been defined as the application of
mathematical and Computer Science methods to solving problems in Molecular Biology that …
Declarative data fusion–syntax, semantics, and implementation
J Bleiholder, F Naumann - East European Conference on Advances in …, 2005 - Springer
In today's integrating information systems data fusion, i.e., the merging of multiple tuples
about the same real-world object into a single tuple, is left to ETL tools and other specialized …
A data preparation framework based on a multidatabase language
KU Sattler, E Schallehn - Proceedings 2001 International …, 2001 - ieeexplore.ieee.org
Integration and analysis of data from different sources have to deal with several problems
resulting from potential heterogeneities. The activities addressing these problems are called …
Advanced grouping and aggregation for data integration
New applications from the areas of analytical data processing and data integration require
powerful features to condense and reconcile available data. As outlined in [1], the general …