Streaming-data algorithms for high-quality clustering

L O'callaghan, N Mishra, A Meyerson… - … conference on data …, 2002 - ieeexplore.ieee.org
Streaming data analysis has recently attracted attention in numerous applications including
telephone records, Web documents and click streams. For such analysis, single-pass …

Learning object identification rules for information integration

S Tejada, CA Knoblock, S Minton - Information Systems, 2001 - Elsevier
When integrating information from multiple websites, the same data objects can exist in
inconsistent text formats across sites, making it difficult to identify matching objects using …

Semantic e-workflow composition

J Cardoso, A Sheth - Journal of intelligent information systems, 2003 - Springer
Abstract Systems and infrastructures are currently being developed to support Web services.
The main idea is to encapsulate an organization's functionality within an appropriate …

[引用][C] Data quality

RY Wang - 2001 - books.google.com
Data Quality provides an exposé of research and practice in the data quality field for
technically oriented readers. It is based on the research conducted at the MIT Total Data …

Context interchange: New features and formalisms for the intelligent integration of information

CH Goh, S Bressan, S Madnick, M Siegel - ACM Transactions on …, 1999 - dl.acm.org
The Context Interchange strategy presents a novel perspective for mediated data access in
which semantic conflicts among heterogeneous systems are not identified a priori, but are …

[PDF][PDF] Quality of service and semantic composition of workflows

AJS Cardoso - 2002 - academia.edu
Workflow management systems (WfMSs) have been used to support a variety of business
processes. As organizations adopt new working models, such as e-commerce, new …

The DaQuinCIS architecture: a platform for exchanging and improving data quality in cooperative information systems

M Scannapieco, A Virgillito, C Marchetti, M Mecella… - Information systems, 2004 - Elsevier
In cooperative information systems, the quality of data exchanged and provided by different
data sources is extremely important. A lack of attention to data quality can imply data of low …

[图书][B] Theories of geographic concepts: ontological approaches to semantic integration

M Kavouras, M Kokla - 2007 - taylorfrancis.com
Most widely available approaches to semantic integration provide ad-hoc, non-systematic,
subjective manual mappings that lead to procrustean amalgamations to fit the target …

A knowledge-based approach for duplicate elimination in data cleaning

WL Low, ML Lee, TW Ling - Information Systems, 2001 - Elsevier
Existing duplicate elimination methods for data cleaning work on the basis of computing the
degree of similarity between nearby records in a sorted database. High recall can be …

Database integration: the key to data interoperability

C Parent, S Spaccapietra - 2000 - direct.mit.edu
Most of new databases are no more built from scratch, but re-use existing data from several
autonomous data stores. To facilitate application development, the data to be re-used …