Detecting duplicate records in scientific workflow results

K Belhajjame, P Missier, CA Goble - … IPAW 2012, Santa Barbara, CA, USA …, 2012 - Springer
Scientific workflows are often data intensive. The data sets obtained by enacting scientific
workflows have several applications, eg, they can be used to identify data correlations or to …

[PDF][PDF] Exploring Linked Data at Web Scale

A Harth - 2010 - harth.org
Universities, government agencies, companies, and individuals are increasingly releasing
torrents of data encoded in structured formats. At the same time, the web is evolving from a …

[PDF][PDF] Scalable and Distributed Methods for Resolving, Consolidating, Matching and Disambiguating Entities in Linked Data Corpora

A Hogan, A Zimmermann, J Umbrich… - Journal of Web …, 2010 - aidanhogan.com
With respect to large-scale, static, Linked Data corpora, in this paper we discuss scalable
and distributed methods for:(i) entity consolidation—identifying entities which signify the …

[PDF][PDF] Performing object consolidation on the semantic web data

A Hogan, A Harth, S Decker - Science, 1959 - Citeseer
An important aspect of Semantic Web technologies is the issue of identity and uniquely
identifying resources, which is essential for integrating data across sources. Currently, there …

[PDF][PDF] Information Systems for Global Financial Markets: Emerging Developments and

AY Yap - 2012 - researchgate.net
Corporate bankruptcy has been always an active area of financial research. Furthermore,
after the Lehman Brothers' default and its consequences on the global financial system, this …

[PDF][PDF] Towards A Towards A New Token Based Framework for New Token Based Framework for New Token Based Framework for Record Linkage in Record Linkage …

HHA Ghafour, A El-Bastawissy, AA Hegazy - IJCSNS, 2011 - academia.edu
Record linkage is the process of identifying if two records represent the same real entity or
not. Record Linkage is one of the most important and most investigated issue in data quality …

[PDF][PDF] Linked Data Driven Information Systems as an enabler for

S O'Riain, A Harth, E Curry - 2011 - researchrepository …
With increased dependence on efficient use and inclusion of diverse corporate and Web
based data sources for business information analysis, financial information providers will …

[PDF][PDF] New Forms of Reasoning for the Semantic Web: Scalable & Dynamic

S Ceri, E Della Valle, J Hendler, Z Huang - 2010 - wasp.cs.vu.nl
Initiatives like Linked Open Data have resulted in a rapid growth of the Web of data, and this
growth is expected to continue. While impressive progress has been made in recent years in …

レコード同定問題に関する研究の課題と現状

相澤彰子, 大山敬三, 高須淳宏… - 電子情報通信学会論文誌 D, 2005 - search.ieice.org
単一あるいは異なる情報源の間で重複するレコードを見つけ出す 「レコード同定」 は,
データベースの品質管理やデータ統合に必須の技術である. しかしながら, このレコード間の照合は …