[BOOK][B] Magellan: Toward building entity matching management systems

PV Konda - 2018 - search.proquest.com
Entity matching (EM) identifies data instances that refer to the same real-world entity, such
as (David Smith, UWMadison) and (DM Smith, UWM). This problem has been a long …

Integrating data lake tables

A Khatiwada, R Shraga, W Gatterbauer… - Proceedings of the VLDB …, 2022 - dl.acm.org
We have made tremendous strides in providing tools for data scientists to discover new
tables useful for their analyses. But despite these advances, the proper integration of …

Data market platforms: Trading data assets to solve data problems

RC Fernandez, P Subramaniam… - arXiv preprint arXiv …, 2020 - arxiv.org
Data only generates value for a few organizations with expertise and resources to make
data shareable, discoverable, and easy to integrate. Sharing data that is easy to discover …

Knowledge-driven data ecosystems toward data transparency

S Geisler, ME Vidal, C Cappiello, BF Lóscio… - ACM Journal of Data …, 2021 - dl.acm.org
A data ecosystem (DE) offers a keystone-player or alliance-driven infrastructure that enables
the interaction of different stakeholders and the resolution of interoperability issues among …

METAM: Goal-oriented data discovery

S Galhotra, Y Gong… - 2023 IEEE 39th …, 2023 - ieeexplore.ieee.org
Data is a central component of machine learning and causal inference tasks. The availability
of large amounts of data from sources such as open data repositories, data lakes and data …

MEDTO: Medical data to ontology matching using hybrid graph neural networks

J Hao, C Lei, V Efthymiou, A Quamar, F Özcan… - Proceedings of the 27th …, 2021 - dl.acm.org
Medical ontologies are widely used to describe and organize medical terminologies and to
support many critical applications on healthcare databases. These ontologies are often …

Sudowoodo: Contrastive self-supervised learning for multi-purpose data integration and preparation

R Wang, Y Li, J Wang - 2023 IEEE 39th International …, 2023 - ieeexplore.ieee.org
Machine learning (ML) is playing an increasingly important role in data management tasks,
particularly in Data Integration and Preparation (DI&P). The success of ML-based …

ADnEV: Cross-domain schema matching using deep similarity matrix adjustment and evaluation

R Shraga, A Gal, H Roitman - Proceedings of the VLDB Endowment, 2020 - dl.acm.org
Schema matching is a process that serves in integrating structured and semi-structured data.
Being a handy tool in multiple contemporary business and commerce applications, it has …

SMAT: An attention-based deep learning solution to the automation of schema matching

J Zhang, B Shin, JD Choi, JC Ho - … , ADBIS 2021, Tartu, Estonia, August 24 …, 2021 - Springer
Schema matching aims to identify the correspondences among attributes of database
schemas. It is frequently considered as the most challenging and decisive stage existing in …

Keeping the data lake in form: proximity mining for pre-filtering schema matching

A Alserafi, A Abelló, O Romero, T Calders - ACM Transactions on …, 2020 - dl.acm.org
Data lakes (DLs) are large repositories of raw datasets from disparate sources. As more
datasets are ingested into a DL, there is an increasing need for efficient techniques to profile …