[BOOK][B] Magellan: Toward building entity matching management systems
PV Konda - 2018 - search.proquest.com
Entity matching (EM) identifies data instances that refer to the same real-world entity, such
as (David Smith, UWMadison) and (DM Smith, UWM). This problem has been a long …
Integrating data lake tables
We have made tremendous strides in providing tools for data scientists to discover new
tables useful for their analyses. But despite these advances, the proper integration of …
Data market platforms: Trading data assets to solve data problems
RC Fernandez, P Subramaniam… - arXiv preprint arXiv …, 2020 - arxiv.org
Data only generates value for a few organizations with expertise and resources to make
data shareable, discoverable, and easy to integrate. Sharing data that is easy to discover …
Knowledge-driven data ecosystems toward data transparency
A data ecosystem (DE) offers a keystone-player or alliance-driven infrastructure that enables
the interaction of different stakeholders and the resolution of interoperability issues among …
Metam: Goal-oriented data discovery
S Galhotra, Y Gong… - 2023 IEEE 39th …, 2023 - ieeexplore.ieee.org
Data is a central component of machine learning and causal inference tasks. The availability
of large amounts of data from sources such as open data repositories, data lakes and data …
Medto: Medical data to ontology matching using hybrid graph neural networks
Medical ontologies are widely used to describe and organize medical terminologies and to
support many critical applications on healthcare databases. These ontologies are often …
Sudowoodo: Contrastive self-supervised learning for multi-purpose data integration and preparation
Machine learning (ML) is playing an increasingly important role in data management tasks,
particularly in Data Integration and Preparation (DI&P). The success of ML-based …
Adnev: Cross-domain schema matching using deep similarity matrix adjustment and evaluation
Schema matching is a process that serves in integrating structured and semi-structured data.
Being a handy tool in multiple contemporary business and commerce applications, it has …
SMAT: An attention-based deep learning solution to the automation of schema matching
Schema matching aims to identify the correspondences among attributes of database
schemas. It is frequently considered as the most challenging and decisive stage existing in …
Keeping the data lake in form: proximity mining for pre-filtering schema matching
Data lakes (DLs) are large repositories of raw datasets from disparate sources. As more
datasets are ingested into a DL, there is an increasing need for efficient techniques to profile …