[图书][B] Data profiling
Data profiling refers to the activity of collecting data about data,{ie}, metadata. Most IT
professionals and researchers who work with data have engaged in data profiling, at least …
professionals and researchers who work with data have engaged in data profiling, at least …
Ontology-based entity matching in attributed graphs
Keys for graphs incorporate the topology and value constraints needed to uniquely identify
entities in a graph. They have been studied to support object identification, knowledge …
entities in a graph. They have been studied to support object identification, knowledge …
Fact checking in knowledge graphs with ontological subgraph patterns
Given a knowledge graph and a fact (a triple statement), fact checking is to decide whether
the fact belongs to the missing part of the graph. Facts in real-world knowledge bases are …
the fact belongs to the missing part of the graph. Facts in real-world knowledge bases are …
Contextual data cleaning with ontology functional dependencies
Functional Dependencies define attribute relationships based on syntactic equality, and
when used in data cleaning, they erroneously label syntactically different but semantically …
when used in data cleaning, they erroneously label syntactically different but semantically …
CurrentClean: Spatio-temporal cleaning of stale data
Data currency is imperative towards achieving up-to-date and accurate data analysis. Data
is considered current if changes in real world entities are reflected in the database. When …
is considered current if changes in real world entities are reflected in the database. When …
Privacy-aware data cleaning-as-a-service
Data cleaning is a pervasive problem for organizations as they try to reap value from their
data. Recent advances in networking and cloud computing technology have fueled a new …
data. Recent advances in networking and cloud computing technology have fueled a new …
RTClean: Context-aware Tabular Data Cleaning using Real-time OFDs
D Del Gaudio, T Schubert… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Nowadays, machine learning plays a key role in developing plenty of applications, eg, smart
homes, smart medical assistance, and autonomous driving. A major challenge of these …
homes, smart medical assistance, and autonomous driving. A major challenge of these …
Discovery of temporal graph functional dependencies
L Noronha, F Chiang - Proceedings of the 30th ACM International …, 2021 - dl.acm.org
Temporal Graph Functional Dependencies (TGFDs) are a class of data quality rules
imposing topological, attribute dependency constraints over a period of time. To make …
imposing topological, attribute dependency constraints over a period of time. To make …
Test case selection and prioritization approach for automated regression testing using ontology and COSMIC measurement
Z Sakhrawi, T Labidi - Automated Software Engineering, 2024 - Springer
Regression testing is an important activity that aims to provide information about the quality
of the software product under test when changes occur. The two primary techniques for …
of the software product under test when changes occur. The two primary techniques for …
Mining Keys for Graphs
M Alipourlangouri, F Chiang - Data & Knowledge Engineering, 2024 - Elsevier
Keys for graphs are a class of data quality rules that use topological and value constraints to
uniquely identify entities in a data graph. They have been studied to support object …
uniquely identify entities in a data graph. They have been studied to support object …